SD card timeout issues in RPi4
Affected images versions
- not relevant (explain why)
- see the table below (list the build id and the apt or ostree deployment of the tested images in the appropriate cells)
Type | Arch | v2021 | v2022 | v2023pre |
---|---|---|---|---|
minimal/fixedfunction | amd64 | |||
minimal/fixedfunction | armhf | |||
minimal/fixedfunction | arm64 | x | ||
target/hmi | amd64 | |||
target/hmi | armhf | |||
target/hmi | arm64 | |||
basesdk | amd64 | |||
sdk | amd64 | |||
nfs | amd64 | |||
nfs | armhf | |||
nfs | arm64 | |||
lxc | amd64 | |||
lxc | armhf | |||
lxc | arm64 | |||
image-builder | ||||
package-source-builder |
To find the build id and the variant type you can:
- derive it from the image name
- for instance, with the
apertis_ostree_v2022pre-fixedfunction-amd64-uefi_20211031.0425.img.gz
image the build id is 20211031.0425, the variant isfixedfunction
the deployment type isostree
- for instance, with the
- obtain it from
/etc/os-release
using theBUILD_ID
andVARIANT_ID
keys
Steps to reproduce
https://lava.collabora.dev/scheduler/job/8073508#L10449
Expected result
The test should pass
Actual result
Test is not able to complete due to timeout:
[ 5.312072] irq 35: nobody cared (try booting with the "irqpoll" option)
[ 5.319140] handlers:
[ 5.321444] [<(____ptrval____)>] sdhci_irq [sdhci] threaded [<(____ptrval____)>] sdhci_thread_irq [sdhci]
[ 5.331198] Disabling IRQ #35
[ 15.331419] mmc0: Timeout waiting for hardware cmd interrupt.
[ 15.337282] mmc0: sdhci: ============ SDHCI REGISTER DUMP ===========
[ 15.343820] mmc0: sdhci: Sys addr: 0x00000000 | Version: 0x00001002
[ 15.350362] mmc0: sdhci: Blk size: 0x00000000 | Blk cnt: 0x00000000
[ 15.356902] mmc0: sdhci: Argument: 0x00000c00 | Trn mode: 0x00000000
[ 15.363441] mmc0: sdhci: Present: 0x1fff0000 | Host ctl: 0x00000001
[ 15.369979] mmc0: sdhci: Power: 0x0000000f | Blk gap: 0x00000080
[ 15.376517] mmc0: sdhci: Wake-up: 0x00000000 | Clock: 0x00007d07
[ 15.383055] mmc0: sdhci: Timeout: 0x00000000 | Int stat: 0x00018000
[ 15.389592] mmc0: sdhci: Int enab: 0x00ff1003 | Sig enab: 0x00ff1003
[ 15.396130] mmc0: sdhci: ACmd stat: 0x00000000 | Slot int: 0x00000001
[ 15.402667] mmc0: sdhci: Caps: 0x45ee6432 | Caps_1: 0x0000a525
[ 15.409205] mmc0: sdhci: Cmd: 0x0000341a | Max curr: 0x00080008
[ 15.415743] mmc0: sdhci: Resp[0]: 0x00000000 | Resp[1]: 0x00000000
[ 15.422280] mmc0: sdhci: Resp[2]: 0x00000000 | Resp[3]: 0x00000000
[ 15.428815] mmc0: sdhci: Host ctl2: 0x00000000
[ 15.433323] mmc0: sdhci: ADMA Err: 0x00000000 | ADMA Ptr: 0x00000000
[ 15.439859] mmc0: sdhci: ============================================
Reproducibility
How often the issue is hit when repeating the test and changing nothing (same device, same image, etc.)?
Put the
- always
-
✅ often, but not always - rarely
Impact of bug
This issue causes the system on RPi4 not to boot.
Root cause
describe in one line what caused the issue to give a hint to product teams whether they may be impacted or not
Outcomes
TBD
Management data
This section is for management only, it should be the last one in the description.
/cc @andrunko @em @sagar @sudarshan @wlozano
Phabricator link: https://phabricator.apertis.org/T9412
Edited by Apertis CI robot