Since AUM is a critical component, it is heavily tested and its testing involves a lot of interactions, it is nice to have a task to have a high level view of the failures.
Management data
This section is for management only, it should be the last one in the description.
aum-offline-upgrade-branch allv2022pre not available -> https://gitlab.apertis.org/infrastructure/apertis-issues/-/issues/418aum-offline-upgrade bcm2711-rpi-4-b-cbg-0bootloader-interrupt timed out after 895 seconds -> sd card issue in bcm2711-rpi-4-b-cbg-0?https://lava.collabora.dev/scheduler/job/11239653aum-out-of-space bcm2711-rpi-4-b-cbg-0bootloader-interrupt timed out after 895 seconds -> sd card issue in bcm2711-rpi-4-b-cbg-0?https://lava.collabora.dev/scheduler/job/11239665apparmor* imx6q-sabrelite-lava-cbg-1dd timeout -> sd card error on imx6q-sabrelite-lava-cbg-1 https://lava.collabora.dev/scheduler/job/11241106
v2023
aum-ota-out-of-space r8a7796-m3ulcb-cbg-1ext4 fshttps://lava.collabora.dev/scheduler/job/11241388aum-offline-upgrade bcm2711-rpi-4-b-cbg-0bootloader-interrupt timed out after 895 seconds -> sd card issue in bcm2711-rpi-4-b-cbg-0?https://lava.collabora.dev/scheduler/job/11241164aum-offline-upgrade-branch aaeon-UPN-EHLX4RE-A10-0864-cbg-0login timeout -> https://gitlab.apertis.org/infrastructure/apertis-issues/-/issues/339https://lava.collabora.dev/scheduler/job/11241106aum-ota-api aaeon-UPN-EHLX4RE-A10-0864-cbg-2login timeout -> https://gitlab.apertis.org/infrastructure/apertis-issues/-/issues/339https://lava.collabora.dev/scheduler/job/11241145aum-ota-out-of-space aaeon-UPN-EHLX4RE-A10-0864-cbg-1login timeout -> https://gitlab.apertis.org/infrastructure/apertis-issues/-/issues/339https://lava.collabora.dev/scheduler/job/11241147
v2024dev3
aum-ota-signed all v2024dev3need to investigateaum-offline-upgrade imx6q-sabrelite-lava-cbg-1 dd timeout -> sd card error on imx6q-sabrelite-lava-cbg-1 https://lava.collabora.dev/scheduler/job/11239153aum-offline-upgrade-branch bcm2711-rpi-4-b-cbg-2booting errorhttps://lava.collabora.dev/scheduler/job/11239350aum-rollback-blacklist bcm2711-rpi-4-b-cbg-0bootloader-interrupt timed out after 895 seconds -> sd card issue in bcm2711-rpi-4-b-cbg-0?https://lava.collabora.dev/scheduler/job/11239358aum-offline-upgrade-branch aaeon-UPN-EHLX4RE-A10-0864-cbg-2 login timeout -> https://gitlab.apertis.org/infrastructure/apertis-issues/-/issues/339https://lava.collabora.dev/scheduler/job/11239308
After investigating a bit more the failures in imx6q-sabrelite-lava-cbg-4 logs like this were found
Couldn't find partition mmc 0:1Error selecting deviceFile System is consistentfile found, deletingFile System is consistent
The first two entries seem to point to the fact that u-boot is unable to read the mmc and later it stores a bootcount without any issues (no error log). This causes bootcount to be reset and making ostree tests to fail. I guess this could be caused by a very slow mmc and this u-boot version does not handle it properly?
During the regular check it was noted that aum-offline-upgrade-branch #364 (closed) always fails on RPi4 for v2022. This issue is probably related to the fact the we used v2021 as base release where support for RPi4 was no completed. For this reason and for since EOL v2022 is near this issue won't be fixed.
The error shown is
Oct 17 02:03:05 apertis apertis-update-[1059]: Ostree upgrade failed: Installing kernel: regfile copy: No space left on device
Another interesting pattern is that out-of-space tests tend to fail in amd64. Somehow the upgrade applies even if the space is limited. This needs to be investigated.