Fix dhcpcd and virtual interface handling for native containers #4052

Merged
merged 2 commits into lf-edge:master on Jul 11, 2024

Conversation

@rene (Contributor) commented Jul 3, 2024

This PR fixes an issue like the one shown in the log below, observed while deploying native containers and in some Eden tests:

ERROR: App BM-1 uuid 98c7224d-07bb-42a0-99fd-8afcd0d0e808 state HALTED error: failed to create shim task: OCI runtime create failed: runc create failed: unable to start container process: error during container init: error mounting "/run/tasks/vifs/98c7224d-07bb-42a0-99fd-8afcd0d0e808.7.1/etc/resolv.conf.new" to rootfs at "/etc/resolv.conf": stat /run/tasks/vifs/98c7224d-07bb-42a0-99fd-8afcd0d0e808.7.1/etc/resolv.conf.new: no such file or directory: unknown

Two commits are provided to solve the issue (descriptions below):

Implement retry mechanism for veth.sh

When deploying native containers, during the setup of the virtual network interfaces, the bridge device might not be ready, leading to errors like the following:

"brctl: bridge bn1: Resource busy - Cannot find device nbu1x1.1"

This commit provides a workaround by implementing a retry mechanism, so in case of an error the script retries the operation after 5 seconds, at most 3 times, before failing.
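
For illustration, a minimal sketch of such a retry wrapper (the helper name and the example brctl invocation are assumptions, not the actual veth.sh code; the real loop is excerpted further down in the review):

    #!/bin/sh
    # Run the given command, retrying up to 3 times with a 5 second pause
    # between attempts; give up with a non-zero exit code if it never succeeds.
    retry() {
        retries=3
        while [ "$retries" -gt 0 ]; do
            if "$@"; then
                return 0
            fi
            retries=$(( retries - 1 ))
            sleep 5
        done
        echo "retry: giving up on: $*" >&2
        return 1
    }

    # Example: attaching the virtual interface to the bridge, which can
    # transiently fail with "Resource busy" right after the bridge is created.
    retry brctl addif bn1 nbu1x1.1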

Do not remove directory for dhcpcd

The dhcpcd.sh script creates the /run/task/vifs/<APP_UUID>/ directory on an up command (configured as a Prestart hook in the container OCI interface) and removes it entirely on a down command (configured as Poststop). However, the etc/resolv.conf.new file is created by pillar and should be mounted inside the container. If any issue arises while setting up the bridge + virtual network interface during container initialization, pillar will retry starting the container, but at that point etc/resolv.conf.new will no longer be available, since it was removed along with the rest of the directory contents created by the dhcpcd.sh script.

This commit solves the issue by not removing the entire directory, but only the resolv.conf file that is created during the setup. Pillar already handles the creation and removal of this directory, so no changes are required on its side and the directory will be removed when it is no longer needed.
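
For illustration, a minimal sketch of the changed "down" cleanup (the exact path layout and variable name are assumptions based on the description above, not the actual dhcpcd.sh code):

    # Previously the whole per-app directory was removed on "down",
    # roughly: rm -rf "/run/task/vifs/${APP_UUID}", which also deleted the
    # etc/resolv.conf.new file that pillar creates and mounts into the container.
    # Now only the resolv.conf generated by this script during setup is removed;
    # pillar creates and removes the directory itself once the app is gone.
    rm -f "/run/task/vifs/${APP_UUID}/etc/resolv.conf"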

cc: @milan-zededa

@rene requested a review from eriknordmark as a code owner on July 3, 2024 13:17
@rene requested a review from milan-zededa on July 3, 2024 13:17
@rene marked this pull request as draft on July 5, 2024 12:03
@rene force-pushed the fix-dhcpcd-native-containers branch from 2e7cfaf to 92ec5af on July 5, 2024 12:37
@rene marked this pull request as ready for review on July 5, 2024 12:40
@rene (Contributor, Author) commented Jul 5, 2024

Updates in this PR:

  • Rebased on top of master
  • Increased the timeout from 3s to 5s
  • Slightly changed the fix: we can keep everything inside the same dhcpcd folder (as before). Since pillar already handles the removal of the folder, we just need to remove the created file in the dhcpcd.sh script instead of the whole folder. This solution was tested with QEMU (where the issue was observed) and it works.

@deitch (Contributor) commented Jul 7, 2024

This commit provides a workaround by implementing a retry mechanism, so in case of an error the script retries the operation after 5 seconds, at most 3 times, before failing.

So it still would fail after 3 retries, but this gives it a little more time to get there? Is there any way we can do this with a "wait for it", i.e. check the status of the underlying resource, rather than arbitrary retries?

Then again, I guess that doesn't buy us anything. If in the end we decide that we will wait up to 15 seconds, then either way we will wait 15 seconds and then fail. So then what you have here is just as good. 👍

by simply not removing the entire directory

Is there a circumstance where we would want to remove it? I would think in a normal container removal we want it gone? Or is it that we are conflating container down with container delete?

@milan-zededa (Contributor) commented:

So it still would fail after 3 retries, but this gives it a little more time to get there? Is there any way we can do this with a "wait for it", i.e. check the status of the underlying resource, rather than arbitrary retries?

@deitch We have the same issue in pillar as well, for configuring VLANs on a bridge. For some reason, the bridge is not immediately available after being created (some async ops apparently continue) and returns EBUSY if we try to use it too soon.
I couldn't find anything in the netlink API that would tell us whether the bridge is ready, other than trying to attach an interface to it or configuring a VLAN.

while [ "$RETRIES" -gt 0 ]; do
if ! "$@"; then
RETRIES="$(( RETRIES - 1 ))"
sleep 5
Copy link
Contributor

@milan-zededa milan-zededa Jul 8, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think 5 seconds is a bit too long. What about retrying every second, 15 times, to avoid delaying the boot of a native container too much?

@rene (Contributor, Author) replied:

Yeah, makes sense... I will change it...

@rene (Contributor, Author) commented Jul 8, 2024

So it still would fail after 3 retries, but this gives it a little more time to get there? Is there any way we can do this with a "wait for it", i.e. check the status of the underlying resource, rather than arbitrary retries?

@deitch We have the same issue in pillar as well, for configuring VLANs on a bridge. For some reason, the bridge is not immediately available after being created (some async ops apparently continue) and returns EBUSY if we try to use it too soon. I couldn't find anything in the netlink API that would tell us whether the bridge is ready, other than trying to attach an interface to it or configuring a VLAN.

@deitch, answering your second question: the removal of the directory is already done by pillar, so we don't need to do it from the dhcpcd.sh script. Once the container is gone, pillar will take care of removing the whole vifs directory...

@rene force-pushed the fix-dhcpcd-native-containers branch 2 times, most recently from 316ce4a to 8538a43 on July 8, 2024 10:03
@rene (Contributor, Author) commented Jul 8, 2024

Updates in this PR:

  • Rebased on top of master
  • Addressed reviewers' comments

@eriknordmark (Contributor) left a comment:

LGTM

@milan-zededa (Contributor) commented:

@europaul We finally caught that OCI error:

eclient2	lfedge/eden-eclient:b1c1de6	292b6beb-1c1a-4d76-a330-87f0773cef83	-		-		0 B/0 B		IN_CONFIG	INSTALLED: [description:"setting up OCI spec for domain 292b6beb-1c1a-4d76-a330-87f0773cef83.1.2 failed unexpected end of JSON input"  timestamp:{seconds:1720687417  nanos:909445634}  severity:SEVERITY_ERROR]

Here should be your extra logs: https://github.com/lf-edge/eve/actions/runs/9864092485/artifacts/1690177256

rene added 2 commits July 11, 2024 12:25
Implement retry mechanism for veth.sh

When deploying native containers, during the setup of the virtual
network interfaces, the bridge device might not be ready, leading to
errors like the following:

"brctl: bridge bn1: Resource busy - Cannot find device nbu1x1.1"

This commit provides a workaround by implementing a retry mechanism, so
in case of an error the script retries the operation after 5 seconds,
at most 3 times, before failing.

Signed-off-by: Renê de Souza Pinto <rene@renesp.com.br>
Do not remove directory for dhcpcd

The dhcpcd.sh script creates the /run/task/vifs/<APP_UUID>/ directory on
an up command (configured as a Prestart hook in the container OCI
interface) and removes it entirely on a down command (configured as
Poststop). However, the etc/resolv.conf.new file is created by pillar and
should be mounted inside the container. If any issue arises while setting
up the bridge + virtual network interface during container initialization,
pillar will retry starting the container, but at that point
etc/resolv.conf.new will no longer be available, since it was removed
along with the rest of the directory contents created by the dhcpcd.sh
script.

This commit solves the issue by not removing the entire directory, but
only the resolv.conf file that is created during the setup. Pillar
already handles the creation and removal of this directory, so no changes
are required on its side and the directory will be removed when it is no
longer needed.

Signed-off-by: Renê de Souza Pinto <rene@renesp.com.br>
@rene force-pushed the fix-dhcpcd-native-containers branch from 8538a43 to 2a0a655 on July 11, 2024 10:25
@rene (Contributor, Author) commented Jul 11, 2024

Updates in this PR:

  • Rebased onto master

@milan-zededa merged commit bafe5b7 into lf-edge:master on Jul 11, 2024
19 checks passed
@europaul (Contributor) commented:

@milan-zededa I got the bug narrowed down to an empty image-config.json file in the volume root directory, but I still don't know why it becomes empty. In the current case it happens after a reboot: before the reboot the same app runs fine, meaning the config file should contain information. I added more logs in #4088; I don't know if they are too much for the prod version, however.

@milan-zededa (Contributor) commented:

@milan-zededa I got the bug narrowed down to an empty image-config.json file in the volume root directory, but I still don't know why it becomes empty. In the current case it happens after a reboot: before the reboot the same app runs fine, meaning the config file should contain information. I added more logs in #4088; I don't know if they are too much for the prod version, however.

Is this config file persisted or recreated after reboot?

@europaul (Contributor) commented:

Is this config file persisted or recreated after reboot?

It should be persistent, and touched/created only when the container volume is created.

@europaul (Contributor) commented:

@rene observed the same error when testing with native containers locally. In his case it wasn't after a reboot, but during the initial deployment, if I understand correctly.

@milan-zededa (Contributor) commented:

Is this config file persisted or recreated after reboot?

It should be persistent, and touched/created only when the container volume is created.

Ah, OK. I was wondering if we are missing a sync call after writing into the config file and before publishing the volume info stating that it is ready (i.e. a race between volumemgr and domainmgr).

@europaul (Contributor) commented:

@milan-zededa we sync the directory after the file is written and before we publish anything:

    if err := utils.DirSync(fileLocation); err != nil {
