-
Notifications
You must be signed in to change notification settings - Fork 1.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Windows CNI broken after latest EKS image update #9043
Comments
@davidgiga1993 can you confirm if you have Windows patch KB5040437 installed? |
Yes I can confirm. I just hope it's related to the Windows update issue as my error message differs from the ones reported by others |
Do you want to share the behavior observed with your pods in the Windows-Containers issue linked about so that folks at Microsoft are aware of various ways this is affecting the Pod behavior (and that this is widespread) ? |
@davidgiga1993 Please follow these steps and let me know if it resolves the issue with the July or August update installed.
Name : FwPerfImprovementChange
|
I'll try on Monday, however I'm not sure I can actually reboot the machine as the autoscaling group will detect the node as dead and remove/terminate it. |
@JamesKehr you might want to share this in microsoft/Windows-Containers#516 -- there a lot more folks subscribed there with different configurations |
I'm not on AWS/EKS, but self hosted Rancher+rke2. This worked for me |
@avin3sh Done! Thanks for the tip! @ilueckel Thank you for the confirmation! @davidgiga1993 You will likely need to work with AWS support to make that change. That registry value is read when the HNS service starts. You can try, but no guarantees, to set the reg value, stop all the k8s/Calico containers and services, restart the Host Networking Service (HNS) in Windows, and then fire everything back up. Assuming you have that level of control over the node, that might work. Please let me know either way. |
@JamesKehr following your comment with the restart less fix (in the windows container issue) we were able to apply the hotfix on EKS with Calico and the networking works again. (I'm working together with @davidgiga1993) |
@JamesKehr @Argannor @davidgiga1993 thanks for the fix and the updates, closing this now. |
Hi @davidgiga1993 . It seems that you were able to deploy EKS with Linux and Windows Node using Calico Overlay. I'm a bit curious how you achieve this as the documentation is sometimes ambiguous. On my side, Windows pods cannot consume IP addresses from Calico IPAM. Your tip will be much appreciated. |
We're using EKS on AWS with Calcio VXlan.
After updating the node image from
ami-05b4e05d429e7759b (Windows_Server-2022-English-Core-EKS_Optimized-1.29-2024.06.17)
to
ami-0f11d4c28a09d26d2 (Windows_Server-2022-English-Core-EKS_Optimized-1.29-2024.07.10)
it is not possible anymore to reach any IP anymore:
Expected Behavior
The IPs should be reachable
Current Behavior
No IPs are reachable from inside the container, on the node itself (and host containers) network communication works fine.
Possible Solution
Steps to Reproduce (for bugs)
Context
Downgrading the AMI resolves the issue, thus I suspect it's somehow related to the
CVE-2024-5321
as this was (according to amazon) the only change in this image.Maybe related to #9019
Your Environment
The text was updated successfully, but these errors were encountered: