[Bug]: Error: Not connected - is it possible to make connection timeout configurable? #519
Comments
Hey @akramarev, can you post some of the logs before the error? Preferably add What I did see in the past is builds that overload the machine's resources to the point where the ssh server hangs. You can try running
Thanks for the reply @royra. I observe this problem only when I offload the build to the GHA runner (please check the "My setup" section above for details); if I use the default builder (the build happens on the remote Lightsail instance), I don't have this issue. So my suspicion is that while the GHA runner is building the image, preevy doesn't actively use the SSH connection it opened earlier, and the connection has timed out by the time GHA finishes the build and is ready to upload artifacts. During the build, I can ssh to the Lightsail machine and see that it's almost idle. Attaching logs (build details in the middle manually redacted):
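To illustrate the idle-connection hypothesis above: a common mitigation for connections that are dropped while idle is TCP keepalive, which makes the OS send periodic probes on an otherwise silent socket. The following is a minimal sketch only, not preevy's code; the helper name `enable_keepalive` and the interval values are assumptions chosen for illustration, and it would not help if the SSH server itself restarts.

```python
import socket


def enable_keepalive(sock, idle_s=60, interval_s=15, probes=4):
    """Turn on TCP keepalive for sock (hypothetical helper, illustrative values)."""
    # Ask the OS to probe the peer when the connection is idle.
    sock.setsockopt(socket.SOL_SOCKET, socket.SO_KEEPALIVE, 1)
    # The tuning knobs below are platform-specific, so set each one
    # only if this platform exposes it and accepts the value.
    tuning = (
        ("TCP_KEEPIDLE", idle_s),       # idle seconds before the first probe
        ("TCP_KEEPINTVL", interval_s),  # seconds between probes
        ("TCP_KEEPCNT", probes),        # failed probes before the drop
    )
    for name, value in tuning:
        option = getattr(socket, name, None)
        if option is None:
            continue
        try:
            sock.setsockopt(socket.IPPROTO_TCP, option, value)
        except OSError:
            pass  # option not supported here; keep the OS default
```

An SSH client would apply something like this to the underlying socket before starting the session; many SSH libraries also offer their own application-level keepalive setting, which serves the same purpose.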
Sorry, I missed the fact that you're already offloading the build. Can you look at the ssh server logs on the Lightsail instance? If there's nothing interesting, try changing the LogLevel setting at /etc/ssh/sshd_config
Thank you for your reply @royra, and thanks to the livecycle team for keeping this GitHub issue open. I noticed in /var/log/auth.log that the SSH server restarted right at the moment when the preevy action reported that it had successfully configured the new Lightsail machine: At the same time, /var/log/syslog indicates that cloud-init is the process that restarted sshd.
As expected any further attempts to restart the job:
Is there anything you can suggest in this situation?
What happened?
Please compare these two outputs from livecycle/[email protected]:
and
The first one is a cold run of my workflow: the build stage took more than 10 minutes and succeeded, but copying the artifacts failed right after it. The second output is from the same workflow retried: it reused cached images, so the build phase finished in 30s and preevy-up-action had no problem copying the artifacts to the runner.
I suspect that there is a timeout (an SSH connection timeout) of roughly 10 minutes somewhere. Would it be possible to make it configurable for docker-compose stacks that require a longer build phase?
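Besides a configurable timeout, another way to tolerate a connection that dropped during a long build is to re-establish it a few times before giving up. The sketch below shows that idea with exponential backoff. It is not preevy's implementation: `retry_connect`, its parameters, and the use of `ConnectionError` as the failure signal are assumptions made for illustration.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


def retry_connect(
    connect: Callable[[], T],
    attempts: int = 3,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call connect() until it succeeds or the attempts run out.

    Hypothetical helper: waits base_delay, then 2x, 4x, ... between tries.
    """
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    for attempt in range(1, attempts + 1):
        try:
            return connect()
        except ConnectionError:
            if attempt == attempts:
                raise  # out of attempts: surface the last error
            # Exponential backoff before the next attempt.
            sleep(base_delay * 2 ** (attempt - 1))
    raise AssertionError("unreachable")
```

The `sleep` parameter is injectable so the backoff schedule can be checked without actually waiting.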
Add screenshots
please see the previous section with error details
Steps to reproduce the behavior
My setup:
GHA:
Expected behavior
Avoid the "Error: Not connected" error when the build step takes a long time, i.e. either make the timeout configurable or retry the connection a few times.
What OS are you seeing the problem on?
Linux
Additional context
No response
Record