Skip to content

response error when submitting and waiting on batch jobs over SSH #3156

Answered by SteVwonder
SteVwonder asked this question in Q&A
Discussion options

You must be logged in to vote

Thanks @grondo for the debugging suggestions! I instrumented that error location and went to reproduce when I realized that I was launching flux with:

srun --mpi=none --mpibind=off --pty flux start bash -c 'echo ssh://$(hostname)/$(flux getattr rundir)/0/local > flux.info; zsh'

When it should have been:

srun --mpi=none --mpibind=off --pty /collab/usr/global/tools/flux/toss_3_x86_64_ib/flux-c0.18.0-s0.10.0/bin/flux start bash -c 'echo ssh://$(hostname)/$(flux getattr rundir)/0/local > flux.info; zsh'

The response error went away as did the job-shell error. So this was a total PEBCAK, and one of the pitfalls of using an SSH proxy.

I thought we had an open issue about some sort of version …

Replies: 2 comments

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Answer selected by grondo
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants