Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[PSI] The process fails if the service stops #91

Closed
hakavlad opened this issue May 3, 2020 · 2 comments
Closed

[PSI] The process fails if the service stops #91

hakavlad opened this issue May 3, 2020 · 2 comments
Labels
bug Something isn't working

Comments

@hakavlad
Copy link
Owner

hakavlad commented May 3, 2020

май 03 15:46:06 PC nohang[1972]: PSI:    0.0 | MemAvail: 8917 M, 91.1 % | SwapFree: 9469 M,  96.7 % | dMem:     0 M/s
май 03 15:46:10 PC nohang[1972]: PSI:    0.0 | MemAvail: 8917 M, 91.1 % | SwapFree: 9469 M,  96.7 % | dMem:     0 M/s
май 03 15:46:13 PC nohang[1972]: PSI:    0.0 | MemAvail: 8909 M, 91.0 % | SwapFree: 9469 M,  96.7 % | dMem:    -3 M/s
май 03 15:46:16 PC nohang[1972]: Traceback (most recent call last):
май 03 15:46:16 PC nohang[1972]:   File "/usr/bin/nohang", line 3834, in <module>
май 03 15:46:16 PC nohang[1972]:     psi_avg_value = find_psi_metrics_value(psi_path, psi_metrics)
май 03 15:46:16 PC nohang[1972]:   File "/usr/bin/nohang", line 1312, in find_psi_metrics_value
май 03 15:46:16 PC nohang[1972]:     return float(rline1(psi_path).split(' ')[1].split('=')[1])
май 03 15:46:16 PC nohang[1972]:   File "/usr/bin/nohang", line 1508, in rline1
май 03 15:46:16 PC nohang[1972]:     with open(path) as f:
май 03 15:46:16 PC nohang[1972]: FileNotFoundError: [Errno 2] No such file or directory: '/sys/fs/cgroup/system.slice/fail2ban.service/memory.pressure'
май 03 15:46:16 PC systemd[1]: nohang.service: Main process exited, code=exited, status=1/FAILURE
май 03 15:46:16 PC systemd[1]: nohang.service: Unit entered failed state.
май 03 15:46:16 PC systemd[1]: nohang.service: Failed with result 'exit-code'.
май 03 15:46:16 PC systemd[1]: nohang.service: Service has no hold-off time, scheduling restart.
@hakavlad
Copy link
Owner Author

hakavlad commented May 3, 2020

It may happen with non-system-wide psi_path like

psi_path = /sys/fs/cgroup/system.slice/fail2ban.service/memory.pressure

If the service stops, the PSI file of which we are monitoring, then nohang fails.

@hakavlad hakavlad added the bug Something isn't working label May 3, 2020
hakavlad added a commit that referenced this issue May 3, 2020
@hakavlad
Copy link
Owner Author

hakavlad commented May 3, 2020

01f17c4

@hakavlad hakavlad closed this as completed May 3, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

1 participant