Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jobs in error are not displayed by oarstat #48

Open
bzizou opened this issue Apr 2, 2024 · 2 comments
Open

Jobs in error are not displayed by oarstat #48

bzizou opened this issue Apr 2, 2024 · 2 comments
Assignees

Comments

@bzizou
Copy link
Contributor

bzizou commented Apr 2, 2024

oar=# select state from jobs where job_id = 1905;   
 state                                              
-------                                             
 Error                                              
root@dahu-oar3:~# oarstat -fj 1905
root@dahu-oar3:~#
@bzizou
Copy link
Contributor Author

bzizou commented Apr 2, 2024

Seems that is not always true, it might only be related to this particular 1905 job

@bzizou
Copy link
Contributor Author

bzizou commented Apr 2, 2024

Sql for this job:

 job_id | array_id | array_index |                                                                 initial_request                                                                 |         job_name          | job_env | job_type | info_type  | state | reservation |           message           | scheduler_info | job_user | project | job_group |                command                 | exit_code | queue_name |  properties   |                launching_directory                | submission_time | start_time | stop_time  | file_id | accounted | notify | assigned_moldable_job | checkpoint | checkpoint_signal |                 stdout_file                  |                 stderr_file                  | resubmit_job_id | suspended 
--------+----------+-------------+-------------------------------------------------------------------------------------------------------------------------------------------------+---------------------------+---------+----------+------------+-------+-------------+-----------------------------+----------------+----------+---------+-----------+----------------------------------------+-----------+------------+---------------+---------------------------------------------------+-----------------+------------+------------+---------+-----------+--------+-----------------------+------------+-------------------+----------------------------------------------+----------------------------------------------+-----------------+-----------
   1905 |     1905 |           1 | -a 1904 -S ./apptainer_multifast_128_procs_v3.oar -n test_apptainer_multinodes -t devel -l /nodes=4/core=32,walltime=00:30:00 --project pr-test | test_apptainer_multinodes |         | PASSIVE  | dahu-oar3: | Error | None        | Job killed by Leon directly |                | arrondeb | test    |           | ./apptainer_multifast_128_procs_v3.oar |           | default    | devel = 'YES' | /home/arrondeb/WORKSPACE/Codes/DNS/multifast_test |      1711471145 | 1711483238 | 1711483238 |         | YES       |        |                     0 |          0 |                12 | OAR.test_apptainer_multinodes.%jobid%.stdout | OAR.test_apptainer_multinodes.%jobid%.stderr |               0 | NO
(1 row)

@bzizou bzizou self-assigned this Nov 26, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant