Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

cat-log: list out/err files when available via tailer #6480

Merged
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
1 change: 1 addition & 0 deletions changes.d/6480.fix.md
Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
`cat-log`: List log files which are available via a configured tailer/viewer command.
51 changes: 37 additions & 14 deletions cylc/flow/scripts/cat_log.py
Original file line number Diff line number Diff line change
Expand Up @@ -596,28 +596,51 @@
cmd.append('--prepend-path')
cmd.append(workflow_id)
# TODO: Add Intelligent Host selection to this
proc = None

Check warning on line 599 in cylc/flow/scripts/cat_log.py

View check run for this annotation

Codecov / codecov/patch

cylc/flow/scripts/cat_log.py#L599

Added line #L599 was not covered by tests
with suppress(KeyboardInterrupt):
# (Ctrl-C while tailing)
# NOTE: This will raise NoHostsError if the platform is not
# contactable
remote_cylc_cmd(
proc = remote_cylc_cmd(

Check warning on line 604 in cylc/flow/scripts/cat_log.py

View check run for this annotation

Codecov / codecov/patch

cylc/flow/scripts/cat_log.py#L604

Added line #L604 was not covered by tests
cmd,
platform,
capture_process=False,
capture_process=(mode == 'list-dir'),
manage=(mode == 'tail'),
text=False
text=(mode == 'list-dir'),
)
if (
mode == 'list-dir'
and os.path.exists(
os.path.join(
local_log_dir,
'job-activity.log'
)
)
):
# add the local-only job-activity.log file to the remote-list
print('job-activity.log')

# add and missing items to file listing results
if isinstance(proc, Popen):
# i.e: if mode=='list-dir' and ctrl+c not pressed
out, err = proc.communicate()
MetRonnie marked this conversation as resolved.
Show resolved Hide resolved
Comment on lines +604 to +615
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think we need to include the proc.communicate() in the suppressing of the KeyboardInterrupt

Copy link
Member Author

@oliver-sanders oliver-sanders Feb 12, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is a suppress block, so the code execution will continue on if KeyboardInterrupt is caught leading to an error further on in the code.

As it stands, if you ctrl+c here, you'll get a KeyboardInterrupt traceback:

Traceback (most recent call last):
...
KeyboardInterrupt

But if we do the awaiting inside this block, it just turns into a different error:

Traceback (most recent call last):
...
File "cylc/flow/scripts/cat_log.py", line 607, in _main
AttributeError: 'NoneType' object has no attribute 'splitlines'

I don't think kill signals are very well handled here at present, not really the fault of this PR.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If the whole if isinstance block is included in the suppress block, it avoids traceback on ctrl+c. But not that important I guess

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The traceback is a cosmetic problem, but I think there are pre-existing functional issues here, #6609

files = out.splitlines()

Check warning on line 616 in cylc/flow/scripts/cat_log.py

View check run for this annotation

Codecov / codecov/patch

cylc/flow/scripts/cat_log.py#L615-L616

Added lines #L615 - L616 were not covered by tests

# add files which can be accessed via a tailer
if live_job_id is not None:
if (
# NOTE: only list the file if it can be viewed in
# both modes
(platform['out tailer'] and platform['out viewer'])
and 'job.out' not in files
):
files.append('job.out')

Check warning on line 626 in cylc/flow/scripts/cat_log.py

View check run for this annotation

Codecov / codecov/patch

cylc/flow/scripts/cat_log.py#L626

Added line #L626 was not covered by tests
if (
(platform['err tailer'] and platform['err viewer'])
and 'job.err' not in files
):
files.append('job.err')

Check warning on line 631 in cylc/flow/scripts/cat_log.py

View check run for this annotation

Codecov / codecov/patch

cylc/flow/scripts/cat_log.py#L631

Added line #L631 was not covered by tests

# add the job-activity.log file which is always local
if os.path.exists(
os.path.join(local_log_dir, 'job-activity.log')
):
files.append('job-activity.log')

Check warning on line 637 in cylc/flow/scripts/cat_log.py

View check run for this annotation

Codecov / codecov/patch

cylc/flow/scripts/cat_log.py#L637

Added line #L637 was not covered by tests

files.sort()
print('\n'.join(files))
print(err, file=sys.stderr)
oliver-sanders marked this conversation as resolved.
Show resolved Hide resolved
sys.exit(proc.returncode)

Check warning on line 642 in cylc/flow/scripts/cat_log.py

View check run for this annotation

Codecov / codecov/patch

cylc/flow/scripts/cat_log.py#L639-L642

Added lines #L639 - L642 were not covered by tests

else:
# Local task job or local job log.
logpath = os.path.join(local_log_dir, options.filename)
Expand Down
87 changes: 87 additions & 0 deletions tests/functional/cylc-cat-log/13-remote-out-err-tailer.t
Original file line number Diff line number Diff line change
@@ -0,0 +1,87 @@
#!/usr/bin/env bash
# THIS FILE IS PART OF THE CYLC WORKFLOW ENGINE.
# Copyright (C) NIWA & British Crown (Met Office) & Contributors.
#
# This program is free software: you can redistribute it and/or modify
# it under the terms of the GNU General Public License as published by
# the Free Software Foundation, either version 3 of the License, or
# (at your option) any later version.
#
# This program is distributed in the hope that it will be useful,
# but WITHOUT ANY WARRANTY; without even the implied warranty of
# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
# GNU General Public License for more details.
#
# You should have received a copy of the GNU General Public License
# along with this program. If not, see <http://www.gnu.org/licenses/>.
#-------------------------------------------------------------------------------
# Test "cylc cat-log" with custom out/err tailers
export REQUIRE_PLATFORM='loc:remote runner:background fs:indep comms:tcp'
. "$(dirname "$0")/test_header"
#-------------------------------------------------------------------------------
set_test_number 12
#-------------------------------------------------------------------------------
# run the workflow
TEST_NAME="${TEST_NAME_BASE}-validate"
install_workflow "${TEST_NAME_BASE}" "${TEST_NAME_BASE}"
run_ok "${TEST_NAME}" cylc validate "${WORKFLOW_NAME}"
workflow_run_ok "${TEST_NAME_BASE}-run" cylc play -N "${WORKFLOW_NAME}"
#-------------------------------------------------------------------------------
# change the platform the task ran on to the remote platform
sqlite3 "${HOME}/cylc-run/${WORKFLOW_NAME}/log/db" "
UPDATE
task_jobs
SET
platform_name = '${CYLC_TEST_PLATFORM}',
run_status = null
WHERE
name = 'foo'
AND cycle = '1'
;"
#-------------------------------------------------------------------------------
# test cylc cat-log --mode=list-dir will not list job.out / err
# (no tailer / viewer configured)
create_test_global_config "" "
[platforms]
[[$CYLC_TEST_PLATFORM]]
out tailer =
err tailer =
out viewer =
err viewer =
"
TEST_NAME="${TEST_NAME_BASE}-list-dir-no-tailers"
# NOTE: command will fail due to missing remote directory (this tests remote
# error code is preserved)
run_fail "${TEST_NAME}" cylc cat-log "${WORKFLOW_NAME}//1/foo" -m 'list-dir'
# the job.out and job.err filees
grep_fail "job.out" "${TEST_NAME}.stdout"
grep_fail "job.err" "${TEST_NAME}.stdout"
#-------------------------------------------------------------------------------
# test cylc cat-log --mode=list-dir lists the tailed files
# (both tailer and viewer configured)
create_test_global_config "" "
[platforms]
[[$CYLC_TEST_PLATFORM]]
out tailer = echo OUT
err tailer = echo ERR
out viewer = echo OUT
err viewer = echo ERR
"
# test cylc cat-log --mode=list-dir lists the tailed files
TEST_NAME="${TEST_NAME_BASE}-list-dir-with-tailers"
# NOTE: command will fail due to missing remote directory (this tests remote
# error code is preserved)
run_fail "${TEST_NAME}" cylc cat-log "${WORKFLOW_NAME}//1/foo" -m 'list-dir'
# the job.out and job.err filees
grep_ok "job.out" "${TEST_NAME}.stdout"
grep_ok "job.err" "${TEST_NAME}.stdout"
#-------------------------------------------------------------------------------
# test cylc cat-log runs the custom tailers
TEST_NAME="${TEST_NAME_BASE}-cat-out"
run_ok "${TEST_NAME}" cylc cat-log "${WORKFLOW_NAME}//1/foo" -f o -m t
grep_ok "OUT" "${TEST_NAME}.stdout"
run_ok "${TEST_NAME}" cylc cat-log "${WORKFLOW_NAME}//1/foo" -f e -m t
grep_ok "ERR" "${TEST_NAME}.stdout"
#-------------------------------------------------------------------------------
purge
exit
25 changes: 25 additions & 0 deletions tests/functional/cylc-cat-log/13-remote-out-err-tailer/flow.cylc
Original file line number Diff line number Diff line change
@@ -0,0 +1,25 @@
[scheduler]
[[events]]
abort on stall timeout = True
stall timeout = PT2M

[scheduling]
[[graph]]
R1 = foo

[runtime]
[[foo]]
script = """
# wait for the started message to be received
cylc__job__poll_grep_workflow_log -E 'foo.*running'

# remove the out/err files
rm "${CYLC_TASK_LOG_DIR}/job.out"
rm "${CYLC_TASK_LOG_DIR}/job.err"

# stop the workflow, orphaning this job
cylc stop --now --now "${CYLC_WORKFLOW_ID}" 2>/dev/null >/dev/null

# suppress any subsequent messages
rm "${CYLC_WORKFLOW_RUN_DIR}/.service/contact"
"""
Loading