Extract Full Model Execution Tests into Nightly Tests #379

jameszianxuTT · 2025-02-27T21:41:50Z

Ticket

Problem description

Full model execution tests currently run only in push- and PR-triggered
workflows, not in nightlies. This complicates queries, and there is no
single source of truth for the actual full model execution status of
tt-torch main.

What's changed

Duplicate the workflow in run-tests.yml into a self-contained
job that can be run in the nightly tests. The execution tests are
refactored to run in parallel.

Checklist

New/Existing tests provide coverage for changes


jameszianxuTT · 2025-02-27T21:46:54Z

Two Additional Notes

I manually ran the Nightly Tests workflow on a subset of the models to confirm that this change doesn't break nightlies.
See: https://github.com/tenstorrent/tt-torch/actions/runs/13575338428. This works.
I am disabling the torchvision execution tests for nightlies only, since they would add a significant amount of time to the nightly run. The other tests average ~5m each, with the longest at ~10m, and they run in parallel. The torchvision tests collectively take a bit more than an hour to run.
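
To make the trade-off concrete, here is a rough sketch of how wall-clock time behaves when jobs run in parallel and a long-running group is added. The job names and per-job durations are made-up placeholders, chosen only to be consistent with the approximate figures above (~5m average, ~10m longest, a bit over an hour for torchvision). They are not CI measurements. The sketch also assumes the torchvision tests run as a single job alongside the others and that enough runners are available.

```python
# Hypothetical per-job runtimes in minutes; placeholders, not CI measurements.
parallel_jobs = {"model_a": 3, "model_b": 5, "model_c": 10, "model_d": 2}
torchvision_minutes = 65  # assumed stand-in for "a bit more than an hour"


def wall_clock_minutes(durations):
    """Jobs that run in parallel finish when the slowest one finishes."""
    return max(durations)


without_torchvision = wall_clock_minutes(parallel_jobs.values())
with_torchvision = wall_clock_minutes([*parallel_jobs.values(), torchvision_minutes])

print(f"without torchvision: ~{without_torchvision}m")
print(f"with torchvision:    ~{with_torchvision}m")
```

Under these assumptions the slowest job sets the duration of the whole stage, so a single hour-long job dominates the nightly run even when everything else finishes in about ten minutes.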

jameszianxuTT · 2025-02-27T21:51:16Z

A later PR should refactor run-tests.yml to run its full model execution tests in parallel as well.

They should probably also be kept separate from the nightly tests, since the number of models we want to execute e2e should grow and not all of them need to be included in the on-PR / on-push checks.

codecov-commenter · 2025-02-27T21:57:14Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 75.81%. Comparing base (186ad28) to head (99e76fc).

✅ All tests successful. No failed tests found.

Additional details and impacted files

@@           Coverage Diff           @@
##             main     #379   +/-   ##
=======================================
  Coverage   75.81%   75.81%           
=======================================
  Files           8        8           
  Lines        1199     1199           
=======================================
  Hits          909      909           
  Misses        290      290

github-actions · 2025-02-27T23:40:01Z

| | Tests | Passed ✅ | Skipped ⚠️ | Failed |
|---|---|---|---|---|
| TT-Torch Tests | 435 ran | 428 passed | 7 skipped | 0 failed |

Test	Result
No test annotations available

github-actions · 2025-02-28T14:58:32Z

| | Tests | Passed ✅ | Skipped ⚠️ | Failed |
|---|---|---|---|---|
| TT-Torch Tests | 435 ran | 428 passed | 7 skipped | 0 failed |

Test	Result
No test annotations available

jameszianxuTT · 2025-02-28T19:50:38Z

The most recent verification run is here: https://github.com/tenstorrent/tt-torch/actions/runs/13589631720

Here, I reran the full nightly tests and stopped the run once I was sure that everything had been parsed correctly and could run, to avoid locking up CI.

The full execution test job I added does pass in this context.

brataTT · 2025-02-28T19:53:15Z

.github/workflows/run-full-model-execution-tests.yml

+          },
+          # {
+          #   runs-on: wormhole_b0, name: "torchvision_image_classification", tests: "
+          #         tests/models/torchvision/test_torchvision_image_classification.py::test_torchvision_image_classification[full-eval-mobilenet_v2]


Why are the torchvision tests commented out?

The torchvision tests take over an hour to run, and I don't want to extend nightly testing too much. All the other tests take <10m and are run in parallel.

I am not removing functionality from the onPR and onPush workflows, just duplicating a subset of their functionality to add more test coverage to the nightly tests. (Nightly tests do not currently have any full model execution tests).

The list of execution tests run in nightly will likely change anyways, and I don't know which ones are the most important. I've included most of the ones run in the onPR/onPush workflows as a proof-of-concept demonstration that this added CI job does work, with the assumption that this list of tests will likely be changed once Aleks gets back.


jameszianxuTT added 7 commits February 27, 2025 20:11

Get more data from e2e test failures

31c9381

Refactor On PR full model execution tests into a separate file

8e0826b

Revert "Get more data from e2e test failures"

6f235a1

This reverts commit b9de42e.

Enable all nightlies

a6b78af

Disable test subset for CI run abbreviation

7691278

Revert "Disable test subset for CI run abbreviation"

81e9903

This reverts commit 7691278. (used for quick testing of nightly tests CI)

Remove the long-running torchvision tests to avoid extending nightly execution time by too much

0e8b8a2

jameszianxuTT marked this pull request as draft February 27, 2025 22:01

jameszianxuTT changed the title ~~Jameszianxu/full model exec test extract~~ Extract Full Model Execution Tests into Nightly Tests Feb 27, 2025

Fix testname duplication issue for fetch_jobid

beccd6b

Test rename to avoid name substring match issue with job_id

99e76fc
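
The two commits above concern job names that collide when matched by substring. The actual fetch_jobid logic is not shown in this conversation, so the following is only a hypothetical sketch of how a substring-based lookup can return the wrong job, and why renaming a test (or matching exactly) avoids it. The job names, IDs, and both helper functions are invented for illustration.

```python
# Hypothetical job table; names and IDs are invented for illustration.
jobs = [
    {"id": 101, "name": "full-model-tests-extended"},
    {"id": 102, "name": "full-model-tests"},
]


def find_job_id_substring(jobs, wanted):
    """Return the ID of the first job whose name contains `wanted`."""
    for job in jobs:
        if wanted in job["name"]:
            return job["id"]
    return None


def find_job_id_exact(jobs, wanted):
    """Return the ID of the job whose name equals `wanted`."""
    for job in jobs:
        if job["name"] == wanted:
            return job["id"]
    return None


# "full-model-tests" is a substring of "full-model-tests-extended",
# so the substring lookup stops at the wrong job.
print(find_job_id_substring(jobs, "full-model-tests"))
print(find_job_id_exact(jobs, "full-model-tests"))
```

Giving each job a name that is not contained in any other job's name makes the substring lookup unambiguous without changing the lookup itself, which may be why the fix here was a rename.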

jameszianxuTT marked this pull request as ready for review February 28, 2025 19:36

jameszianxuTT requested review from mmanzoorTT, ddilbazTT and brataTT February 28, 2025 19:52

brataTT reviewed Feb 28, 2025

brataTT approved these changes Feb 28, 2025

jameszianxuTT merged commit 5fbaf8d into main Feb 28, 2025
23 of 53 checks passed

jameszianxuTT deleted the jameszianxu/full_model_exec_test_extract branch February 28, 2025 23:37
