Add workflow for running models ops test generated by model analysis … #1316

chandrasekaranpradeep · 2025-02-25T09:10:05Z

Created workflow for running model ops test generated(#1234) by model analysis pipeline.

vmilosevic

Don't forget to remove push trigger :)

.github/workflows/on-nightly-models-ops.yml

chandrasekaranpradeep · 2025-02-25T10:08:45Z

Don't forget to remove push trigger :)

Sure @vmilosevic will remove it

ashokkumarkannan1

LGTM!

github-actions · 2025-02-25T10:53:22Z

	Tests	Passed ✅	Skipped ⚠️	Failed
TT-Forge-FE Tests	604 ran	478 passed	126 skipped	0 failed

Test	Result
No test annotations available

github-actions · 2025-02-25T10:54:44Z

	Tests	Passed ✅	Skipped ⚠️	Failed
TT-Forge-FE Tests	662 ran	525 passed	137 skipped	0 failed

Test	Result
No test annotations available

github-actions · 2025-02-25T11:00:29Z

	Tests	Passed ✅	Skipped ⚠️	Failed
TT-Forge-FE Tests	604 ran	478 passed	126 skipped	0 failed

Test	Result
No test annotations available

github-actions · 2025-02-25T11:18:48Z

	Tests	Passed ✅	Skipped ⚠️	Failed
TT-Forge-FE Tests	662 ran	525 passed	137 skipped	0 failed

Test	Result
No test annotations available

…pipeline

github-actions · 2025-02-25T13:57:20Z

	Tests	Passed ✅	Skipped ⚠️	Failed
TT-Forge-FE Tests	670 ran	536 passed	134 skipped	0 failed

Test	Result
No test annotations available

github-actions · 2025-02-25T14:02:31Z

	Tests	Passed ✅	Skipped ⚠️	Failed
TT-Forge-FE Tests	611 ran	482 passed	129 skipped	0 failed

Test	Result
No test annotations available

github-actions · 2025-02-25T14:02:54Z

	Tests	Passed ✅	Skipped ⚠️	Failed
TT-Forge-FE Tests	611 ran	482 passed	129 skipped	0 failed

Test	Result
No test annotations available

github-actions · 2025-02-25T14:18:52Z

	Tests	Passed ✅	Skipped ⚠️	Failed
TT-Forge-FE Tests	670 ran	536 passed	134 skipped	0 failed

Test	Result
No test annotations available

…est failure updation script (#1234) ### Summary: 1. Generated models ops tests by extracting the unique ops configurations across all the pytorch models present inside the forge/test/models directory path. 2. Skipped avgpool1d, avgpool3d, conv2d, pad and reshape ops test cases due to segmentation fault in CI and transpose ops test cases due to Insufficient host DRAM (requires 30 GB of memory) 3. Reimplemented the model ops test failure updation script: The `scripts/model_analysis/models_ops_test_failure_update.py` script automates the extraction of detailed test **failure information** from pytest logs by identifying test cases along with their statuses (**FAILED, XFAIL, or SKIP**) and corresponding **error messages**, then compiles this data into a comprehensive **Excel report** with columns for **test cases**, **marker statuses**, and **reasons**. This report enables users to manually refine error messages by updating the "Reason" column, and once revised, the script can be rerun using the `--use_report` option with the `report file path` to automatically update the model ops test files with marker and more descriptive failure messages 4. The adv_index ops test are failing with indexerror out of bound error(i.e `IndexError: index 563 is out of bounds for dimension 0 with size 448`) in evaluation. The adv_index ops takes two inputs one is reference tensor and other is indicies tensor based upon the indicies tensor values will pick the reference tensor in respective dim but the indicies values exceeds the range of reference tensor. To resolve these issue, added pytest param max_int which will be used by the create_from_shape method in forge/forge/tensor.py to generate the indices tensor values with the range of the reference tensor ### Process: 1. Triggered the nightly models ops pipeline for collecting the pytest logs **Pipeline:** https://github.com/tenstorrent/tt-forge-fe/actions/runs/13536357567 **Collected logs:** [test-log-n150-1](https://github.com/tenstorrent/tt-forge-fe/actions/runs/13536357567/artifacts/2653622726) [test-log-n150-2](https://github.com/tenstorrent/tt-forge-fe/actions/runs/13536357567/artifacts/2653622477) [test-log-n150-3](https://github.com/tenstorrent/tt-forge-fe/actions/runs/13536357567/artifacts/2653619334) [test-log-n150-4](https://github.com/tenstorrent/tt-forge-fe/actions/runs/13536357567/artifacts/2653707424) 2. Using the collected pytest logs, generated excel report by using `scripts/model_analysis/models_ops_test_failure_update.py` script **Command:** `python scripts/model_analysis/models_ops_test_failure_update.py --log_files ci_logs/pytest_1.log ci_logs/pytest_2.log ci_logs/pytest_3.log ci_logs/pytest_4.log` **Generated models ops test report:** [model_ops_tests_report.xlsx](https://github.com/user-attachments/files/18982115/model_ops_tests_report.xlsx) 3. Anlalyzed the excel report and update the models ops tests with xfail marker and failure reason by using the `scripts/model_analysis/models_ops_test_failure_update.py` script with `use_report` option enabled **Command:** `python scripts/model_analysis/models_ops_test_failure_update.py --report_file_path model_ops_tests_report.xlsx --use_report` 4. Triggered the nightly models ops pipeline and all the test are passed. **Pipeline:** https://github.com/tenstorrent/tt-forge-fe/actions/runs/13540060061 ### Note: This models ops test takes around 2hrs to run in push pipeline, so created a separate workflow to run this model ops test- #1316. So renamed the model ops test marker from `push` to `nightly_models_ops` --------- Co-authored-by: chandrasekaranpradeep <[email protected]> Co-authored-by: chandrasekaranpradeep <[email protected]>

chandrasekaranpradeep requested review from vmilosevic, nvukobratTT, pilkicTT and dgolubovicTT as code owners February 25, 2025 09:10

vmilosevic approved these changes Feb 25, 2025

View reviewed changes

chandrasekaranpradeep force-pushed the pchandrasekaran/models_ops_workflow branch from 656f30b to 57ed837 Compare February 25, 2025 09:37

nvukobratTT reviewed Feb 25, 2025

View reviewed changes

.github/workflows/on-nightly-models-ops.yml Show resolved Hide resolved

.github/workflows/on-nightly-models-ops.yml Outdated Show resolved Hide resolved

chandrasekaranpradeep force-pushed the pchandrasekaran/models_ops_workflow branch from 57ed837 to 3c8bcda Compare February 25, 2025 10:14

chandrasekaranpradeep requested a review from nvukobratTT February 25, 2025 10:21

ashokkumarkannan1 approved these changes Feb 25, 2025

View reviewed changes

nvukobratTT approved these changes Feb 25, 2025

View reviewed changes

chandrasekaranpradeep force-pushed the pchandrasekaran/models_ops_workflow branch from 3c8bcda to a7ec886 Compare February 25, 2025 13:04

Add workflow for running models ops test generated by model analysis …

e6eebcd

…pipeline

chandrasekaranpradeep force-pushed the pchandrasekaran/models_ops_workflow branch from a7ec886 to e6eebcd Compare February 25, 2025 13:05

chandrasekaranpradeep merged commit e251bc4 into main Feb 25, 2025
10 checks passed

chandrasekaranpradeep deleted the pchandrasekaran/models_ops_workflow branch February 25, 2025 14:19

chandrasekaranpradeep mentioned this pull request Feb 26, 2025

Generate and update models ops tests and reimplement the models ops test failure updation script #1234

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add workflow for running models ops test generated by model analysis … #1316

Add workflow for running models ops test generated by model analysis … #1316

chandrasekaranpradeep commented Feb 25, 2025

vmilosevic left a comment

chandrasekaranpradeep commented Feb 25, 2025 •

edited

Loading

ashokkumarkannan1 left a comment

github-actions bot commented Feb 25, 2025

github-actions bot commented Feb 25, 2025

github-actions bot commented Feb 25, 2025

github-actions bot commented Feb 25, 2025

github-actions bot commented Feb 25, 2025

github-actions bot commented Feb 25, 2025

github-actions bot commented Feb 25, 2025

github-actions bot commented Feb 25, 2025

Add workflow for running models ops test generated by model analysis … #1316

Add workflow for running models ops test generated by model analysis … #1316

Conversation

chandrasekaranpradeep commented Feb 25, 2025

vmilosevic left a comment

Choose a reason for hiding this comment

chandrasekaranpradeep commented Feb 25, 2025 • edited Loading

ashokkumarkannan1 left a comment

Choose a reason for hiding this comment

github-actions bot commented Feb 25, 2025

github-actions bot commented Feb 25, 2025

github-actions bot commented Feb 25, 2025

github-actions bot commented Feb 25, 2025

github-actions bot commented Feb 25, 2025

github-actions bot commented Feb 25, 2025

github-actions bot commented Feb 25, 2025

github-actions bot commented Feb 25, 2025

chandrasekaranpradeep commented Feb 25, 2025 •

edited

Loading