[RFC] Run GPU Tests on ROCm in Addition to CUDA #47260

Open
iarspider opened this issue Feb 4, 2025 · 7 comments

Comments

@iarspider
Contributor

We would like to change how GPU-accelerated tests are executed.

Currently, GPU tests (unit tests, relvals, and baseline comparisons) run only on NVIDIA GPUs (CUDA). Once this change is merged, the same tests will also run on AMD GPUs (ROCm).

To prevent PR tests from getting stuck if all nodes with ROCm GPUs (currently LUMI) are offline, Jenkins will automatically terminate pending GPU test jobs after one hour. This timeout is preliminary and subject to discussion.
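As a rough illustration of the timeout rule described above, here is a minimal Python sketch of the selection logic only. It is not the actual Jenkins or cms-bot implementation: `PendingJob`, `jobs_to_terminate`, and the job names are hypothetical, and the one-hour value simply mirrors the preliminary number proposed here.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta

# Assumption: one hour, as proposed in this RFC; the value is still under discussion.
PENDING_TIMEOUT = timedelta(hours=1)


@dataclass(frozen=True)
class PendingJob:
    """Hypothetical record for a queued GPU test job (not a Jenkins type)."""
    name: str
    queued_at: datetime


def jobs_to_terminate(pending, now, timeout=PENDING_TIMEOUT):
    """Return the jobs that have waited in the queue strictly longer than `timeout`."""
    return [job for job in pending if now - job.queued_at > timeout]


# Usage with made-up job names and timestamps.
now = datetime(2025, 2, 4, 12, 0)
pending = [
    PendingJob("rocm-unit-tests", now - timedelta(minutes=90)),
    PendingJob("rocm-relvals", now - timedelta(minutes=20)),
]
print([job.name for job in jobs_to_terminate(pending, now)])  # ['rocm-unit-tests']
```

Only jobs still waiting for a ROCm node would be candidates here; jobs that have already started running are outside the scope of this sketch.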

@iarspider
Contributor Author

@cms-sw/heterogeneous-l2 FYI

@cmsbuild
Contributor

cmsbuild commented Feb 4, 2025

cms-bot internal usage

@cmsbuild
Contributor

cmsbuild commented Feb 4, 2025

A new Issue was created by @iarspider.

@Dr15Jones, @antoniovilela, @makortel, @mandrenguyen, @rappoccio, @sextonkennedy, @smuzaffar can you please review it and eventually sign/assign? Thanks.

cms-bot commands are listed here

@iarspider
Contributor Author

assign core

@iarspider
Contributor Author

iarspider commented Feb 4, 2025

type documentation

@cmsbuild
Contributor

cmsbuild commented Feb 4, 2025

New categories assigned: core

@Dr15Jones,@makortel,@smuzaffar you have been requested to review this Pull request/Issue and eventually sign? Thanks

@fwyzard
Contributor

fwyzard commented Feb 4, 2025

Thank you for the update 👍🏻

3 participants