Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

WIP : MI300A Support #170

Open
wants to merge 1 commit into
base: develop
Choose a base branch
from

Conversation

PaulMullowney
Copy link
Contributor

This PR is a work in progress however I wanted to get feedback sooner rather than later.

For optimal performance, MI300A requires that buffers used in GPU aware MPI be allocated via hipMalloc. To implement this, I added new public APIs: RESERVE_GAM and GET_ALLOCATION_GAM. Any device buffer which is used in GPU aware communication needs to be allocated through these methods.

The buffered and growing allocator classes get new data members/methods that mimic the original implementation. The distinction is that once it is time to perform an allocation deep in REALLOCATE_GROWING_GAM_ALLOCATION in growing_allocator_mod.F90, the code does a hipMalloc for AMD hardware. For Nvidia we use the original ALLOCATE and ACC ENTER DATA CREATE pathway.

@FussyDuck
Copy link

FussyDuck commented Nov 8, 2024

CLA assistant check
All committers have signed the CLA.

@PaulMullowney PaulMullowney changed the base branch from main to develop November 8, 2024 20:44
@samhatfield samhatfield self-requested a review November 11, 2024 14:02
@samhatfield
Copy link
Collaborator

Thanks Paul. I will take a look at this shortly. In the mean time, could you check the metadata for commit 8fbf57e? It looks like there are two names on that commit, and it's confusing the license approval mechanism.

@PaulMullowney
Copy link
Contributor Author

I rebased against the current develop and squashed all commits into 1.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request gpu
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants