Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

GPU operators #163

Draft
wants to merge 10 commits into
base: main
Choose a base branch
from

Conversation

Abdelrahman912
Copy link

Just initial rough ideas for the design of GPU linear operators πŸ§‘β€πŸŽ„

@termi-official termi-official changed the title init design (no working implementation) GPU operators Dec 19, 2024
@termi-official termi-official linked an issue Dec 23, 2024 that may be closed by this pull request
3 tasks
element_cache = setup_element_cache(protocol, element_qr, ip, sdh)
push!(eles_caches, element_cache)
end
return dh.subdofhandlers |> cu, eles_caches |> cu
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Note that this will be super slow, as we need to do this possibly at each time step.

I already thought about whether it might make sense (and how) to pass the device type around to give more precise control over this funny stuff here.

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I was thinking about the same issue because yes, you are absolutely right this is gonna be super slow. The problem though is that I am trying to change right now the gpudofhandler and gpusubdofhandler so I needed to commit everything in case things blow up, didn't actually intend to push tho πŸ˜‚ but working after Holidays is like forgetting everthing 😒 .

Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

No worries. You can also just start with copy pasting the existing linear operator and changing the internals to your liking to figure out a better API. :)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

GPU assembly of linear forms
2 participants