Multi-GPU via threading instead of processes #50

marius311 · 2021-02-27T22:01:01Z

WIP towards using one-GPU-per-thread instead of one-GPU-per-process. There are two big advantages. First, you don't need to launch multiple processes, which is slow and at least doubles the startup time. Second, memory can be shared between the GPUs using unified memory, instead of having to be serialized and distributed between the different processes (which necessarily passes through CPU memory).

Right now this works:

tmap(collect(devices())) do dev
    device!(dev)  # bind this thread to its own GPU
    for i=1:N
        gradient(ϕ -> norm(LenseFlow(ϕ)*f), ϕ)  # gradient through LenseFlow, computed on that GPU
    end
end

although MAP and sampling are still WIP.
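For reference, the same one-GPU-per-task pattern can be sketched without `tmap`, using only `Threads.@spawn` from Base and `devices`/`device!` from CUDA.jl. This is a minimal sketch and not code from this PR. It assumes a CUDA.jl version in which the device selected by `device!` is task-local, and a Julia session started with at least as many threads as GPUs. The helper name `foreach_gpu` and the stand-in workload are hypothetical.

```julia
using CUDA

# Hypothetical helper (not part of this PR): run `work(dev)` once per GPU,
# each call in its own task. Start Julia with at least as many threads as
# GPUs (e.g. `julia -t 4`) so the tasks can actually run in parallel.
function foreach_gpu(work)
    tasks = map(collect(devices())) do dev
        Threads.@spawn begin
            device!(dev)   # select the GPU for this task (assumed task-local)
            work(dev)
        end
    end
    return fetch.(tasks)   # wait for every task and collect the results
end

# Stand-in workload: allocate an array on the task's GPU and reduce it there.
sums = foreach_gpu() do dev
    x = CUDA.ones(Float32, 1024)
    sum(x)
end
```

Everything stays in one process, so the results come back through `fetch` as ordinary Julia values with no inter-process serialization.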

Right now this needs CUDA 11.2 and my branch of CUDA.jl: https://github.com/marius311/CUDA.jl/tree/no_gc_ctx_switch

marius311 added 2 commits February 26, 2021 02:23

WIP one-GPU-per-thread

7263998

add function to adapt to unified GPU memory

d2fa591

marius311 marked this pull request as draft February 27, 2021 22:01

marius311 force-pushed the master branch from 85a63d8 to 4fc14e9 Compare April 14, 2021 08:36

marius311 added 3 commits April 15, 2021 02:36

Merge branch 'cuda3' into threadgpu

cc42d60

Merge branch 'origin/master' into threadgpu

7f6dacd

leave CUDA's unsafe_execute be

f6ada72

marius311 force-pushed the threadgpu branch from adead5a to 4bd8c5a Compare April 28, 2021 18:24

turn off TimerOutputs when using threads

2e875d6

marius311 force-pushed the master branch from 43316b9 to a445f9a Compare April 28, 2021 18:47

marius311 force-pushed the threadgpu branch from 4bd8c5a to 2e875d6 Compare April 28, 2021 18:47

marius311 force-pushed the master branch 2 times, most recently from bb06adc to 9f016a7 Compare January 6, 2023 01:43

marius311 force-pushed the master branch from 0374568 to 9b821d4 Compare January 20, 2023 18:03
