
Use adaptive CUDA launch config to fully utilize GPU devices #111

Closed
kloudkl opened this issue Feb 15, 2014 · 2 comments

kloudkl commented Feb 15, 2014

@sguada published the first profiling results in #81. Starting from there, we can do a more in-depth analysis of key factors such as occupancy to improve device utilization. Occupancy is defined as the ratio of active warps to the maximum number of warps supported on a multiprocessor. Both the CUDA Visual Profiler and the CUDA Occupancy Calculator report this metric.
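For instance, if each multiprocessor supports at most 64 resident warps and a kernel's register usage limits it to 32 active warps per SM, that kernel runs at 32/64 = 50% occupancy (illustrative numbers).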

The best practices guide gives some general principles for execution configuration optimizations to manage resource utilization effectively. Jared Hoberock, an NVIDIA researcher and co-creator of the CUDA template library Thrust, put them into practice with adaptive CUDA launch configurations whose only essential dependency is cuda_runtime_api.h, so adopting them would not introduce any new dependency into Caffe.
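As a rough sketch of what an adaptive configuration could look like (the kernel and helper names here are illustrative, not Caffe's actual helpers), the grid can be sized from the device's multiprocessor count using only runtime-API calls, with a grid-stride loop so a capped grid still covers every element:

```cpp
#include <cuda_runtime_api.h>

// Hypothetical grid-stride kernel: correct for any grid size.
__global__ void scale_kernel(const int n, const float alpha, float* x) {
  for (int i = blockIdx.x * blockDim.x + threadIdx.x; i < n;
       i += blockDim.x * gridDim.x) {
    x[i] *= alpha;
  }
}

// Size the grid from the current device instead of a hard-coded constant:
// enough blocks to cover n elements, capped at a few blocks per SM.
inline int AdaptiveNumBlocks(const int n, const int threads_per_block) {
  int device = 0;
  cudaDeviceProp prop;
  cudaGetDevice(&device);
  cudaGetDeviceProperties(&prop, device);
  const int needed = (n + threads_per_block - 1) / threads_per_block;
  const int cap = prop.multiProcessorCount * 8;  // rough heuristic
  return needed < cap ? needed : cap;
}

// Example launch:
//   scale_kernel<<<AdaptiveNumBlocks(n, 256), 256>>>(n, 2.0f, x);
```

Capping the grid at a handful of blocks per SM keeps launch overhead bounded on small devices while still giving each multiprocessor several blocks to hide latency on large ones.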

kloudkl commented Aug 28, 2014

CUDA Pro Tip: Occupancy API Simplifies Launch Configuration
http://devblogs.nvidia.com/parallelforall/cuda-pro-tip-occupancy-api-simplifies-launch-configuration/
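A minimal sketch of that API in use, assuming a simple grid-stride kernel (hypothetical names; cudaOccupancyMaxPotentialBlockSize is available from CUDA 6.5): it picks a block size that maximizes occupancy for the given kernel without hard-coding device-specific constants.

```cpp
#include <cuda_runtime.h>

// Grid-stride kernel used to illustrate the occupancy API.
__global__ void axpy_kernel(const int n, const float a, const float* x, float* y) {
  for (int i = blockIdx.x * blockDim.x + threadIdx.x; i < n;
       i += blockDim.x * gridDim.x) {
    y[i] += a * x[i];
  }
}

void launch_axpy(const int n, const float a, const float* x, float* y) {
  int min_grid_size = 0;  // smallest grid that can reach full occupancy
  int block_size = 0;     // block size that maximizes occupancy for this kernel
  cudaOccupancyMaxPotentialBlockSize(&min_grid_size, &block_size,
                                     axpy_kernel, 0, 0);
  // Round the grid up so every element gets at least one thread.
  const int grid_size = (n + block_size - 1) / block_size;
  axpy_kernel<<<grid_size, block_size>>>(n, a, x, y);
}
```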

shelhamer commented

Closing as this is not a significant bottleneck at this point.
