
How to train the network in three GPUs #3

Open
XGBoost opened this issue Apr 12, 2019 · 1 comment

Comments


XGBoost commented Apr 12, 2019

I noticed you mention that training takes four hours on three NVIDIA GTX 1080 Ti GPUs, but README.md does not describe how to train the network on three GPUs.
When I run
`python train.py --id resnet50_rnn --use_rnn`
it uses only a single GPU, and the batch size is 8, which differs from the one reported in your paper.
Could you please describe the training process in detail?

@sunset1995
Owner

Hi @XGBoost
To train on multiple GPUs, you have to modify a few lines of code.
You can check https://pytorch.org/tutorials/beginner/former_torchies/parallelism_tutorial.html
This repo only works on a single GPU.
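For reference, the kind of modification the linked tutorial describes is wrapping the model in `torch.nn.DataParallel`, which splits each batch across the visible GPUs. Below is a minimal sketch of that pattern; the `model` here is a stand-in toy network, not the repo's actual `resnet50_rnn` model, and the exact place to insert the wrapper in `train.py` would need to be located in the repo's code.

```python
import torch
import torch.nn as nn

# Stand-in for the repo's model (illustrative only, not the real architecture).
model = nn.Sequential(nn.Linear(16, 32), nn.ReLU(), nn.Linear(32, 4))

if torch.cuda.device_count() > 1:
    # DataParallel scatters each input batch across all visible GPUs,
    # runs the module replicas in parallel, and gathers outputs on GPU 0.
    model = nn.DataParallel(model)

device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')
model = model.to(device)

x = torch.randn(8, 16).to(device)  # batch of 8, as in the issue
out = model(x)
print(tuple(out.shape))  # (8, 4)
```

Which GPUs are visible can then be controlled at launch time with the standard `CUDA_VISIBLE_DEVICES` environment variable, e.g. `CUDA_VISIBLE_DEVICES=0,1,2 python train.py ...`. Note that on a machine with one GPU (or none) the wrapper is skipped and the script behaves exactly as before.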
