NF Net training hparams #507
Unanswered
shenfalong
asked this question in
Q&A
Replies: 2 comments
-
You can get some idea for training configs here: https://github.com/rwightman/pytorch-image-models/blob/master/docs/training_hparam_examples.md |
Beta Was this translation helpful? Give feedback.
0 replies
-
There is a bit in #499 ... you need gradient clipping, either global norm clipping or AGC, you need fairy heavy augmentation and regularization. The NFNet paper describes hparams that will work quite well for NF-ResNet50. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I am reproducing NF-ResNet50, This model has been released with Top1=80%. However, I have trained the model by myself with Top=72%, I think the training setting is differnet, such as pretrained-weights, dropout, droppath, lr scheduler... could you provide the training log along with the released model? Many thanks.
Beta Was this translation helpful? Give feedback.
All reactions