Skip to content

How does DropPath's implementation relate to the paper? #593

Answered by rwightman
amaarora asked this question in Q&A
Discussion options

You must be logged in to vote

@amaarora they are the same (as used here), drop_connect was the name used for stochastic depth in the original TF efficientnet code, and I adopted that name, then realized it conflicts with another, different use of 'drop connect'. Since 'stochastic depth' is a mouth/keyboard full and more of a concept name than a good layer name, I used drop_path as the layer to implement stochastic depth (by dropping paths in the residual). The arg for drop_connect -> drop_path

Replies: 1 comment 8 replies

Comment options

You must be logged in to vote
8 replies
@rwightman
Comment options

@amaarora
Comment options

@rwightman
Comment options

@rwightman
Comment options

@amaarora
Comment options

Answer selected by amaarora
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants