Replies: 1 comment
-
If my question is unclear, I would be happy to reformulate :-) |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
For small image datasets (few hundreds of samples) one might want to use pretrained (on ImageNet) models to extract features from images and perform a classification task using a ML model (e.g. RandomForests, XGBoost,...). Pretrained models from torchvision.models can be used this way. The documentation of torchvision provides some recommendations regarding the use of pretrained models:
Pretrained models from timm could be used as well. Although I think that similar recommendations would apply for timm models, I was not able to find any documentation on this specific point. I have seen people do the following:
However, inputs should be normalized before they are passed to the model, right?
The following seems more appropriate:
Am I missing something?
Although
extractor
is fully convolutional, should the height and width of input images matchdefault_cfg['input_size']
?Beta Was this translation helpful? Give feedback.
All reactions