Hi-ResNet is an expansion of the original ResNet50 architecture to allow for higher resolution inputs (448x448, 896x896, or 1792x1792). It was created as an alternative to image tiling and may prove useful in analyzing large images with fine details necessary for classification. The inception came from a personal coin grading project, which relies on evaluating fine details of both the front and back of the entirety of the coin. I quickly realized that a 224x224 image completely obscures fine details needed to grade the coin. The model saw ~40% accuracy increase when switching to Hi-ResNet and I believe it could even see more if I had a larger dataset.
The actual efficiency and effectiveness on datasets of various sizes has not been thoroughly tested. Other composite or tiling methods may achieve higher accuracy but will come with a longer and more complex pipeline.
The architecture is just a continuation from the original paper. I have included an architecture diagram for the original ResNet as well as the model heads for the three Hi-ResNet models below. The pattern from the original paper is continued down to the correct input size and number of features. The initial 7x7 conv and max pooling layers are removed from the original architecture and replaced by the Hi-ResNet head.
There are three Hi-ResNet model heads to choose from, depending on your image input size. The three arguments to pass through are:
- size (Int)
(448, 896, or 1792) Size of the input image. - weights (Str)
("Res50" or "None") Set to "Res50" to train with original ResNet50 imagenet weights within the base model and randomly initialized weights for the Hi-ResNet head. "None" will randomly initialize everything. - classes (Int)
Will set the number of output neurons for the final layer.
If you want to tweak anything else such as the fully connected layers, dropout, regularization, you will need to revise the code.
model = HiResNet(size, weights, classes) #will return the Hi-ResNet model.
Example:
!git clone 'https://github.com/johnGettings/Hi-ResNet'
%cd Hi-ResNet
from HiResNet import HiResNet
from tensorflow.keras.optimizers import SGD
model = HiResNet(896, "None", 29)
optimizer = SGD(learning_rate=0.01, momentum=0.9)
model.compile(loss="categorical_crossentropy", optimizer=optimizer, metrics=['accuracy'])
from keras.callbacks import ModelCheckpoint
INITIAL_EPOCHS = 99
checkpoint = ModelCheckpoint("HiResNet.hdf5",monitor='val_loss',verbose=1,mode='min',save_best_only=True,save_weights_only=True)
history = model.fit(ds_train,
epochs=INITIAL_EPOCHS,
callbacks=[checkpoint],
validation_data=ds_val)