
Genericness of model loading #44

Open
annaproxy opened this issue Jul 18, 2021 · 0 comments
Labels: enhancement (New feature or request)

See #38 (comment)

Right now, the model loader reconstructs the model from the state_dict by reading the size of lstm.hh. That is not awful, but it is brittle: it breaks if the architecture changes (e.g. to two LSTM layers) or if a parameter name changes slightly (e.g. lstm is renamed to rnn).
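A minimal sketch of the pattern being described, assuming a model along these lines (the class name `RNNModel`, the attribute name `lstm`, and the sizes are illustrative assumptions, not the project's actual code):

```python
import torch
import torch.nn as nn

# Hypothetical stand-in for the project's model: a single-layer LSTM
# stored under the attribute name `lstm`.
class RNNModel(nn.Module):
    def __init__(self, input_size, hidden_size):
        super().__init__()
        self.lstm = nn.LSTM(input_size, hidden_size)

model = RNNModel(input_size=10, hidden_size=32)
state = model.state_dict()

# The brittle step: recover the architecture from parameter shapes.
# For an nn.LSTM, weight_hh_l0 has shape (4 * hidden_size, hidden_size)
# and weight_ih_l0 has shape (4 * hidden_size, input_size).
hidden_size = state["lstm.weight_hh_l0"].shape[1]
input_size = state["lstm.weight_ih_l0"].shape[1]

# This reconstruction breaks if the submodule is renamed (lstm -> rnn,
# changing the key names) or if num_layers changes (extra *_l1 keys appear).
restored = RNNModel(input_size, hidden_size)
restored.load_state_dict(state)
```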

This could definitely be improved, even though it is not a problem for the current models.
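One common improvement, sketched here under the same hypothetical `RNNModel` as above, is to store the constructor arguments next to the weights in the checkpoint, so loading never has to infer the architecture from parameter shapes or key names:

```python
import io
import torch
import torch.nn as nn

# Hypothetical model; name and signature are assumptions for illustration.
class RNNModel(nn.Module):
    def __init__(self, input_size, hidden_size, num_layers=1):
        super().__init__()
        self.lstm = nn.LSTM(input_size, hidden_size, num_layers=num_layers)

model = RNNModel(input_size=10, hidden_size=32, num_layers=2)

# Save the hyperparameters alongside the state_dict. Loading then works
# unchanged even if the architecture grows (e.g. more layers).
checkpoint = {
    "config": {"input_size": 10, "hidden_size": 32, "num_layers": 2},
    "state_dict": model.state_dict(),
}
buf = io.BytesIO()  # stands in for a file on disk
torch.save(checkpoint, buf)

buf.seek(0)
loaded = torch.load(buf)
restored = RNNModel(**loaded["config"])
restored.load_state_dict(loaded["state_dict"])
```

The checkpoint stays a plain dict of tensors and Python scalars, so it remains loadable without the model class and is robust to renames as long as the config keys match the constructor.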

(The most generic solution would be to save the model object directly (as a "binary") rather than a state_dict, which makes it instantly torch.load-able. However, that ties the saved file to the exact RNN implementation, the location of its definition, the torch version, etc., which is why it is not the recommended way to save models.)
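For reference, that whole-model route looks roughly like this (again with a hypothetical `RNNModel`; the `weights_only=False` argument is needed on recent torch versions, since a pickled module is arbitrary pickle data):

```python
import io
import torch
import torch.nn as nn

class RNNModel(nn.Module):
    def __init__(self, input_size, hidden_size):
        super().__init__()
        self.lstm = nn.LSTM(input_size, hidden_size)

model = RNNModel(10, 32)

# torch.save on the module itself pickles the whole object, including a
# reference to the RNNModel class by its import path.
buf = io.BytesIO()
torch.save(model, buf)

# Loading requires the class definition to be importable at the same
# location, which is exactly the fragility described above.
buf.seek(0)
restored = torch.load(buf, weights_only=False)
```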
