Publication directory

We provide various files to facilitate retraining the models from the publication.

The splits directory contains the exact train/tune/test sets we used for each dataset. Each split is stored as numerical indices, one set of indices per train/tune/test partition, that index into the corresponding dataset .tsv file in the data directory. See train_test_split.ipynb for information about the standard splits and extrapolation.ipynb for information about the position-based and mutation-based splits.
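As a rough illustration of how index-based splits select rows from a dataset, here is a minimal sketch. It assumes a split file holds one zero-based row index per line and that indices count data rows after the header; both are assumptions, and the notebooks named above remain the reference for the real file layout. The column names and values are toy stand-ins, not data from this repository.

```python
import csv
import io


def read_indices(handle):
    """Parse one integer row index per non-empty line (assumed format)."""
    return [int(line) for line in handle if line.strip()]


def select_rows(rows, indices):
    """Return the dataset rows at the given positions, in split order."""
    return [rows[i] for i in indices]


# Toy stand-ins for a dataset .tsv and two split files (all hypothetical).
dataset_tsv = "variant\tscore\nA1G\t0.5\nC2T\t-1.2\nD3E\t0.9\nF4L\t0.1\n"
train_split = "0\n2\n"
test_split = "3\n"

# Indices are assumed to be zero-based and to exclude the header row.
rows = list(csv.DictReader(io.StringIO(dataset_tsv), delimiter="\t"))
train_rows = select_rows(rows, read_indices(io.StringIO(train_split)))
test_rows = select_rows(rows, read_indices(io.StringIO(test_split)))

print([r["variant"] for r in train_rows])
print([r["variant"] for r in test_rows])
```

Keeping the splits as indices rather than as copies of the data means every split refers back to the same dataset file, so the sets cannot drift apart from it.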

The regression_args directory contains argument files meant to be used with regression.py. These argument files have been set to use the same train/test splits, network architectures, and hyperparameters we used to train the models in the publication. To train your desired model, run python code/regression.py @pub/regression_args/<desired model> from the root directory. The output, which includes the trained model, evaluation metrics, and predictions on each of the train/tune/test sets, will automatically be placed in training_logs. Due to the stochastic nature of training neural networks, your results may not match ours exactly, but they should be fairly close.
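The @ prefix in the command above resembles Python's argparse argument-file feature (fromfile_prefix_chars). Whether regression.py is built on argparse is an assumption here, and the flags below are hypothetical placeholders rather than the script's real options. The sketch only shows how such a file is expanded into ordinary command-line arguments.

```python
import argparse
import os
import tempfile

# Hypothetical parser: these flags are placeholders, not regression.py's options.
parser = argparse.ArgumentParser(fromfile_prefix_chars="@")
parser.add_argument("--learning_rate", type=float, default=0.001)
parser.add_argument("--epochs", type=int, default=10)

# Write a throwaway argument file. By default argparse treats each line
# of the file as one command-line argument.
with tempfile.NamedTemporaryFile("w", suffix=".txt", delete=False) as f:
    f.write("--learning_rate\n0.0001\n--epochs\n50\n")
    args_path = f.name

try:
    # "@<path>" is replaced by the arguments read from that file.
    args = parser.parse_args(["@" + args_path])
finally:
    os.remove(args_path)

print(args.learning_rate, args.epochs)
```

Storing a full configuration in one file makes a training run reproducible from a single short command.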

The trained_models directory contains pre-trained models. These are retrainings of the models used in the publication, so, again due to the stochastic nature of training neural networks, they are similar to the published models but may not match them exactly. You can use them to perform inference and make predictions for new variants by following the example in the inference.ipynb notebook.

The rosetta_scores directory contains the Rosetta scores for each dataset. These scores were used to compute correlations for the manuscript.
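For readers who want to see what such a correlation involves, here is a minimal sketch of a Pearson correlation between Rosetta scores and measured scores. The text does not say which correlation statistic the manuscript used, so Pearson is an assumption, and the numbers are made up for illustration rather than taken from the rosetta_scores files.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


# Made-up values for illustration only (not from this repository).
rosetta_scores = [-3.0, -1.0, 0.5, 2.0]
measured_scores = [1.2, 0.8, 0.1, -0.9]

r = pearson(rosetta_scores, measured_scores)
print(round(r, 3))
```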
