Exploring the lottery ticket hypothesis
mkdir model
mkdir plots
python3 -m venv venv
source venv/bin/activate
python -m pip install -r requirements.txt
python src/train_feedforward_mnist.py
- I haven't yet added dropout or other regularization to the network, but that should be trivial to add
- I used ReLU instead of tanh as that is the norm nowadays
- Pruning the final layer at only half the rate makes a big difference!
- Pruning before training makes the model train much worse, as expected
- Iterative pruning should be easy to do with the callback argument I added to the training step (see the sketch after this list).
- The paper uses 50 epochs; my example script uses only 10
- Training takes only a few minutes on a relatively beefy machine with ~48 CPUs and no GPU. I think only a couple of lines of code would be needed to make it run on GPUs.
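
Below is a minimal sketch of the pruning idea in the notes above. The repo's actual framework and the signature of the training-step callback aren't shown here, so this assumes PyTorch and a hypothetical per-epoch callback; `magnitude_prune` and `prune_each_epoch` are illustrative names, not functions from the scripts.

```python
import torch
import torch.nn as nn

def magnitude_prune(model, rate, final_rate=None):
    """Zero the smallest-magnitude weights in each Linear layer.

    `final_rate` lets the last layer be pruned at a different rate
    (e.g. half the global rate, per the note above). Returns the
    binary masks so they can be re-applied during further training.
    """
    layers = [m for m in model.modules() if isinstance(m, nn.Linear)]
    masks = []
    for i, layer in enumerate(layers):
        r = final_rate if (final_rate is not None and i == len(layers) - 1) else rate
        w = layer.weight.data
        k = int(w.numel() * r)  # number of weights to zero out
        if k > 0:
            cutoff = w.abs().flatten().kthvalue(k).values
            mask = (w.abs() > cutoff).to(w.dtype)
        else:
            mask = torch.ones_like(w)
        w.mul_(mask)
        masks.append(mask)
    return masks

# Hypothetical per-epoch callback for iterative pruning: each round removes
# 20% of the *remaining* weights, so the cumulative rate grows per epoch
# (20%, 36%, 48.8%, ...) rather than pruning everything at once.
def prune_each_epoch(model, epoch):
    rate = 1.0 - 0.8 ** (epoch + 1)
    magnitude_prune(model, rate=rate, final_rate=rate / 2)
```

Note that a plain gradient step can revive zeroed weights, so for real iterative pruning the returned masks would need to be re-applied (or the gradients masked) after each optimizer step. And moving this to a GPU should indeed be only a couple of lines: `model.to(device)` plus sending each batch to the same device.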
- src/train_feedforward_mnist.py: Train a single FF model on MNIST, prune it to varying degrees, and save the output models
- src/analyze_feedforward_mnist.py: Compute statistics from the models saved by the above script
- src/train_correlated_mnist.py: Initialize a model, add varying degrees of correlation between the input and output weights of the same nodes, train for a quarter of an epoch, and report accuracy (see the sketch after this list).
- src/many_mnist.py: Train and analyze 100 FF models pruned by 75%, printing summary statistics.
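
How train_correlated_mnist.py actually induces the correlation isn't described here, so the following is just one plausible construction, again assuming PyTorch: blend each hidden node's outgoing weights toward a per-node signal derived from its incoming weights, with `alpha` controlling the degree of correlation. `correlate_in_out` is an illustrative name.

```python
import torch
import torch.nn as nn

def correlate_in_out(fc_in: nn.Linear, fc_out: nn.Linear, alpha: float):
    """Correlate each hidden node's outgoing weights with its incoming ones.

    fc_in maps inputs -> hidden (weight shape [hidden, inputs]);
    fc_out maps hidden -> outputs (weight shape [outputs, hidden]).
    alpha=0 leaves the random init untouched; alpha=1 makes every outgoing
    weight of node j proportional to the mean of node j's incoming weights.
    """
    with torch.no_grad():
        signal = fc_in.weight.mean(dim=1)        # one scalar per hidden node
        signal = signal / (signal.std() + 1e-8)  # normalize, then rescale to
        signal = signal * fc_out.weight.std()    # match the init's magnitude
        fc_out.weight.mul_(1 - alpha).add_(alpha * signal.unsqueeze(0))

# Example: a 784-300-10 MNIST net with moderately correlated weights.
net = nn.Sequential(nn.Linear(784, 300), nn.ReLU(), nn.Linear(300, 10))
correlate_in_out(net[0], net[2], alpha=0.5)
```

Sweeping `alpha` from 0 to 1 before the quarter-epoch of training would then show how the induced structure affects early accuracy.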