Skip to content

callmeBalloch/google-brain-ventilator-pressure-prediction

Repository files navigation

Part of 1st place solution (Simple LSTM) of Google-Brain-Ventilator competition

PyTorch Lightning Config: Hydra Template
Paper Conference

NOTE: This code is based the lightning-hydra-template. Give it a try!

Description

Simple LSTM model used in the https://www.kaggle.com/c/ventilator-pressure-prediction competition.

This is the first (out of two) model we used for the strong ensemble in the ventilator-pressure-prediction competition. This model is the one that is probably the easiest to explain, but also the one that resulted in the best LB score of 0.1209. It is based on the starter notebook from starter notebook from theoviel. It uses the following 9 features:

  • u_in
  • u_out
  • time_step
  • dummy features for R (3) and C (3)

the featurize function is contained in here and yes! that is all! You can find how the dataset is generate in the datamodule.

The current model is trained using a ReducelrOnPlateau with high patiente (30 epochs!) Why? May you ask. While we were working on the network training we noted that fancy features may not be "all we need" in this comp. In particular all the fast annealeing scheduler we tried were easily overfitting the training the data, but the validation MAE was not decreasing as fast.

In the following graphs you can clearly see the red experiments named "stakes". "stakes" converged much faster but validation MAE stopped at more than 0.150 (that is really high). All the others experiments we tried, letting the model training at higher lr were in hte end better, converging at validation MAE < 0.135.

The "cloudy" (pink) model reaches nearly 0.135 for every fold.

Different enperiments using many schedulers

The model configurations can be found configs/experiments

How to run

Install dependencies

# clone project
git clone https://github.com/whoknowsB/google-brain-ventilator-pressure-prediction
cd google-brain-ventilator-pressure-prediction

# [OPTIONAL] create conda environment
bash bash/setup_conda.sh

# install requirements
pip install -r requirements.txt

Download and extract the model weights in the root directory. You should have a structure that looks like this google-brain-ventilator-pressure-prediction/logs/experiments/

Update the dataset directory in the file configs/config.yaml

# path to folder with data
data_dir: /input/ventilator-pressure-prediction

Train model with chosen experiment configuration from configs/experiment/

python run.py experiment=cloudy ++trainer.gpus=[0] ++datamodule.fold=0 

You can easily run all the folds by

for fold in 0 1 2 3 4 5 6 7 8 9 10; do python run.py experiment=cloudy ++trainer.gpus=[0] ++datamodule.fold=$fold; done

You can easily get the oof and preds for a fold

python run_inference.py experiment=cloudy ++trainer.gpus=[0] ++datamodule.fold=0

You can easily get all the oof and preds by fold using:

for fold in 0 1 2 3 4 5 6 7 8 9 10; do python run_inference.py experiment=cloudy ++trainer.gpus=[0] ++datamodule.fold=$fold; done

oof and preds will be save inside the logs/experiments/cloudy/fold_number

To easily gather all the oofs and preds we suggest to use the notebook folding, just replace the model name, and the folds you need.

names = ['cloudy'] 
folds = range(11) # [0,1,2,3]

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published