Hands-On Python Machine Learning - IAP at MIT

In this short IAP I will teach concepts and algorithms that are used repeatedly in practical supervised learning. I hope that taking this IAP will encourage you to use machine learning (ML) in your research, and will facilitate reading ML litrature.

Machine learning is a very broad field and naturally I will not be able to cover all aspects/topics/practices of ML. But the good news is that learning only two main algorithms can be sufficient for almost all practical purposes of supervised ML.

Decision tree based models (i.e. Random Forests and Gradient Boosting Machines), successful mainly for structured data (tabular data)
Neural networks, successful mainly for unstructured data (such as audio, vision, and natural language), although recently also becoming popular in tabular data (see fastai courses, which I have used repeatedly while preparing this course).

Most of other ML algorithms (that gained popularity at some point during their lifetime), are outdated and are not very useful in most cases.

In this course I will not invest a lot of time on rigorous derivations, proofs and etc. Instead we will use our time to gain some intuition how ML models work and make our hands ''dirty'' with coding. This is very different from a typical academic courses, which are usually very rigorous and invest their time explaining in details every aspect of the material.

Every session will be devided to a teaching session (that will be given with jupyter notebooks - lesson*.ipynb) and a practice session where you will try code things that we learn (basically using machine learning to get predictions for some data set).

The main topics of the sessions are:

Classification with Random Forest
Regression with Random forest and XGBoost
Fully connected Neural Networks (using pytorch)
Convolutional Neural Networks
Transfer Learning (using fastai)

Getting Started

The easiest option to get the course material is cloning this git repository. To do so type git clone In order to get all the material for the course I suggest cloning this git repository. To do so type:

git clone https://github.com/yaniyuval/ML1_IAP.git

Alternatively, you can download a zip file.

Prerequisites

Installing

Installing anaconda:

To install Anaconda please follow the instructions here, since the installation depends on the OS you have, I cannot provide the exact way how you will install it.

After you installed anaconda, please update your conda version before continuing with these instructions. This is done by typing

conda update --all

If you do not want to update all your packages - please read here about other options how to do partial update (not recommended unless you know what you are doing).

Updating python version

I am using Python 3.7.5, and I encourage you to update your python version to be at least 3.6. You can update to the latest python version by typing:

conda update python

Virtual environment

To create a new virtual environment called ML_IAP with anaconda type:

conda create --name ML_IAP python=3.7.5

If you want to use a different python version you can change the python version but take into account that I verified that the code in the notebooks runs with this version (I certainly do not recommend any version prior to 3.6).

To activate the environment (you should do this always before opening the course notebooks or when installing packages) type:

conda activate ML_IAP

in older environments you might need to type:

source activate ML_IAP

in order to activate the virtual environment (though if you installed the new anaconda, you won't need it).

If you want to understand what is a virtual environmnet please read here.

If you want more details regarding creating virtual environments, please read here

Installing packages

Here you will install packages that are necessary for the course. Please type:

conda install scikit-learn numpy matplotlib scipy IPython pandas

also type the following commands:

conda install jupyter notebook

Continue by typing

pip install xgboost

pip install pandas_summary

pip install category_encoders

If you have any problems with category_encoders package, try typing:

conda install -c conda-forge category_encoders

Installing pytorch

pytorch is the package we will use for deep learning. The specific line of code you will run in order to get pytorch, depends on your OS and your python version. In order to understand what should you type, please go here and choose your OS/versions. I installed a version without a GPU (choosing CUDA none) and needed to type:

conda install pytorch torchvision -c pytorch

But this line of code is proper to my mac and you might need to write something else. Note that even if you do have a GPU (which is great!) I will not teach anything that requires it, and all the code was written to run on a CPU.

Since there was a recent new release of pytorch which still has a small issue (which should be solved in a couple of days), I recommend you to type:

pip install "pillow<7"

Although this might be solved in the next few days - see here and might be not necessary.

Installing fastai

type:

 pip install fastai

if you have problems in installing fastai - please read here

Running Jupyter notebook

Go to the IAP library (where you cloned the course repository) and type (don't forget to first activate your ML_IAP virtual environment):

jupyter notebook

Now choose a notebook that you want to run.

Comment - Jupyter Notebooks

All through the IAP, we will use Jupyter notebooks. If you are not familiar with Jupyter notebooks, please go over some basic tutorial (e.g., fastai tutorial notebook, general introduction to jupyter notebooks) and make sure you are able to run code on notebooks. I also added to the repository a tutorial notebook (jupyter-intro_RASP.ipynb) which is taken from Stephan's Rasp repository.

Test packages

to test that your installation of all packages has worked you can run that you can run the single cell in the notebook test_packages.ipynb.

If you get some error, please try to understand which package you did not install properly and try to reinstall it.

Warning

The notebooks might still have some small changes, so please clone the repository again before each session

Name		Name	Last commit message	Last commit date
Latest commit History 37 Commits
.ipynb_checkpoints		.ipynb_checkpoints
figs		figs
Gradient_decent_explained.ipynb		Gradient_decent_explained.ipynb
README.md		README.md
jupyter-intro_RASP.ipynb		jupyter-intro_RASP.ipynb
lesson1_RF_classification.ipynb		lesson1_RF_classification.ipynb
lesson2-RandomForest_reg.ipynb		lesson2-RandomForest_reg.ipynb
lesson3-NN.ipynb		lesson3-NN.ipynb
lesson4-CNN.ipynb		lesson4-CNN.ipynb
lesson5_transfer_learning_Class.ipynb		lesson5_transfer_learning_Class.ipynb
practice1-titanic_classification.ipynb		practice1-titanic_classification.ipynb
practice2-regression.ipynb		practice2-regression.ipynb
practice3-MNIST.ipynb		practice3-MNIST.ipynb
practice4-CNN_CIFAR10.ipynb		practice4-CNN_CIFAR10.ipynb
test_packages.ipynb		test_packages.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Hands-On Python Machine Learning - IAP at MIT

Getting Started

Prerequisites

Installing

Installing anaconda:

Updating python version

Virtual environment

Installing packages

Installing pytorch

Installing fastai

Running Jupyter notebook

Comment - Jupyter Notebooks

Test packages

Warning

Recommended reading

Please let me know if you have problems: [email protected]

About

Releases

Packages

Languages

yaniyuval/ML1_IAP

Folders and files

Latest commit

History

Repository files navigation

Hands-On Python Machine Learning - IAP at MIT

Getting Started

Prerequisites

Installing

Installing anaconda:

Updating python version

Virtual environment

Installing packages

Installing pytorch

Installing fastai

Running Jupyter notebook

Comment - Jupyter Notebooks

Test packages

Warning

Recommended reading

Please let me know if you have problems: [email protected]

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages