PyTorch ViT Image Classification with Apple MPS + Flask APIs + Web Client

This repository demonstrates an end-to-end pipeline for image classification using a Vision Transformer (ViT) model built with PyTorch, optimized for Apple's MPS (Metal Performance Shaders) to leverage GPU acceleration on M1, M2, and M3 Macs. The project includes a Flask-based API for backend model inference, allowing users to interact with the model through a web client. It provides a seamless integration of deep learning, RESTful APIs, and web deployment for real-time image classification tasks.

Features

PyTorch-based ViT image classification model, optimized with Apple MPS.
Flask server with RESTful API for model inference.
A simple web interface for interacting with the model.

Prerequisites

Apple Silicon Mac (M1, M2, or M3) for MPS acceleration (or use CPU). Windows GPU (CUDA) support will be added soon.
Python 3.8 or higher.
If using a Conda environment, Conda needs to be installed.

Installation

Clone the repository:

git clone https://github.com/mehradnia/PyTorch-ViT-Image-Classification-with-Apple-MPS-Flask-APIs-Web-Client.git
cd PyTorch-ViT-Image-Classification-with-Apple-MPS-Flask-APIs-Web-Client

Create and activate a virtual environment (Choose one of the methods below):

Using venv:

python3 -m venv venv
source venv/bin/activate  # On Windows use venv\Scripts\activate

Using Conda (make sure Conda is installed):

conda create --name myenv python=3.x
conda activate myenv

Install dependencies:
```
pip install -r requirements.txt
```

Set up your dataset:

Create a data/[YOUR_DATASET] directory.

Add your dataset structured in class-based folders:

data/[YOUR_DATASET]/
├── class1/
│   ├── 1.jpg
│   └── 2.jpg
├── class2/
│   ├── 1.jpg
│   └── 2.jpg

An example for an animals dataset:

data/animals/
├── cat/
│   ├── 1.jpg
│   └── 2.jpg
├── dog/
│   ├── 1.jpg
│   └── 2.jpg

Running the Project

1. Train the Model

Open the config.yaml file and replace path/to/your/data with your data directory (eg: data/animals).
Open the notebooks/vit_image_classifier.ipynb file and proceed through the instructions within the notebook to train your model.
Once training is completed, you can find the trained model in the /models directory.

2. Run the Flask Server

Start the Flask server using the command:
```
python3 app/server.py
```
Flask will run the server at http://localhost:8000.
The API exposes:
- POST /predict: Upload an image and get the classification result.

Example POST request:

curl -X POST -F "file=@path_to_image.jpg" http://localhost:8000/predict

Key Topics of Training Model:

Early Stopping:

This technique monitors the model's performance on the validation set during training. If the model's validation loss stops improving for a specified number of epochs (patience), training is halted to prevent overfitting and save time.

Data Augmentation:

Random transformations are applied to the training data to increase the variety of the dataset. Techniques like random resizing, rotations, and color jittering are used to help the model generalize better by learning from a broader range of input variations.

Reproducible Data Splitting:

It refers to the systematic partitioning of datasets into training, validation, and test subsets in a manner that guarantees consistency across multiple runs. This method helps make model evaluations reliable and allows others to replicate the experiments. Reproducible data splitting is important for keeping machine learning workflows fair and ensuring that different models can be compared accurately.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PyTorch ViT Image Classification with Apple MPS + Flask APIs + Web Client

Table of Contents

Features

Prerequisites

Installation

Running the Project

1. Train the Model

2. Run the Flask Server

Key Topics of Training Model:

Early Stopping:

Data Augmentation:

Reproducible Data Splitting:

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
app		app
models		models
notebooks		notebooks
.gitignore		.gitignore
README.md		README.md
config.yaml		config.yaml
requirements.txt		requirements.txt

mehradnia/PyTorch-ViT-Image-Classification-with-Apple-MPS-Flask-APIs-Web-Client

Folders and files

Latest commit

History

Repository files navigation

PyTorch ViT Image Classification with Apple MPS + Flask APIs + Web Client

Table of Contents

Features

Prerequisites

Installation

Running the Project

1. Train the Model

2. Run the Flask Server

Key Topics of Training Model:

Early Stopping:

Data Augmentation:

Reproducible Data Splitting:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages