This project develops a machine learning solution for early detection of Autism Spectrum Disorder (ASD) using multiple classification algorithms. By leveraging advanced data preprocessing and machine learning techniques, the model aims to provide a reliable screening tool for potential ASD identification.
-
Multiple Machine Learning Models:
- Logistic Regression
- XGBoost Classifier
- Support Vector Machine (SVM)
-
Advanced Data Preprocessing:
- Feature engineering
- Age group categorization
- Log transformation
- Label encoding
- Feature scaling
-
Handling Class Imbalance:
- Random Over-Sampling technique to balance dataset
-
Interactive Streamlit Web Application:
- User-friendly interface
- Model selection
- Real-time ASD prediction
- Clone the repository
git clone https://github.com/yourusername/autism-prediction.git
cd autism-prediction
- Create virtual environment
python -m venv venv
source venv/bin/activate # On Windows: venv\Scripts\activate
- Install dependencies
pip install -r requirements.txt
python train.py
streamlit run app.py
-
Logistic Regression:
- Training AUC: ~0.85
- Validation AUC: ~0.80
-
XGBoost:
- Training AUC: ~0.90
- Validation AUC: ~0.85
-
Support Vector Machine:
- Training AUC: ~0.88
- Validation AUC: ~0.82
The dataset includes various features related to:
- Demographic information
- Behavioral scores
- Medical history
- Log transformation of age
- Feature engineering
- Label encoding
- Standard scaling
- Handling class imbalance
- Fork the repository
- Create your feature branch (
git checkout -b feature/AmazingFeature
) - Commit your changes (
git commit -m 'Add some AmazingFeature'
) - Push to the branch (
git push origin feature/AmazingFeature
) - Open a Pull Request
[MIT]