Project: Personalized Music Streaming Service

Introduction

The Personalized Music Streaming Service project aims to develop a streamlined alternative to popular music streaming platforms like Spotify. This project incorporates a sophisticated music recommendation system alongside playback and streaming capabilities, delivering real-time suggestions derived from user activity.

Dependencies

Before getting started, ensure you have the following dependencies installed:

Python
Flask
Apache Spark
Apache Kafka
MongoDB
Python Libraries:
- Kafka-Python: pip install kafka-python
- Pandas: pip install pandas
- Librosa: pip install librosa
- NumPy: pip install numpy
- Scikit-Learn: pip install scikit-learn
- Joblib: pip install joblib
- Flask: pip install Flask
- PySpark: pip install pyspark
- TQDM: pip install tqdm
Dataset:
- Free Music Archive (FMA): Download from the GitHub repository.

Phase #1: Extract, Transform, Load (ETL) Pipeline

In this phase, we'll create an Extract, Transform, Load (ETL) pipeline utilizing the Free Music Archive (FMA) dataset. The FMA dataset comprises 106,574 tracks, each lasting 30 seconds, spanning 161 unevenly distributed genres. We'll utilize the fma_large.zip dataset, along with fma_metadata.zip for track details like title, artist, genres, tags, and play counts.

Tasks:

Download and preprocess the FMA dataset using Python.
Extract important features like Mel-Frequency Cepstral Coefficients (MFCC), spectral centroid, or zero-crossing rate.
Explore normalization, standardization, and dimensionality reduction techniques.
Store the transformed audio features in MongoDB for scalability and accessibility.

Phase #2: Music Recommendation Model

Now that the data is securely stored in MongoDB, we'll train a music recommendation model using Apache Spark. We can leverage Apache Spark’s MLlib machine learning library or explore deep learning methodologies for enhanced accuracy. Algorithms such as collaborative filtering and Approximate Nearest Neighbours (ANN) can be employed in this process.

Tasks:

Utilize Apache Spark to train the recommendation model.
Evaluate the model using different metrics and perform hyperparameter tuning.
Assess the model's performance and accuracy.

Phase #3: Deployment

In this phase, we'll deploy the trained model onto a web application with streaming capabilities. We'll leverage frameworks like Flask or Django to develop an interactive music streaming web application. Apache Kafka will be used to dynamically generate music recommendations in real-time, based on user activity and historical playback data.

Tasks:

Develop an interactive music streaming web application with Flask or Django.
Utilize Apache Kafka for real-time music recommendation generation.
Implement a user-friendly web interface for seamless navigation and usage.

Folder Structure

ML_Logic: Contains machine learning logic for music recommendation.
Streaming_Logic: Contains logic for real-time music streaming recommendations.
producer.py: Generates music recommendations in real-time using Apache Kafka.
music_app.py: Main application file.
static: Contains static files for the web application (e.g., CSS, JavaScript).
test_logic: Contains test logic for unit testing.

Team

Meet the dedicated individuals who contributed to this project:

Ammar Khasif: GitHub
Arhum Khan: GitHub
Aaqib Ahmed Nazir: GitHub

Additional Information

This project aims to redefine the music streaming experience by delivering personalized recommendations and real-time streaming capabilities.

Let the music play! 🎶

Name		Name	Last commit message	Last commit date
Latest commit History 42 Commits
ML_Logic		ML_Logic
Streaming_Logic		Streaming_Logic
static		static
templates		templates
test_logic		test_logic
.gitignore		.gitignore
Feature_Extraction.ipynb		Feature_Extraction.ipynb
README.md		README.md
music_app.py		music_app.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Project: Personalized Music Streaming Service

Introduction

Dependencies

Phase #1: Extract, Transform, Load (ETL) Pipeline

Tasks:

Phase #2: Music Recommendation Model

Tasks:

Phase #3: Deployment

Tasks:

Folder Structure

Team

Additional Information

About

Releases

Packages

Contributors 2

Languages

ArhumKhan10/Spotify-Music-Recommendation

Folders and files

Latest commit

History

Repository files navigation

Project: Personalized Music Streaming Service

Introduction

Dependencies

Phase #1: Extract, Transform, Load (ETL) Pipeline

Tasks:

Phase #2: Music Recommendation Model

Tasks:

Phase #3: Deployment

Tasks:

Folder Structure

Team

Additional Information

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages