This repository has been archived by the owner on Jan 16, 2024. It is now read-only.

ryantjx/BT4222-NBA-Houston-Rockets-Analysis

BT4222 Group 13

Make Houston Rockets Great Again

In this project, our group aimed to:

  1. Understand the current situation of the Houston Rockets basketball team, by mining and analyzing structured historical player statistics and unstructured text data from various social media platforms, and

  2. Provide actionable, data-driven recommendations derived from our machine learning models to improve team performance and build a championship-winning team.

Contributors

  1. Ashwin Kalaichelvan
  2. Koh Tze Kang
  3. Ryan Tan
  4. Hoki Fung
  5. Jacky Seah

Description

This project has three main folders:

  1. Player Performance Prediction
    1. Eric_Gordon.xlsx - Data for Eric Gordon
    2. Player_Performance_Prediction.ipynb - Notebook for Player Performance Prediction
  2. Reddit Sentiment Analysis and Play - by - Play Prediction
    1. NBA_Analytics.ipynb - Notebook for predicting game outcomes
    2. Reddit Public Sentiment - Scraping and Sentiment Analysis.ipynb - Notebook for web-scraping Reddit posts
    3. WebScraping_PlaybyPlay.ipynb - Notebook for web-scraping basketball-reference.com play-by-play data
    4. master_datasheet.xlsx - Dataset of all team and player information
    5. nba_games_data_sentimentanalysis_weighted.csv - Dataset with Twitter sentiment scores
    6. play-by-play-2022-04-10.json - Dataset of play-by-play information
    7. reddit_scrape_posts.json - Dataset of Reddit posts
    8. reddit_sentiment.csv - Dataset of Reddit sentiment dates
    9. reddit_sentiment_scores.json - Dataset of Reddit sentiment scores
  3. Twitter Sentiment Analysis and Topic Modelling
    1. NBA Reporter Twitter Data Mining.ipynb - Notebook for web-scraping tweets from NBA reporters
    2. NBA Reporters.xlsx - Dataset of NBA reporters on Twitter
    3. Twitter_Data_Pre_Processing_and_Topic_Modelling.ipynb - Notebook for Twitter sentiment scores and topic modelling
    4. nba_games.xlsx - Dataset of NBA games
    5. nba_games_data_sentimentanalysis_weighted.csv - Dataset of NBA games with Twitter sentiment scores

Setting Up

Our team used both Google Colab and Jupyter Notebook for development, so the file paths in the notebooks may not match your environment. Before running a notebook, edit its file paths so that it loads the relevant datasets, which sit in the same folder as the notebook.
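One way to avoid editing paths by hand is a small helper that picks the base directory depending on where the notebook runs. The following is a minimal sketch, not code taken from the notebooks: the `COLAB_ROOT` location, the `running_in_colab` check, and the `data_path` helper are all assumptions made for illustration, and the Drive path must be adjusted to wherever you uploaded the repository.

```python
import sys
from pathlib import Path

# Hypothetical Google Drive location of the repository (an assumption).
# In Colab, mount Drive first:
#   from google.colab import drive; drive.mount("/content/drive")
COLAB_ROOT = Path("/content/drive/MyDrive/BT4222-NBA-Houston-Rockets-Analysis")


def running_in_colab() -> bool:
    """Heuristic: Colab runtimes normally have the google.colab module loaded."""
    return "google.colab" in sys.modules


def data_path(filename, folder, in_colab=None):
    """Return the path to a dataset stored next to its notebook.

    folder   -- name of the project folder that holds the notebook and dataset
    in_colab -- override the environment detection (useful for testing)
    """
    if in_colab is None:
        in_colab = running_in_colab()
    if in_colab:
        # On Colab the working directory is /content, so build the full Drive path.
        return COLAB_ROOT / folder / filename
    # In a local Jupyter session the notebook's folder is normally the working
    # directory, so the dataset is found right beside it.
    return Path.cwd() / filename
```

With this in place, a notebook in the Player Performance Prediction folder could load its data with something like `pd.read_excel(data_path("Eric_Gordon.xlsx", "Player Performance Prediction"))`, assuming pandas is imported as `pd`.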
