TeamPunchParty

Data200s Group project

Firstly, shout out to my teammate Korede Ogundele, who wrote a significant amount of code in this repository and who I've always enjoyed working with.

In this project, we combined data analysis and machine learning techniques to model air quality across various US counties. Using Python libraries NumPy, Pandas, seaborn, matplotlib, and scikit-learn, our team constructed two linear regression models and successfully conducted a comprehensive analysis of emission data.

Links:

dataset_A_preprocessing.ipynb

Originally copied from GHCN_data_preprocessing.ipynb in the dataset drive, we will likely tailor this to our needs.

Open questions:

See the "Emmissions questions" tab of the spreadsheet above. We can talk about transferring those questions to this readme or whatever long term organization we want.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

TeamPunchParty

Links:

The project description page:

The google drive for dataset A originally provided to us

Our spreadsheet of initially brainstormed questions

Our document with our initial proposal

Checkpoint 1 google doc writeup:

Checkpoint 2 google doc writeup:

dataset_A_preprocessing.ipynb

Open questions:

Files

README.md

Latest commit

History

README.md

File metadata and controls

TeamPunchParty

Links:

The project description page:

The google drive for dataset A originally provided to us

Our spreadsheet of initially brainstormed questions

Our document with our initial proposal

Checkpoint 1 google doc writeup:

Checkpoint 2 google doc writeup:

dataset_A_preprocessing.ipynb

Open questions: