Skip to content

info201b-2022-spring/final-projects-cristian-4

Repository files navigation

Voter Registration Data Analysis

Domain of Interest: Voter Registration in the United States

Why are you interested in this field/domain?

For our domain of interest we decided to focus on voter registration in the United States for the November 2020 Presidential Election. We became intrigued by datasets regarding voting after we discussed them in lecture. We decided to categorize voter registration in 2020 by race in order to gain more of an understanding of the disparities within country’s election process. Unfortunately, voter registration in the United States can directly correlate with an individual's race. Voter registration laws are often changed in order to make it more difficult for minority groups to vote and many choose to not even register because voting booths are not very accessible. Voting booths are often placed two towns over from communities whose populations are made up of mainly minority groups, making it extremely difficult for people of color to vote as well as those in low-income areas.

What are other examples of data-driven projects related to this domain?

  • Comparing data on voter registration based on regional information.
  • Comparing data on voter registration based on gender.
  • Comparing data on voter registration based on political party identification.

What data-driven questions do you hope to answer about this domain?

Were more white individuals registered to vote this past election than any other race?

How many individuals within minority groups registered to vote in the November 2020 election?

Are there any racial disparities surrounding voter registration for the November 2020 presidential election?

Finding Data

Silpa - Data Set #1: African Americans and Voting Registration Statistics

Where did you download the data?

U.S. Census Bureau: Voting and Registration in the Election of November 2020

How was the data collected?

The Data was collected by the U.S. Census Bureau. The U.S. Census Bureau is a federal agency that works on collecting facts and figures about American people and the U.S. economy through various sampling methods. This specific data was collected through a current population survey. The data collected was based on the African American population’s voting and registration statistics of the November 2020 election.

How many observations (rows) are in your data?

18: Three main categories: both sexes, male, and female. Within each of those they are split into 6 different age ranges. 18 and over, 18-24, 25-44, 45-64, 65-74, 75 and over

How many features (columns) are in the data?

10: total population, total citizen population, reported registered, reported not registered, no response to regsitration, reported voted, reported not voted, no response to voting, reported registered, and reported voted

What questions (from above) can be answered using the data in this dataset?

I was able to use the data in the dataset to answer all of the questions above, except for who and how the data was collected, which was listed on the website just under the data set.

Tika - Data Set #2: Whites and Voting Registration Statistics

Where did you download the data?

U.S. Census Bureau: Voting and Registration in the Election of November 2020

How was the data collected?

The data presented in this table was collected by the US Census Bureau, the principal agency of the United States Federal Statistical System, which collects and produces data regarding the American people and the economy. This data was collected through a survey specifically highlighting White Americans and their voting and registration status.

How many observations (rows) are in your data?

18: Three main categories: both sexes, male, and female. Within each of those they are split into 6 different age ranges. 18 and over, 18-24, 25-44, 45-64, 65-74, 75 and over

How many features (columns) are in the data?

There are three primary features (columns) in this dataset: ‘total population’, ‘United States citizen’, and another ‘total population’ column. The second column, ‘United States citizen’, is subdivided into 7 columns, the first being ‘total citizen population’ and the rest being ‘reported registered’, ‘reported not registered’, ‘no response to registration’, ‘reported voted’, ‘reported not voted’, and ‘no response to voting’. All of these are subdivided into two further categories: number and percent. The third column, ‘total population’ is subdivided into ‘reported registered’ and given as a percentage, and ‘reported voted’ which is also given as a percentage.

What questions (from above) can be answered using the data in this dataset?

All the questions could be answered by using this dataset except for exactly how this data was collected.

Cristian - Data Set #3: Latinos and Voting Registration Statistics

Where did you download the data?

U.S. Census Bureau: Voting and Registration in the Election of November 2020

How was the data collected?

The data was collected by the US Census Bureau by gathering data (for the '20 November election) on: reported voting and registration by race, hispanic origin, sex, and age.

How many observations (rows) are in your data?

18

How many features (columns) are in the data?

10

What questions (from above) can be answered using the data in this dataset?

I used this dataset to answer all of the above questions except for how the data was collected, and by who.

About

final-projects-cristian-4 created by GitHub Classroom

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •