- CMU Movie Summary Corpus: The CMU Movie Summary Corpus is a comprehensive dataset providing information about movies, including box office revenue, genre, release date, runtime, language, actors, and plot summaries.
- IMDb Movies Dataset: The IMDb Movies Dataset is a valuable resource for incorporating IMDb ratings into our analysis. It also helps enhance our dataset by filling in missing information not covered by the CMU Movie Summary Corpus.
- The Movies Dataset: The Movies Dataset is another dataset containing information about movies. We leverage this dataset to supplement our analysis with additional data that may be absent from the CMU Movie Summary Corpus.
- Mojo Movie: The Mojo Movie dataset provides information on top box office revenue for English movies. It complements our analysis by filling potential gaps in the CMU Movie Summary Corpus.
- Global Database of Inflation: The Global Database of Inflation is a dataset containing information on world inflation. We use it to normalize movie revenues, ensuring a consistent monetary scale over the years.
- Knowledge Graph API: We use the Knowledge Graph API as a replacement for the deprecated Freebase API. It lets us map the Freebase IDs of ethnicities to their corresponding names.
- List of African-American actors, List of Hispanic and Latino American actors, and List of Italian-American actors: Wikipedia pages we use to supplement our analysis with additional actors' ethnicities that may be absent from the CMU Movie Summary Corpus.
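
To illustrate how inflation data can put revenues on a consistent monetary scale, here is a minimal sketch. It is not the project's actual pipeline: the function names, the sample rates, and the assumption that the input holds one annual percentage rate per consecutive year are all hypothetical. It chains the annual rates into a cumulative price index and divides each revenue by the index of its release year.

```python
def build_price_index(inflation_pct, base_year):
    """Return a price index per year, scaled so that index[base_year] == 1.0.

    inflation_pct maps year -> percent change in prices from the previous
    year to that year. Years are assumed to be consecutive (hypothetical
    input layout, not the actual schema of the inflation dataset).
    """
    years = sorted(inflation_pct)
    index = {}
    level = 1.0
    for i, year in enumerate(years):
        # The first year is the starting point, so its own rate is not applied.
        if i > 0:
            level *= 1.0 + inflation_pct[year] / 100.0
        index[year] = level
    base = index[base_year]
    return {year: value / base for year, value in index.items()}


def adjust_revenue(revenue, release_year, price_index):
    """Express a revenue from release_year in base-year money."""
    return revenue / price_index[release_year]


# Made-up rates for illustration only.
sample_rates = {2000: 0.0, 2001: 10.0, 2002: 10.0}
price_index = build_price_index(sample_rates, base_year=2002)
adjusted = adjust_revenue(100.0, 2000, price_index)
```

With a single base year, revenues from different decades become directly comparable, which is the point of the normalization described above.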