Skip to content

Latest commit

 

History

History
101 lines (54 loc) · 4.18 KB

README.md

File metadata and controls

101 lines (54 loc) · 4.18 KB

INFO 201B - Final Project

Group B-7: Jeffrey Lan, Yiding Zhang, Minh Le, Wencheng Zhang

WILDFIRES IN THE US

Domain of interest

Why are you interested in this field/domain?

The wildfires incidents are increasing recently. This project can help paint the picture of the state of the earth and how it is important to protect the forest and our environment.

What other examples of data driven project have you found related to this domain (share at least 3)?

Fighting Wildfires with Data

The impact of land ownership, firefighting, and reserve status on fire probability in California

Improving the Health of California's Headwater Forests

What data-driven questions do you hope to answer about this domain (share at least 3)?

  • The number of wildfires each year in the US and in each state, and how that number progresses over the year.
  • Which state has the maximum number of wildfires each year, and how that number progresses over the years.
  • Which states appear most frequently in the result of the previous question?
  • Which state appears the second most frequently?
  • How much has the government spent on the cost of fire suppression?

Finding Data

1 - Data set 1:

Where did you download the data (e.g., a web URL)?

https://www.kaggle.com/datasets/rtatman/188-million-us-wildfires

How was the data collected or generated? Make sure to explain who collected the data (not necessarily the same people that host the data), and who or what the data is about?

This data publication contains a spatial database of wildfires that occurred in the United States from 1992 to 2015. This dataset was originally generated to support the national Fire Program Analysis (FPA) system. The wildfire records are acquired from the reporting systems of federal, state, and local fire organizations.

How many observations (rows) are in your data?

1.88 million rows

How many features (columns) are in the data?

130 columns

What questions (from above) can be answered using the data in this dataset?

  • The number of wildfires each year in the US and in each state, and how that number progresses over the year.
  • Which state has the maximum number of wildfires each year, and how that number progresses over the years.
  • Which states appear most frequently in the result of the previous question, which state appears the second most frequently, etc.

2 - Data set 2:

Where did you download the data (e.g., a web URL)?

https://archive.ics.uci.edu/ml/datasets/forest+fires

How was the data collected or generated? Make sure to explain who collected the data (not necessarily the same people that host the data), and who or what the data is about?

Paulo Cortez and Aníbal Morais, Department of Information Systems, University of Minho, Portugal. The data is about the total number of wildfires in the US.

How many observations (rows) are in your data?

518 rows

How many features (columns) are in the data?

13 columns

What questions (from above) can be answered using the data in this dataset?

The number of wildfires each year in the US and in each state, and how that number progresses over the year.

3 - Data set 3:

Where did you download the data (e.g., a web URL)?

https://www.kaggle.com/datasets/kkhandekar/total-wildfires-acres-affected-1983-2020

How was the data collected or generated? Make sure to explain who collected the data (not necessarily the same people that host the data), and who or what the data is about?

Total Wildland Fires and Acres (1983-2020). Prior to 1983, the federal wildland fire agencies did not track official wildfire data using current reporting processes. As a result, there is no official data prior to 1983.

How many observations (rows) are in your data?

37 rows

How many features (columns) are in the data?

9 columns

What questions (from above) can be answered using the data in this dataset?

  • Total number of fires in one year
  • Cost of fire suppression.

---END OF LINE---