Group B-25: Joshua Chen, Derek Lu, Benedict Wang, Fan Bu
-
Why are you interested in this field/domain?
We are interested in this field because we want to see the war through the eyes of statistics. Apart from acquiring informaiton from the news media and other text-based contents, we try to collect some data and bring a data-driven persepective to the audiences. The cause of a war is complicated and the impacts on both countries is significant and extensive. To have a deeper understanding of the war, we plan to interpret the war with the dataset on the internet. -
What other examples of data driven project have you found related to this domain (share at least 3)?
-
The first data driven project I found was the analysis and summary about the exchange rate of Russian Ruble to the other major currencies. The research utilized data since 25 years ago to make a line graph demonstrating the fluctuation of the Russian Ruble. It also included forecast for the next year and a table comparing the daily change of Ruble's exchange rate with different currencies.
Source -
The second data driven project pointed out the issue of food price surge. The report included line graph documenting the oil and cereal prices since January , 2020. It also analyzed the context that gave rise to the ups and downs.
Source -
The third projects I found rocorded the cross-border movement of refugees from Ukraine to the neighboring countries. It portrayed a graph indicating the cumulative number of people fleeing from Ukraine. Moreover, there were a table listing the countries accepting the refugees.
Source
- What data-driven questions do you hope to answer about this domain (share at least 3)?
- How is the exchange rate of Russia Ruble to Dollar impacted by the Ukraine war?
- How much is the equipment loss of the troops every day?
- What is the reaction of the war on social meida?
-
Where did you download the data (e.g., a web URL)?
I downloaded from this website. -
How was the data collected or generated? Make sure to explain who collected the data (not necessarily the same people that host the data), and who or what the data is about?
Those data were extracted from investing.com. The investing.com collects market value of Ethereum in each week and construct them into a list. Those data are related to the price of Ethereum, which is originated in Russian and soon became one the largest cryptocurrency in the world. They are generated based on the open, close, high, and lows in the market price for the currency as well as the volume of a week. It is generated to help people understand the price change of Ethereum over years. -
How many observations (rows) are in your data?
There are 2229 observation done in this data. -
How many features (columns) are in the data?
There are 6 features in this data -
What questions (from above) can be answered using the data in this dataset?
I can found out the change of Ethereum price during the Ukraine Russia War with calculation of highs and lows, as well as market volume.
-
Where did you download the data (e.g., a web URL)?
Kaggle -
How was the data collected or generated? Make sure to explain who collected the data (not necessarily the same people that host the data), and who or what the data is about?
The data is collected by Armed Forces of Ukraine and Ministry of Defence of Ukraine. This data is about the equipment losses of Russia on each date. Equipment includes tank, aircraft, and helicopters ect. -
How many observations (rows) are in your data?
64 rows, which is the days of the war -
How many features (columns) are in the data?
15 columns, including different types of equipment. -
What questions (from above) can be answered using the data in this dataset?
What is the average amount of losses in a day? What type of equipment lost the most in the war?
-
Where did you download the data (e.g., a web URL)?
I downloaded from Kaggle. -
How was the data collected or generated? Make sure to explain who collected the data (not necessarily the same people that host the data), and who or what the data is about?
The data were collected from Tweeter. The data were collected by Towhidul Tonmoy. The dataset collects tweets regarding Russia and Ukraine from 21st February till date. -
How many observations (rows) are in your data?
There are 10013 observations in the data. -
How many features (columns) are in the data?
There are 36 features in the data. -
What questions (from above) can be answered using the data in this dataset?
I can find out the public pressure and the heat of this special topic on social media.