rukayah/Data_Analysis_with_Volcano_Data

Data_Analysis_with_Volcano_Data

Introduction

From Wikipedia:

A volcano is a rupture in the crust of a planetary-mass object, such as Earth, that allows hot lava, volcanic ash, and gases to escape from a magma chamber below the surface.

Challenge

There are five sources of data for analyzing the impact of volcanic eruptions, identifying which countries are more prone to this disaster, and deciding how to take preventive measures. The data has to be combined to obtain broad insights for informed decision making.
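
Combining the files could look like the minimal sketch below, which joins eruption rows to volcano rows on the shared `volcano_number` column listed in the data dictionary. The inline CSV text, the placeholder names such as "Example A", the category strings, and the helper names `read_rows` and `join_on_volcano_number` are all assumptions made for illustration; they are not taken from the repository or the real files.

```python
import csv
import io

# Toy stand-ins for volcano.csv and eruption.csv. Every row is invented
# for illustration and is NOT a real record.
VOLCANO_CSV = """volcano_number,volcano_name,country,primary_volcano_type
1,Example A,Country X,Stratovolcano
2,Example B,Country Y,Shield
"""

ERUPTION_CSV = """volcano_number,eruption_number,eruption_category,start_year
1,100,confirmed,1900
1,101,uncertain,1950
2,102,confirmed,1980
3,103,confirmed,1990
"""


def read_rows(text):
    """Parse CSV text into a list of dicts keyed by column name."""
    return list(csv.DictReader(io.StringIO(text)))


def join_on_volcano_number(volcanoes, eruptions):
    """Inner join: attach volcano attributes to each eruption row."""
    by_id = {v["volcano_number"]: v for v in volcanoes}
    joined = []
    for eruption in eruptions:
        volcano = by_id.get(eruption["volcano_number"])
        if volcano is None:
            continue  # eruption refers to a volcano missing from the lookup
        joined.append({**volcano, **eruption})
    return joined


combined = join_on_volcano_number(read_rows(VOLCANO_CSV), read_rows(ERUPTION_CSV))
for row in combined:
    print(row["volcano_name"], row["country"], row["eruption_category"], row["start_year"])
```

With the real files, `read_rows` would be replaced by opening each CSV from disk. The join drops eruptions whose `volcano_number` has no match; keeping them instead (a left join) is an equally reasonable choice, depending on the question being asked.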

The data is originally from the Smithsonian Institution.

Exploratory Data Analysis

With the help of data analysis and visualization, we can find trends in which countries have the most volcanic occurrences and look for measures to mitigate such disasters in the future.

Conclusions and Recommendations

In the volcano data (volcano.csv), 44.6% of all recorded volcanic eruption occurrences actually happened; the other categories cover uncertain eruptions and unrest. The United States, which has the highest number of volcanic eruptions since 1883, deserves particular attention, especially Oregon and California, which have the highest occurrence in the country. Stratovolcano is the volcano type with the highest occurrence over the same period.
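
Figures like the percentage and the per-country ranking above can be obtained by counting category values. The sketch below shows one way to do that with the standard library, assuming the share is computed over the `evidence_category` column of volcano.csv. The rows, the category labels ("observed", "uncertain", "unrest"), and the helper name `category_shares` are hypothetical and do not reproduce the reported numbers.

```python
from collections import Counter

# Invented rows for illustration only; real rows would come from volcano.csv.
rows = [
    {"country": "Country X", "evidence_category": "observed"},
    {"country": "Country X", "evidence_category": "uncertain"},
    {"country": "Country Y", "evidence_category": "observed"},
    {"country": "Country X", "evidence_category": "observed"},
]


def category_shares(rows, column):
    """Return the percentage of rows falling into each value of `column`."""
    counts = Counter(row[column] for row in rows)
    total = sum(counts.values())
    return {value: 100.0 * n / total for value, n in counts.items()}


shares = category_shares(rows, "evidence_category")

# Rank countries by the number of rows in the "observed" category.
top_countries = Counter(
    row["country"] for row in rows if row["evidence_category"] == "observed"
).most_common()

print(shares)
print(top_countries)
```

The same counting pattern applies to `primary_volcano_type` when looking for the most frequent volcano type.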

Data Dictionary

volcano.csv

| variable | class | description |
| --- | --- | --- |
| volcano_number | double | Volcano unique ID |
| volcano_name | character | Volcano name |
| primary_volcano_type | character | Volcano type (see Wikipedia above for full details) |
| last_eruption_year | character | Last year erupted |
| country | character | Country |
| region | character | Region |
| subregion | character | Sub region |
| latitude | double | Latitude |
| longitude | double | Longitude |
| elevation | double | Elevation |
| tectonic_settings | character | Plate tectonic settings (subduction, intraplate, rift zone) + crust |
| evidence_category | character | Type of evidence |
| major_rock_1 | character | Major rock type |
| major_rock_2 | character | Major rock type |
| major_rock_3 | character | Major rock type |
| major_rock_4 | character | Major rock type |
| major_rock_5 | character | Major rock type |
| minor_rock_1 | character | Minor rock type |
| minor_rock_2 | character | Minor rock type |
| minor_rock_3 | character | Minor rock type |
| minor_rock_4 | character | Minor rock type |
| minor_rock_5 | character | Minor rock type |
| population_within_5_km | double | Total population within 5 km of volcano |
| population_within_10_km | double | Total population within 10 km of volcano |
| population_within_30_km | double | Total population within 30 km of volcano |
| population_within_100_km | double | Total population within 100 km of volcano |

eruption.csv

| variable | class | description |
| --- | --- | --- |
| volcano_number | double | Volcano unique ID |
| volcano_name | character | Volcano name |
| eruption_number | double | Eruption number |
| eruption_category | character | Type of eruption |
| area_of_activity | character | Area of activity |
| vei | double | Volcanic Explosivity Index (0-8); see Wikipedia |
| start_year | double | Start year |
| start_month | double | Start month |
| start_day | double | Start day |
| evidence_method_dating | character | Evidence for dating volcano eruption |
| end_year | double | End year |
| end_month | double | End month |
| end_day | double | End day |
| latitude | double | Latitude |
| longitude | double | Longitude |

events.csv

| variable | class | description |
| --- | --- | --- |
| volcano_number | double | Volcano unique ID |
| volcano_name | character | Volcano name |
| eruption_number | double | Eruption number |
| eruption_start_year | double | Eruption start year |
| event_number | double | Event number |
| event_type | character | Event type |
| event_remarks | character | Event remarks |
| event_date_year | double | Event year |
| event_date_month | double | Event month |
| event_date_day | double | Event day |

tree_rings.csv

| variable | class | description |
| --- | --- | --- |
| year | integer | Year of observation CE |
| n_tree | double | Tree ring z-scores relative to year = 1000-1099 (a z-score is a measure of variability from the mean, either positive or negative) |
| europe_temp_index | double | Pages 2K Temperature for Europe in Celsius relative to 1961 to 1990 |
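
To make the z-score description for `n_tree` concrete, the sketch below standardizes a series against a baseline window, i.e. (value - baseline mean) / baseline standard deviation. It is only an illustration: the input values are invented, the helper name `z_scores` is hypothetical, and using the population standard deviation is an assumption, since the source does not say how the published z-scores were computed.

```python
from statistics import mean, pstdev


def z_scores(values_by_year, base_start, base_end):
    """Standardize each value against the mean and standard deviation
    of the years in [base_start, base_end] (inclusive)."""
    baseline = [v for year, v in values_by_year.items() if base_start <= year <= base_end]
    mu = mean(baseline)
    sigma = pstdev(baseline)  # assumption: population standard deviation
    if sigma == 0:
        raise ValueError("baseline has no variability; z-score is undefined")
    return {year: (v - mu) / sigma for year, v in values_by_year.items()}


# Invented raw measurements keyed by year; not taken from tree_rings.csv.
raw = {1000: 1.0, 1001: 3.0, 1100: 4.0}
scores = z_scores(raw, 1000, 1099)
print(scores)
```

A positive score means the value lies above the baseline mean and a negative score means it lies below, measured in baseline standard deviations.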

sulfur.csv

| variable | class | description |
| --- | --- | --- |
| year | double | Year w/ decimal CE |
| neem | double | Sulfur detected in ng/g from NEEM, ice cores from Greenland; data collected from melting ice cores, data range was 500 to 705 CE |
| wdc | double | Sulfur detected in ng/g from WDC, ice cores from Antarctica; data collected from melting ice cores, data range was 500 to 705 CE |
