Data Science Dojo <br/>
Copyright (c) 2019 - 2020

---

**Level:** Intermediate <br/>
**Recommended Use:** Classification Models<br/>
**Domain:** Health Sciences<br/> 

## Coronavirus Data Set 

### Track the outbreak of coronavirus (COVID-19)


---
![](coronavirus.jpg)
---

The recent outbreak of the novel coronavirus has caused great concern all around the world. It has affected tens of thousands of people, mostly in China.
The outbreak, originating in the Chinese city of Wuhan, has been declared a global emergency by the World Health Organization (WHO).

This data set consists of 4 files and was collected from various sources.
The data starts on 22 January 2020 and was last updated on 3 March 2020.
The first file, **covid_19_data.csv**, contains daily-level information on the number of 2019-nCoV cases across the globe.
The next 3 files contain time series data of confirmed cases, deaths, and recovered cases, respectively.

The data dictionary for the first file is provided below.


---

### Data Dictionary 

| Column Position | Attribute Name   | Definition                                                                  |
|-----------------|------------------|-----------------------------------------------------------------------------|
| 1               | Sno              | Serial number                                                               |
| 2               | ObservationDate  | Date and time of the observation in MM/DD/YYYY HH:MM:SS                     |
| 3               | Province / State | Province or state of the observation                                        |
| 4               | Country / Region | Country or region of the observation                                        |
| 5               | Last Update      | Time in UTC at which the row was updated for the given province or country  |
| 6               | Confirmed        | Number of confirmed cases                                                   |
| 7               | Deaths           | Number of deaths                                                            |
| 8               | Recovered        | Number of recovered cases                                                   |
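
As an illustration of how the columns above might be used, here is a minimal sketch that reads the first file with Python's standard `csv` module and sums the counts per country for the most recent observation date. It assumes that the CSV header uses exactly the attribute names from the data dictionary (the real header spelling may differ, for example in the spacing around the slash), that counts may be stored as values like `12.0`, and that a date-only `MM/DD/YYYY` value is an acceptable fallback. The function names and the sample rows are made up for this example.

```python
import csv
import io
from collections import defaultdict
from datetime import datetime

# Column names as listed in the data dictionary. The exact header spelling
# in covid_19_data.csv is an assumption and may need adjusting.
DATE_COL = "ObservationDate"
COUNTRY_COL = "Country / Region"
COUNT_COLS = ("Confirmed", "Deaths", "Recovered")

# Formats tried in order; the date-only fallback is an assumption.
DATE_FORMATS = ("%m/%d/%Y %H:%M:%S", "%m/%d/%Y")


def parse_observation_date(text):
    """Parse an ObservationDate string into a datetime."""
    for fmt in DATE_FORMATS:
        try:
            return datetime.strptime(text.strip(), fmt)
        except ValueError:
            continue
    raise ValueError(f"Unrecognized date: {text!r}")


def latest_totals_by_country(csv_file):
    """Sum the counts per country for the most recent observation date."""
    rows = list(csv.DictReader(csv_file))
    if not rows:
        return {}
    latest = max(parse_observation_date(r[DATE_COL]).date() for r in rows)
    totals = defaultdict(lambda: dict.fromkeys(COUNT_COLS, 0))
    for r in rows:
        if parse_observation_date(r[DATE_COL]).date() != latest:
            continue
        for col in COUNT_COLS:
            # Counts may be stored as "12.0", so go through float first.
            totals[r[COUNTRY_COL]][col] += int(float(r[col] or 0))
    return dict(totals)


# Made-up placeholder rows, only to show the expected shape of the file.
SAMPLE = """Sno,ObservationDate,Province / State,Country / Region,Last Update,Confirmed,Deaths,Recovered
1,01/22/2020 00:00:00,Province A,Country X,1/22/2020 17:00,1.0,0.0,0.0
2,01/23/2020 00:00:00,Province A,Country X,1/23/2020 17:00,5.0,1.0,0.0
3,01/23/2020 00:00:00,Province B,Country X,1/23/2020 17:00,2.0,0.0,1.0
4,01/23/2020 00:00:00,,Country Y,1/23/2020 17:00,3.0,0.0,0.0
"""

if __name__ == "__main__":
    print(latest_totals_by_country(io.StringIO(SAMPLE)))
```

To run this against the real file, pass an open file handle instead of the sample, for example `open("covid_19_data.csv", newline="")`, and adjust the column constants to match the header.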

---

### Merged Data Sources

To help people stay informed about Coronavirus (COVID-19), Data Science Dojo has merged several data sources of daily reports on the situation. This merged dataset includes not only the latest case and death counts, but also travel advisory levels and travel restrictions, by country. It will be updated daily, with past days of the data archived in CSV format in our public git repository.  
This single view of the current status of Coronavirus is pulled, cleaned and merged daily from the World Health Organization reports, US Department of State Travel Advisories, Smart Traveller, and CNN news service. 
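
The merge step described above can be pictured as a join on the country name. The following is only a sketch under stated assumptions, not the pipeline actually used: the field names (`country`, `cases`, `deaths`, `advisory_level`), both function names, and the sample records are hypothetical. It cleans the country names lightly so that the same country matches across sources, and keeps countries that have no advisory entry.

```python
def normalize_country(name):
    """Collapse whitespace and ignore case so names match across sources."""
    return " ".join(name.split()).casefold()


def merge_by_country(case_rows, advisory_rows):
    """Left-join case counts with travel advisories on the country name."""
    advisories = {normalize_country(r["country"]): r for r in advisory_rows}
    merged = []
    for row in case_rows:
        advisory = advisories.get(normalize_country(row["country"]), {})
        merged.append({
            "country": row["country"].strip(),
            "cases": row["cases"],
            "deaths": row["deaths"],
            # None marks a country with no advisory in the second source.
            "advisory_level": advisory.get("advisory_level"),
        })
    return merged


# Hypothetical placeholder records, not real figures.
CASES = [
    {"country": " Country X", "cases": 7, "deaths": 1},
    {"country": "Country Y", "cases": 3, "deaths": 0},
]
ADVISORIES = [
    {"country": "country x", "advisory_level": "Level 3"},
]

if __name__ == "__main__":
    for record in merge_by_country(CASES, ADVISORIES):
        print(record)
```

A left join is used here so that a country with reported cases is never dropped just because one source has no entry for it.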

### Acknowledgement


This data set has been sourced from [Kaggle](https://www.kaggle.com/sudalairajkumar/novel-corona-virus-2019-dataset) and [Johns Hopkins University](https://github.com/CSSEGISandData/COVID-19). 
This data set is provided to the public strictly for educational and academic research purposes.