README.md 4.69 KB
Newer Older
Rahim Rasool committed
1
# Data Sets to Uplift your Skills 
Tarun Shrivas committed
2

Arham Akheel committed
3

Rahim Rasool committed
4 5 6 7 8
+ Data Science Dojo has added 30 data sets to this repository. 
+ The repository carries a diverse range of themes, difficulty levels, sizes and attributes. 
+ They offer hands-on practice to boost their skills in exploratory data analysis, data visualization, data wrangling and machine learning.
+ The data sets below have been sorted with increasing level of difficulty for convenience (Beginner, Intermediate, Advanced).

Rahim Rasool committed
9 10
![](21.jpg)

Rahim Rasool committed
11 12 13 14 15
##### In order to fork this repository, click on the link to the guide [How to fork a project](https://docs.gitlab.com/ee/gitlab-basics/fork-project.html) on GitLab.

---
### Beginner:

Rahim Rasool committed
16 17
[**Find out the age of Abalone from physical measurements**](Abalone)<br/>
Regression Models | Environment
Rahim Rasool committed
18

Rahim Rasool committed
19 20
[**Predict student's knowledge level**](User Knowledge Modeling)<br/>
Classification/Clustering | Education/Web
Rahim Rasool committed
21

Rahim Rasool committed
22 23
[**Can you predict the price of a house?**](Real Estate Valuation)<br/>
Regression Models | Real Estate
Rahim Rasool committed
24

Rahim Rasool committed
25 26
[**Can you estimate location from WIFI Signal Strength**](Wireless Indoor Localization)<br/>
Classification Models | Mobile/Location
Rahim Rasool committed
27

Rahim Rasool committed
28 29
[**Predict acceptability of a car**](Car Evaluation)<br/>
Classification Models | Automobile
Rahim Rasool committed
30

Rahim Rasool committed
31 32
[**Predict seminal quality of an individual**](Fertility)<br/>
Regression/Classification Models | Healthcare/Life
Rahim Rasool committed
33

Rahim Rasool committed
34 35
[**Estimate chance of bankruptcy from qualitative parameters by experts**](Qualitative Bankruptcy)<br/>
Classification Models | Finance/Banking
Rahim Rasool committed
36 37 38
---
### Intermediate:

Rahim Rasool committed
39 40
[**Can you predict the fuel-efficiency of a car?**](Auto MPG)<br/>
Regression Models | Automobiles
Rahim Rasool committed
41

Rahim Rasool committed
42 43
[**Was that chest pain an indicator of a heart disease**](Heart Disease)<br/>
Classification Models | Health Sciences
Rahim Rasool committed
44

Rahim Rasool committed
45 46
[**Predict total number of demand of orders**](Daily Demand Forecasting Orders)<br/>
Regression Models | Business
Rahim Rasool committed
47

Rahim Rasool committed
48 49
[**Find out if a donor will give blood in March 2007**](Blood Transfusion Service Center)<br/>
Classification Models | Business
Rahim Rasool committed
50

Rahim Rasool committed
51 52
[**Forecast pollution level of a city**](Beijing PM2.5)<br/>
Regression Models | Environment
Rahim Rasool committed
53

Rahim Rasool committed
54 55
[**Will the patient survive for at least one year after a heart attack**](Echocardiogram)<br/>
Classification Models | Automobiles
Rahim Rasool committed
56

Rahim Rasool committed
57 58
[**Estimate compressive strength of concrete**](Concrete Compressive Strength)<br/>
Regression Models | Civil Engineering/Construction
Rahim Rasool committed
59

Rahim Rasool committed
60 61
[**Discover patterns relating liver disorder and alcohol consumption**](Liver Disorders)<br/>
Classification/Regression/Clustering Models | Healthcare
Rahim Rasool committed
62

Rahim Rasool committed
63 64
[**Predict which stock will provide greatest rate of return**](Dow Jones Index)<br/>
Clustering/Regression/Classification Models | Business/Finance
Rahim Rasool committed
65

Rahim Rasool committed
66 67
[**Assess heating and cooling load requirements of building**](Energy Efficiency)<br/>
Regression/Classification Models | Energy
Rahim Rasool committed
68

Rahim Rasool committed
69 70
[**Determine the type of glass using oxide content**](Glass Identification)<br/>
Classification Models | Physical
Rahim Rasool committed
71

Rahim Rasool committed
72 73
[**Predict chance of survival**](Hepatitis)<br/>
Classification Models | Healthcare
Rahim Rasool committed
74

Rahim Rasool committed
75 76
[**Find patterns from spending data at wholesale**](Wholesale Customers)<br/>
Classification/Clustering | Business/Retail
Rahim Rasool committed
77

Rahim Rasool committed
78 79
[**Group similar travel reviews**](Travel Reviews)<br/>
Clustering/Classification Models | Domain: Web
Rahim Rasool committed
80

Rahim Rasool committed
81 82
[**Relate returns of Istanbul Stock Exchange with other international indices**](Istanbul Stock Exchange)<br/>
Regression/Classification Models | Business/Finance
Rahim Rasool committed
83

Rahim Rasool committed
84 85
[**Predict bike rental count (hourly/daily) based on the environmental & seasonal settings**](Bike Sharing)<br/>
Regression Models | Social
Rahim Rasool committed
86

Rahim Rasool committed
87 88
[**Detect Room Occupancy through Light, Temperature, Humidity and CO2 sensors**](Occupancy Detection)<br/>
Classification Models | Energy/Buildings
Rahim Rasool committed
89

Rahim Rasool committed
90 91
[**Estimate whether a person’s income exceeds $50K/year**](Census Income)<br/>
Classification Models | Social/Government
Rahim Rasool committed
92 93 94 95

---
### Advanced:

Rahim Rasool committed
96 97
[**Detect Autistic Spectrum Disorder (ASD) cases**](Autism Screening Adult)<br/>
Classification Models | Healthcare/Social Sciences
Rahim Rasool committed
98

Rahim Rasool committed
99 100
[**Estimate the probability of Default**](Default of Credit Card Clients)<br/>
Classification Models | Business/Finance
Rahim Rasool committed
101

Rahim Rasool committed
102 103
[**Predict if a note is genuine**](Banknote Authentication)<br/>
Classification Models | Banking/Finance
Rahim Rasool committed
104

Rahim Rasool committed
105 106
[**Find a short term forecast on electricity consumption of a single home**](Individual Household Electric Power Consumption)<br/>
Regression/Clustering Models | Electricity
Rahim Rasool committed
107

Rahim Rasool committed
108 109
[**Predict the number of shares on social networks**](Online News Popularity)<br/>
Regression/Classification Models | Business/Web
Tarun Shrivas committed
110

Rahim Rasool committed
111 112
---

Rahim Rasool committed
113
### Queries:
Tarun Shrivas committed
114

Rahim Rasool committed
115 116
**Can I use these datasets for my project?**<br/>
Sure! You're totally free to do so.
Tarun Shrivas committed
117

Rahim Rasool committed
118 119
**Can i add a dataset here**<br/>
Send us a pull request and we'll discuss
Tarun Shrivas committed
120

Rahim Rasool committed
121 122
**There seems to be a problem here.**<br/>
If you find an issue, kindly raise it using help of this [link](https://docs.gitlab.com/ee/user/project/issues/create_new_issue.html)
Tarun Shrivas committed
123 124