Commit 49f5cc33 by Rahim Rasool

Update README.md

parent d108f3a6
##### Datasets # Data Sets to Uplift your Skills
This is a test commit
-Arham
+ Data Science Dojo has added 30 data sets to this repository.
+ The repository carries a diverse range of themes, difficulty levels, sizes and attributes.
+ They offer hands-on practice to boost their skills in exploratory data analysis, data visualization, data wrangling and machine learning.
+ The data sets below have been sorted with increasing level of difficulty for convenience (Beginner, Intermediate, Advanced).
##### In order to fork this repository, click on the link to the guide [How to fork a project](https://docs.gitlab.com/ee/gitlab-basics/fork-project.html) on GitLab.
---
### Beginner:
1) **Find out the age of Abalone from physical measurements**<br/>
Recommended Use: Regression Models<br/>
Domain: Environment
2) **Predict student's knowledge level**<br/>
Recommended Use: Classification/Clustering<br/>
Domain: Education/Web
3) **Can you predict the price of a house?**<br/>
Recommended Use: Regression Models<br/>
Domain: Real Estate
4) **Can you estimate location from WIFI Signal Strength**<br/>
Recommended Use: Classification Models<br/>
Domain: Mobile/Location
5) **Predict acceptability of a car**<br/>
Recommended Use: Classification Models<br/>
Domain: Automobile
6) **Predict seminal quality of an individual**<br/>
Recommended Use: Regression/Classification Models<br/>
Domain: Healthcare/Life
7) **Estimate chance of bankruptcy from qualitative parameters by experts**<br/>
Recommended Use: Classification Models<br/>
Domain: Finance/Banking
---
### Intermediate:
8) **Can you predict the fuel-efficiency of a car?**<br/>
Recommended Use: Regression Models<br/>
Domain: Automobiles
9) **Was that chest pain an indicator of a heart disease**<br/>
Recommended Use: Classification Models<br/>
Domain: Health Sciences
10) **Predict total number of demand of orders**<br/>
Recommended Use: Regression Models<br/>
Domain: Business
11) **Find out if a donor will give blood in March 2007**<br/>
Recommended Use: Classification Models<br/>
Domain: Business
12) **Forecast pollution level of a city**<br/>
Recommended Use: Regression Models<br/>
Domain: Environment
13) **Will the patient survive for at least one year after a heart attack**<br/>
Recommended Use: Classification Models<br/>
Domain: Automobiles
14) **Estimate compressive strength of concrete**<br/>
Recommended Use: Regression Models<br/>
Domain: Civil Engineering/Construction
15) **Discover patterns relating liver disorder and alcohol consumption**<br/>
Recommended Use: Classification/Regression/Clustering Models<br/>
Domain: Healthcare
16) **Predict which stock will provide greatest rate of return**<br/>
Recommended Use: Clustering/Regression/Classification Models<br/>
Domain: Business/Finance
17) **Assess heating and cooling load requirements of building**<br/>
Recommended Use: Regression/Classification Models<br/>
Domain: Energy
18) **Determine the type of glass using oxide content**<br/>
Recommended Use: Classification Models<br/>
Domain: Physical
19) **Predict chance of survival**<br/>
Recommended Use: Classification Models<br/>
Domain: Healthcare
20) **Find patterns from spending data at wholesale**<br/>
Recommended Use: Classification/Clustering<br/>
Domain: Business/Retail
21) **Group similar travel reviews**<br/>
Recommended Use: Clustering/Classification Models<br/>
Domain: Web
22) **Relate returns of Istanbul Stock Exchange with other international indices**<br/>
Recommended Use: Regression/Classification Models<br/>
Domain: Business/Finance
23) **Predict bike rental count (hourly/daily) based on the environmental & seasonal settings**<br/>
Recommended Use: Regression Models<br/>
Domain: Social
24) **Detect Room Occupancy through Light, Temperature, Humidity and CO2 sensors**<br/>
Recommended Use: Classification Models<br/>
Domain: Energy/Buildings
25) **Estimate whether a person’s income exceeds $50K/year**<br/>
Recommended Use: Classification Models<br/>
Domain: Social/Government
---
### Advanced:
26) **Detect Autistic Spectrum Disorder (ASD) cases**<br/>
Recommended Use: Classification Models<br/>
Domain: Healthcare/Social Sciences
27) **Estimate the probability of Default**<br/>
Recommended Use: Classification Models<br/>
Domain: Business/Finance
28) **Predict if a note is genuine**<br/>
Recommended Use: Classification Models<br/>
Domain: Banking/Finance
29) **Find a short term forecast on electricity consumption of a single home**<br/>
Recommended Use: Regression/Clustering Models<br/>
Domain: Electricity
30) **Predict the number of shares on social networks**<br/>
Recommended Use: Regression/Classification Models<br/>
Domain: Business/Web
......
Markdown is supported
0% or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment