Data Science Dojo
Copyright (c) 2016 - 2019
Level Intermediate
Recommended Use:Classification/Regression/Clustering Models
Domain: Healthcare
Liver Disorders Data Set
Observe relationship between liver disorder and alcohol consumption
This intermediate level data set has 398 rows and 9 columns and provides mileage, horsepower, model year, and other technical specifications for cars. This data set is recommended for learning and practicing your skills in exploratory data analysis, data visualization, and regression modelling techniques. Feel free to explore the data set with multiple supervised and unsupervised learning techniques. The Following data dictionary gives more details on this data set:
Data Dictionary
Column Position | Atrribute Name | Definition | Data Type | Example | % Null Ratios |
---|---|---|---|---|---|
1 | Mcv | Mean Corpuscular Volume (Blood Test) | Quantitative | 85, 91, 96 | 0 |
2 | Alkphos | Alkaline Phosphotase (Blood Test) | Quantitative | 78, 55, 70 | 0 |
3 | Sgpt | Alamine Aminotransferase (Blood Test) | Quantitative | 45, 12, 15 | 0 |
4 | Sgot | Aspartate Aminotransferase (Blood Test) | Quantitative | 27, 17, 33 | 0 |
5 | Gammagt | Gamma-Glutamyl Transpeptidase (Blood Test) | Quantitative | 31, 54, 36 | 0 |
6 | Drinks | Number of half-pint equivalents of alcoholic beverages drunk per day | Quantitative | 0.5, 3.0, 8.0 | 0 |
7 | Selector (Not a variable) | Field used to split data into two sets | Quantitative | 1, 2 | 0 |
Acknowledgement
This data set has been sourced from the Machine Learning Repository of University of California, Irvine Auto MPG Data Set (UC Irvine). The UCI page mentions StatLib (Carnegie Mellon University) as the original source of the data set.