Data Science Dojo
Copyright (c) 2016 - 2019
---
**Level:** Intermediate
**Recommended Use:** Classification/Regression/Clustering Models
**Domain:** Healthcare
## Liver Disorders Data Set
### Patterns relating liver disorder and alcohol consumption
---

---
This *intermediate* level data set has 345 rows and 7 columns, where the 7th column is not a variable but just a train/test selector. The headers have to be manually placed in the data set.
The dataset does not contain any variable representing presence or absence of a liver disorder.
This data set is recommended for learning and practicing your skills in **exploratory data analysis**, **data visualization**, **regression**, **classification** and **clustering** modelling techniques.
Feel free to explore the data set with multiple **supervised** and **unsupervised** learning techniques.
The Following data dictionary gives more details on this data set:
---
### Data Dictionary
| Column Position | Attribute Name | Definition | Data Type | Example | % Null Ratios |
|------------------- |--------------------------- |---------------------------------------------------------------------- |-------------- |--------------- |--------------- |
| 1 | Mcv | Mean Corpuscular Volume (Blood Test) | Quantitative | 85, 91, 96 | 0 |
| 2 | Alkphos | Alkaline Phosphotase (Blood Test) | Quantitative | 78, 55, 70 | 0 |
| 3 | Sgpt | Alamine Aminotransferase (Blood Test) | Quantitative | 45, 12, 15 | 0 |
| 4 | Sgot | Aspartate Aminotransferase (Blood Test) | Quantitative | 27, 17, 33 | 0 |
| 5 | Gammagt | Gamma-Glutamyl Transpeptidase (Blood Test) | Quantitative | 31, 54, 36 | 0 |
| 6 | Drinks | Number of half-pint equivalents of alcoholic beverages drunk per day | Quantitative | 0.5, 3.0, 8.0 | 0 |
| 7 | Selector (Not a variable) | Field used to split data into two sets | Quantitative | 1, 2 | 0 |
### Acknowledgement
This data set has been sourced from the Machine Learning Repository of University of California, Irvine [Liver Disorders Data Set (UC Irvine)](https://archive.ics.uci.edu/ml/datasets/Liver+Disorders).
The UCI page mentions BUPA Medical Research Ltd as the original source of the data set.