README.md 3.22 KB
Newer Older
Rahim Rasool committed
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46
Data Science Dojo <br/>
Copyright (c) 2016 - 2019

---

**Level:** Intermediate <br/>
**Recommended Use:** Classification/Clustering <br/>
**Domain:** Business/Retail<br/> 

## Wholesale Customers Data Set 

### Discover patterns from spending data at wholesale 


---
![](349.jpg)
---

This *intermediate* level data set has 440 rows and 8 columns. 
The data set refers to clients of a wholesale distributor. It includes the annual spending in monetary units (m.u.) on diverse product categories.
This data set is recommended for learning and practicing your skills in **exploratory data analysis**, **data visualization**, **classification modelling** and **clustering**. 
Feel free to explore the data set with multiple **supervised** and **unsupervised** learning techniques. The Following data dictionary gives more details on this data set:

---

### Data Dictionary 

| Column   Position 	| Atrribute Name   	| Definition                                                                                     	| Data Type    	| Example            	| % Null Ratios 	|
|-------------------	|------------------	|------------------------------------------------------------------------------------------------	|--------------	|--------------------	|---------------	|
| 1                 	| Channel          	| Customers   Channel: Horeca (Hotel/Restaurant/Cafe) or Retail channel (1: Horeca, 2:   Retail) 	| Quantitative 	| 1, 2               	| 0             	|
| 2                 	| Region           	| Customers   Region: Lisnon, Oporto or Other (1: Lisnon, 2: Oporto, 3: Other)                   	| Quantitative 	| 1, 2, 3            	| 0             	|
| 3                 	| Fresh            	| Annual spending (monetary units) on fresh   products                                           	| Quantitative 	| 18291, 1640,   219 	| 0             	|
| 4                 	| Milk             	| Annual   spending (m.u.) on milk products                                                      	| Quantitative 	| 5139, 3259,   829  	| 0             	|
| 5                 	| Grocery          	| Annual   spending (m.u.) on grocery products                                                   	| Quantitative 	| 6532, 4042, 3      	| 0             	|
| 6                 	| Frozen           	| Annual   spending (m.u.) on frozen products                                                    	| Quantitative 	| 10643, 987, 6312   	| 0             	|
| 7                 	| Detergents_Paper 	| Annual   spending (m.u.) on detergents and paper products                                      	| Quantitative 	| 12034, 116, 3      	| 0             	|
| 8                 	| Delicassen       	| Annual   spending (m.u.) on and delicatessen products                                          	| Quantitative 	| 14472, 772, 120    	| 0             	|



### Acknowledgement


This data set has been sourced from the Machine Learning Repository of University of California, Irvine [Wholesale Customers Data Set (UC Irvine)](https://archive.ics.uci.edu/ml/datasets/Wholesale+customers).<br/> 
The UCI page mentions the following as the original source of the data set:<br/> 
*Abreu, N. (2011). Analise do perfil do cliente Recheio e desenvolvimento de um sistema promocional. Mestrado em Marketing, ISCTE-IUL, Lisbon*