README.md 3.22 KB
Newer Older
Rahim Rasool committed
1 2 3 4 5 6 7 8 9 10 11
Data Science Dojo <br/>
Copyright (c) 2016 - 2019

---

**Level:** Intermediate <br/>
**Recommended Use:** Classification/Clustering <br/>
**Domain:** Business/Retail<br/> 

## Wholesale Customers Data Set 

12
### Find patterns from spending data at wholesale 
Rahim Rasool committed
13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46


---
![](349.jpg)
---

This *intermediate* level data set has 440 rows and 8 columns. 
The data set refers to clients of a wholesale distributor. It includes the annual spending in monetary units (m.u.) on diverse product categories.
This data set is recommended for learning and practicing your skills in **exploratory data analysis**, **data visualization**, **classification modelling** and **clustering**. 
Feel free to explore the data set with multiple **supervised** and **unsupervised** learning techniques. The Following data dictionary gives more details on this data set:

---

### Data Dictionary 

| Column   Position 	| Atrribute Name   	| Definition                                                                                     	| Data Type    	| Example            	| % Null Ratios 	|
|-------------------	|------------------	|------------------------------------------------------------------------------------------------	|--------------	|--------------------	|---------------	|
| 1                 	| Channel          	| Customers   Channel: Horeca (Hotel/Restaurant/Cafe) or Retail channel (1: Horeca, 2:   Retail) 	| Quantitative 	| 1, 2               	| 0             	|
| 2                 	| Region           	| Customers   Region: Lisnon, Oporto or Other (1: Lisnon, 2: Oporto, 3: Other)                   	| Quantitative 	| 1, 2, 3            	| 0             	|
| 3                 	| Fresh            	| Annual spending (monetary units) on fresh   products                                           	| Quantitative 	| 18291, 1640,   219 	| 0             	|
| 4                 	| Milk             	| Annual   spending (m.u.) on milk products                                                      	| Quantitative 	| 5139, 3259,   829  	| 0             	|
| 5                 	| Grocery          	| Annual   spending (m.u.) on grocery products                                                   	| Quantitative 	| 6532, 4042, 3      	| 0             	|
| 6                 	| Frozen           	| Annual   spending (m.u.) on frozen products                                                    	| Quantitative 	| 10643, 987, 6312   	| 0             	|
| 7                 	| Detergents_Paper 	| Annual   spending (m.u.) on detergents and paper products                                      	| Quantitative 	| 12034, 116, 3      	| 0             	|
| 8                 	| Delicassen       	| Annual   spending (m.u.) on and delicatessen products                                          	| Quantitative 	| 14472, 772, 120    	| 0             	|



### Acknowledgement


This data set has been sourced from the Machine Learning Repository of University of California, Irvine [Wholesale Customers Data Set (UC Irvine)](https://archive.ics.uci.edu/ml/datasets/Wholesale+customers).<br/> 
The UCI page mentions the following as the original source of the data set:<br/> 
*Abreu, N. (2011). Analise do perfil do cliente Recheio e desenvolvimento de um sistema promocional. Mestrado em Marketing, ISCTE-IUL, Lisbon*