Data Science Dojo
Copyright (c) 2016 - 2019


Level: Intermediate
Recommended Use: Classification/Clustering
Domain: Business/Retail

Wholesale Customers Data Set

Find patterns from spending data at wholesale


This intermediate level data set has 440 rows and 8 columns. The data set refers to clients of a wholesale distributor. It includes the annual spending in monetary units (m.u.) on diverse product categories. This data set is recommended for learning and practicing your skills in exploratory data analysis, data visualization, classification modelling and clustering. Feel free to explore the data set with multiple supervised and unsupervised learning techniques. The Following data dictionary gives more details on this data set:


Data Dictionary

Column Position Atrribute Name Definition Data Type Example % Null Ratios
1 Channel Customers Channel: Horeca (Hotel/Restaurant/Cafe) or Retail channel (1: Horeca, 2: Retail) Quantitative 1, 2 0
2 Region Customers Region: Lisnon, Oporto or Other (1: Lisnon, 2: Oporto, 3: Other) Quantitative 1, 2, 3 0
3 Fresh Annual spending (monetary units) on fresh products Quantitative 18291, 1640, 219 0
4 Milk Annual spending (m.u.) on milk products Quantitative 5139, 3259, 829 0
5 Grocery Annual spending (m.u.) on grocery products Quantitative 6532, 4042, 3 0
6 Frozen Annual spending (m.u.) on frozen products Quantitative 10643, 987, 6312 0
7 Detergents_Paper Annual spending (m.u.) on detergents and paper products Quantitative 12034, 116, 3 0
8 Delicassen Annual spending (m.u.) on and delicatessen products Quantitative 14472, 772, 120 0

Acknowledgement

This data set has been sourced from the Machine Learning Repository of University of California, Irvine Wholesale Customers Data Set (UC Irvine).
The UCI page mentions the following as the original source of the data set:
Abreu, N. (2011). Analise do perfil do cliente Recheio e desenvolvimento de um sistema promocional. Mestrado em Marketing, ISCTE-IUL, Lisbon