Data Science Dojo
Copyright (c) 2016 - 2019
Level Intermediate
Recommended Use: Classification Models
Domain: Energy/Buildings
Occupancy Detection Data Set
Detect Occupancy through Light, Temperature, Humidity and CO2 sensors
This intermediate level data set has 20560 rows and 7 attributes which are divided into 3 data sets for training and testing. The data set provides experimental data used for binary classification (room occupancy of an office room) from Temperature, Humidity, Light and CO2. Ground-truth occupancy was obtained from time stamped pictures that were taken every minute. This data set is recommended for learning and practicing your skills in exploratory data analysis, data visualization, and classification modelling techniques. Feel free to explore the data set with multiple supervised and unsupervised learning techniques. The Following data dictionary gives more details on this data set:
Data Dictionary
Column Position | Atrribute Name | Definition | Data Type | Example | % Null Ratios |
---|---|---|---|---|---|
1 | Date | Date & time in year-month-day hour:minute:second format | Qualitative | 2/4/2015 17:57, 2/4/2015 17:55, 2/4/2015 18:06 | 0 |
2 | Temperature | Temperature in degree Celcius | Quantitative | 23.150, 23.075, 22.890 | 0 |
3 | Humidity | Relative humidity in percentage | Quantitative | 27.272000, 27.200000, 27.390000 | 0 |
4 | Light | Illuminance measurement in unit Lux | Quantitative | 426.0, 419.0, 0.0 | 0 |
5 | CO2 | CO2 in parts per million (ppm) | Quantitative | 489.666667, 495.500000, 534.500000 | 0 |
6 | HumidityRatio | Humadity ratio: Derived quantity from temperature and relative humidity, in kgwater-vapor/kg-air | Quantitative | 0.004986, 0.005088, 0.005203 | 0 |
7 | Occupancy | Occupied or not: 1 for occupied and 0 for not occupied | Quantitative | 1, 0 | 0 |
Acknowledgement
This data set has been sourced from the Machine Learning Repository of University of California, Irvine Occupancy Detection Data Set (UC Irvine). The UCI page mentions the following publication Accurate occupancy detection of an office room from light, temperature, humidity and CO2 measurements using statistical learning models. Luis M. Candanedo, VÃ