The Building Data Genome 2 (BDGE) dataset is a comprehensive collection of energy consumption data from over 1,000 buildings, intended for the analysis and prediction of energy usage in educational facilities. Spanning a two-year period, this dataset provides a detailed view of energy dynamics, supporting efforts to enhance sustainability and efficiency in educational buildings. The data can be found in https://drive.google.com/drive/u/0/folders/1URKQ9QYbfpu5EALuo007US2g1Z0H-Uvj
The BDGE dataset consists of three primary components:
Building Metadata: Contains information about the buildings, including their types, locations, and specific characteristics that may influence energy consumption (e.g., floor area, building age, and facility type).
Meter Data: Includes readings from various energy meters installed in the buildings, capturing consumption data for different types of energy sources - electricity, chilled water, steam, hot water, and gas. The meter data is detailed, showing usage over time, typically recorded on an hourly basis.
Weather Information: Provides local weather data relevant to the buildings' locations. This part of the dataset includes temperature, humidity, wind speed, and other weather-related variables that can significantly affect energy consumption patterns.
- The data cleaning process can be found in Data Pre Processing & EDA/.ipynb_checkpoints folder
- The modelling process can be found in the Modelling folder
- The full report can be found under the Presentation & Report folder