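Benchmarking
Interpretability
Machine learning
Computer science
Overfitting
Artificial intelligence
Energy consumption
Benchmark (surveying)
Ensemble learning
Data mining
Engineering
Artificial neural network
Electrical engineering
Business
Marketing
Geography
Geodesy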
Authors
Xiaoyu Jin,Fu Xiao,Chong Zhang,Ao Li
Identifier
DOI:10.1016/j.enbuild.2022.111909
Abstract
Many countries have adopted building energy performance benchmarking as an effective tool for reducing energy consumption at the city or country level. Machine learning shows considerable promise for predicting energy consumption quickly and accurately from massive data, which makes it suitable for large-scale performance assessment. However, many datasets suffer from severe imbalance across building types. Because some building types have few samples, results for those types can be unfavorable, such as low prediction accuracy. Meanwhile, the poor interpretability of machine learning models makes it difficult to promote benchmarking frameworks based on them. This study therefore proposes a novel machine learning based building performance benchmarking framework with improved generalization and interpretability. A reliable and convenient data augmentation approach was established to overcome the data imbalance problem while avoiding overfitting. Superior results were obtained in case studies using three city-level open-source building datasets from two countries. A complete rating framework was also proposed, with explanations of results at the sample level. The performance of this rating framework was verified by comparison with other data-driven benchmarking frameworks. Moreover, the importance of variables was quantified and ranked, which can serve as a significant reference for data collectors and publishers. The results demonstrated that data augmentation can effectively solve the problem of data imbalance, which makes machine learning based benchmarking applicable to all types of buildings. The proposed GEIN benchmarking framework can also effectively address the issue of interpretability.