Utilizing Domain Knowledge: Robust Machine Learning for Building Energy Prediction with Small, Inconsistent Datasets

Chen, Xia; Singh, Manav Mahan; Geyer, Philipp

Computer Science > Machine Learning

arXiv:2302.10784 (cs)

[Submitted on 23 Jan 2023 (v1), last revised 3 Mar 2023 (this version, v2)]

Title:Utilizing Domain Knowledge: Robust Machine Learning for Building Energy Prediction with Small, Inconsistent Datasets

Authors:Xia Chen, Manav Mahan Singh, Philipp Geyer

View PDF

Abstract:The demand for a huge amount of data for machine learning (ML) applications is currently a bottleneck in an empirically dominated field. We propose a method to combine prior knowledge with data-driven methods to significantly reduce their data dependency. In this study, component-based machine learning (CBML) as the knowledge-encoded data-driven method is examined in the context of energy-efficient building engineering. It encodes the abstraction of building structural knowledge as semantic information in the model organization. We design a case experiment to understand the efficacy of knowledge-encoded ML in sparse data input (1% - 0.0125% sampling rate). The result reveals its three advanced features compared with pure ML methods: 1. Significant improvement in the robustness of ML to extremely small-size and inconsistent datasets; 2. Efficient data utilization from different entities' record collections; 3. Characteristics of accepting incomplete data with high interpretability and reduced training time. All these features provide a promising path to alleviating the deployment bottleneck of data-intensive methods and contribute to efficient real-world data usage. Moreover, four necessary prerequisites are summarized in this study that ensures the target scenario benefits by combining prior knowledge and ML generalization.

Comments:	9 pages, 3 figures
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2302.10784 [cs.LG]
	(or arXiv:2302.10784v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2302.10784

Submission history

From: Xia Chen [view email]
[v1] Mon, 23 Jan 2023 08:56:11 UTC (1,559 KB)
[v2] Fri, 3 Mar 2023 16:01:49 UTC (1,559 KB)

Computer Science > Machine Learning

Title:Utilizing Domain Knowledge: Robust Machine Learning for Building Energy Prediction with Small, Inconsistent Datasets

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Utilizing Domain Knowledge: Robust Machine Learning for Building Energy Prediction with Small, Inconsistent Datasets

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators