eScholarship
Open Access Publications from the University of California

UC Berkeley

UC Berkeley Previously Published Works

Using causal inference to avoid fallouts in data-driven parametric analysis: A case study in the architecture, engineering, and construction industry

Published Web Location

https://doi.org/10.1016/j.dibe.2023.100296
No data is associated with this publication.
Abstract

Decision-making in real-world implementations is increasingly shaped by a growing reliance on data-driven models. Recognizing the limitations of isolated methodologies (namely, the lack of domain understanding in data-driven models, the subjective nature of empirical knowledge, and the idealized assumptions in first-principles simulations), we explore their synergistic integration. We show the potential risk of biased results when data-driven models are used without causal analysis. Through a case study on energy consumption in building design, we demonstrate how causal analysis significantly enhances the modeling process, mitigating biases and spurious correlations. We conclude that: (a) assessing a data-driven model's accuracy, or screening it with domain knowledge, may not by itself rule out biased and spurious results; (b) feature selection for data-driven models should involve careful consideration of causal relationships, especially colliders; (c) integrating causal analysis results aids first-principles simulation design and parameter checking, helping to avoid cognitive biases. We advocate for the routine integration of causal inference within data-driven models in engineering practice, emphasizing its critical role in ensuring the models' reliability and real-world applicability.
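
Conclusion (b) rests on collider bias: a collider is a variable that is caused by both a feature and the outcome, and including it as a predictor can create an association between the two that does not exist causally. The following is a minimal synthetic sketch of that effect, not the paper's case study or data. The variables `x`, `y`, and `c`, the unit-variance Gaussian noise, and the helper names are all assumptions made for illustration.

```python
import random


def simulate(n=20000, seed=0):
    """Draw independent x and y, plus a collider c caused by both (assumed toy model)."""
    rng = random.Random(seed)
    x = [rng.gauss(0, 1) for _ in range(n)]
    y = [rng.gauss(0, 1) for _ in range(n)]  # y does NOT depend on x
    c = [xi + yi + rng.gauss(0, 1) for xi, yi in zip(x, y)]  # collider: x -> c <- y
    return x, y, c


def cov(a, b):
    """Population covariance of two equal-length sequences."""
    ma = sum(a) / len(a)
    mb = sum(b) / len(b)
    return sum((ai - ma) * (bi - mb) for ai, bi in zip(a, b)) / len(a)


def slope_simple(x, y):
    """OLS slope of y regressed on x alone (with intercept)."""
    return cov(x, y) / cov(x, x)


def slope_adjusted(x, c, y):
    """OLS coefficient on x when y is regressed on both x and c (with intercept)."""
    sxx, scc, sxc = cov(x, x), cov(c, c), cov(x, c)
    sxy, scy = cov(x, y), cov(c, y)
    det = sxx * scc - sxc ** 2
    return (scc * sxy - sxc * scy) / det


x, y, c = simulate()
unadjusted = slope_simple(x, y)      # collider left out of the model
adjusted = slope_adjusted(x, c, y)   # collider included as a feature

print(f"coefficient on x without the collider: {unadjusted:+.3f}")
print(f"coefficient on x with the collider:    {adjusted:+.3f}")
```

Under these assumed unit variances, the coefficient on `x` should sit near zero when `c` is left out and near -0.5 when `c` is included, even though `x` has no effect on `y` by construction. Adding `c` also improves the fit to `y`, so a model that includes the collider can look better on accuracy while reporting a spurious effect, which is consistent with conclusion (a).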
