Incorporating Machine Learning into a Data Warehouse for Real-Time Construction Projects Benchmarking

  • Yin, Zhe (Construction Industry Institute, The University of Texas at Austin) ;
  • DeGezelle, Deborah (Construction Industry Institute, The University of Texas at Austin) ;
  • Hirota, Kazuma (Walker Department of Mechanical Engineering, The University of Texas at Austin) ;
  • Choi, Jiyong (Dept. of Manufacturing and Construction Management, Central Connecticut State University)
  • Published : 2022.06.20

Abstract

Machine Learning is a process of using computer algorithms to extract information from raw data to solve complex problems in a data-rich environment. It has been used in the construction industry by both academics and practitioners for multiple applications to improve the construction process. The Construction Industry Institute, a leading construction research organization has twenty-five years of experience in benchmarking capital projects in the industry. The organization is at an advantage to develop useful machine learning applications because it possesses enormous real construction data. Its benchmarking programs have been actively used by owner and contractor companies today to assess their capital projects' performance. A credible benchmarking program requires statistically valid data without subjective interference in the program administration. In developing the next-generation benchmarking program, the Data Warehouse, the organization aims to use machine learning algorithms to minimize human effort and to enable rapid data ingestion from diverse sources with data validity and reliability. This research effort uses a focus group comprised of practitioners from the construction industry and data scientists from a variety of disciplines. The group collaborated to identify the machine learning requirements and potential applications in the program. Technical and domain experts worked to select appropriate algorithms to support the business objectives. This paper presents initial steps in a chain of what is expected to be numerous learning algorithms to support high-performance computing, a fully automated performance benchmarking system.

Keywords

Acknowledgement

This research wouldn't make a progress without the support of the focus group consisting of volunteering industry professionals from the CII member companies. Researchers at CII are sincerely grateful for their dedicated efforts.