1 Introduction
People spend about 90% of their time in the indoor environment that encompass all facets of our lives, such as homes, offices, schools, and stores (
Popkin, 1999;
Klepeis et al., 2001). Buildings harbor various biotic and abiotic exposomes, including bacteria, viruses, fungi, particulate matter 2.5 (PM
2.5), and volatile organic compounds (VOCs), which circulate in the buildings’ air and on surfaces from the most inaccessible to most touched places. Transmission of and exposure to these biotic and abiotic exposomes have significant impacts on human health. For example, the coronavirus disease 2019 (COVID-19) pandemic has swept the entire world, infecting more than 250 million people with the death toll surpassing 5 million worldwide as of November 2021 (
Worldometer, 2021). The airborne and fomite-based transmission of SARS-CoV-2 and many other pathogens in the mass-gathering building environment is a significant concern as it has led to significant health and economic loss worldwide. Chemical exposure in buildings, such as VOCs and PM
2.5, also affects human health. Exposure to these pollutants may cause sick building syndrome, a collection of nonspecific syndromes linked to the indoor environment, the symptoms of which have an annual cost of $60 billion due to reduced worker productivity (
Fisk, 2002). Given the significant impacts of indoor exposure on human health, there is a crucial need to develop effective means of intelligently preventing the transmission of pathogens and people’s exposure to harmful pollutants in buildings.
The advancements in building information modeling (BIM), the Internet of Things (IoT), and data analytics have accelerated building digitalization for design, construction, operation, and maintenance. The concept of building digital twins (DTs) has also gained momentum to realize the full potential of exploiting the rich information in BIM and sensing data to continuously monitor indoor environment and maintain healthy indoor environment. In the context of buildings, a DT is the digital counterpart of a physical asset that integrates real-time sensor readings and 3D virtual models of building assets to analyze and control the building’s interactions with its environment and users (
Jiang et al., 2021). DTs enable unprecedented monitoring, understanding, and optimization of the functions of physical entities via seamless data transmission between the physical and virtual worlds, deriving relevant solutions for real-life systems and informing and influencing human behaviors. In the Architecture, Engineering, and Construction (AEC) field, there has been a proliferation of studies aiming to develop fundamental principles and techniques for DT, including DT creation, data collection, processing, and analytics, simulation, and control. The primary applications of such research include construction management (
Boje et al., 2020;
Pan and Zhang, 2021), building operation and maintenance (
Lu et al., 2020a), and energy management (
Kaewunruen et al., 2019;
Agostinelli et al., 2021). However, very few studies have focused on leveraging DT techniques to improve human health outcomes in the indoor environment.
The COVID-19 pandemic has raised awareness of indoor environment health as multiple pathogens are transmitted in indoor environments with the potential to infect people. Thus, new visions have aspired to make the indoor environment healthy. For instance, new streams of data on indoor environment and their occupants could be exploited in the buildings’ DTs to assess infection and exposure risks, and the corresponding physical components, such as building control systems (e.g., heating, ventilation, and air conditioning (HVAC) systems), could be adaptively controlled to reduce biotic and abiotic exposure of occupants and increase human comfort while maintaining efficient and cost-effective operations. Despite these fundamental studies and technological developments, research has largely ignored the prospect of digital twinning for health management. Hence, the answers to these important questions remain ambiguous: 1) What existing technologies are available and how can these technologies be extended and linked to indoor environmental and human health? 2) What are the barriers to and gaps in linking the DT to human health in indoor environment? And 3) what future research is needed to further exploit the potential of DTs for improving human health in indoor environment?
Therefore, this research aims to summarize and discuss DT techniques in the context of environmental health and infection prevention and highlight the gaps that must be closed for the seamless integration of healthy buildings, paving the road for future research. To contribute to the body of knowledge, we compared and summarized the state-of-the-art techniques developed for building DTs, discussed their potential applications for and limitations of improving human and environmental health, and highlighted the knowledge gaps in extending it to ensure a healthy indoor environment. To the authors’ best knowledge, this review is the first to discuss these issues and present roadmaps for adding health to DTs. The remainder of this paper is organized as follows. Section 2 introduces the background and concept of DTs, as well as their enabling technologies. It also highlights the significance of DTs for a healthy indoor environment. Section 3 introduces the method and procedure of literature review. Sections 4–7 review the current applications of building DTs and knowledge gaps in healthy indoor environment research from the perspective of BIM, IoT sensing, data analytics, and smart building control. Section 8 summarizes and concludes the paper and discusses future research directions for cultivating a DT-enabled smart and healthy indoor environment, elaborating on a future vision of integrated DTs with special consideration of the above four emerging techniques and corresponding issues.
2 Digital twins for healthy indoor environment
A DT is a holistic simulation and representation of an as-built system that uses physical model and sensor updates to enable the flow of information between the physical and virtual systems to allow mirroring, or twinning, of the real space condition and optimization of the service or production (
Jiang et al., 2021). To this end, a DT has three components: The physical system, the virtual model, and connected data and associated analytics. Four steps are required for building a foundational DT for indoor environment, as presented in Fig.1, including: 1) develop a virtual and digital representation of a physical facility using 3D modeling (create); 2) establish a real-time bidirectional connection between the physical and cyber systems to allow real-time data acquisition and control (data sensing and control); 3) develop data analytics in virtual models to inform optimized control of physical systems for improved monitoring, production, and service (analyze); and 4) test, calibrate, and update the DT (evolve). The DT can exploit BIM approach and sensor networks to establish a real-time view of the facility. The collected data will capture the dynamics to enable real-time analytics and inform decision-making to actuate control of the physical components of the indoor environments to achieve healthy indoor environment, building service optimization, occupant comfort enhancement, etc. As such, existing buildings can benefit from implementing a DT to enhance building operation and maintenance.
The AEC industry has conducted studies on building DTs. For instance, research efforts have been made to develop DTs from BIM models and IoT sensor networks (
e.g., Lu et al., 2020a;
Pan and Zhang, 2021). Data processing methods have been established to detect data irregularity, support decision making, understand occupants’ behavior and states (
Saha et al., 2019;
Piscitelli et al., 2021), and provide a basis for physical control for building operation and service. The primary application domains are building energy efficiency (
Kaewunruen et al., 2019;
Francisco et al., 2020), occupant comfort (
Clausen et al., 2021), building safety (
Liu et al., 2020), and proactive maintenance (
Lu et al., 2020b). Despite the partial success of previous research efforts, there remains a challenge to integrate health considerations into DTs. The rationale for digital twinning for health is manifold. First, the indoor environment is highly heterogeneous with various functionalities and characteristics, providing diverse habitats for various pathogens. Microbiomes differ considerably in rooms with different usages in the same building (
Adams et al., 2016;
Gilbert and Stephens, 2018). Thus, exposure risks vary depending on the exact locations, underlining the need for spatially adapted risk management practices. Second, pathogens present in distinctive habitats in indoor environment, such as surfaces, air, and water, and have different transmission and exposure pathways (
Ijaz et al., 2016;
Prussin et al., 2017;
Kraay et al., 2018). The transmission of pathogens that persist on surfaces could be prevented by minimizing physical contact or enhancing surface disinfection; however, the same strategy would not be fully effective if the pathogens were also airborne. Third, the transmission and exposure pathways of pathogens are complicated by the building occupants’ behaviors, particularly in indoor environment with many users (
Zhang et al., 2018;
2020). Fourth, approaches such as prolonged access control and cleaning to reduce pathogen transmission and exposure could significantly disrupt the standard functionalities of indoor environments. The highly dynamic and connected nature of human behavior, microbial transmission, and building operations in the indoor environment requires the deployment of adaptive and advanced systems to enable real-time monitoring and control responses to constant changes in the building–human–pathogen system. Digital twinning holds significant promise in addressing these challenges.
3 Methodology for literature search
According to the background and concepts introduced in Sections 1 and 2, the DT of the indoor environment can be perceived as the interaction between the buildings’ indoor environment and its digital representation model to provide real-time monitoring, analytics, and control for the environments. Therefore, four research clusters that develop key technologies to enable the establishment of DT have been identified, i.e., BIM, IoT sensing, data analytics, and smart building control. Specifically, BIM serves as the foundation to create information-rich building models, which is the repository to accommodate the virtual space and relevant data. IoT sensing enables data acquisition regarding physical spaces and humans, such as environmental parameters, occupant behaviors, and the status of physical assets. The IoT data are critical links to connect the physical and virtual spaces. Then, with the advances in artificial intelligence (AI) as well as modeling techniques, we can retrieve useful information to support intelligent decision-making based on collected data. The derived information can be used to control the physical space.
Once determined the four research clusters, keyword-based search method was used to identify related studies for each research theme, where Google Scholar is the main search engine. Specifically, the keywords consist of three lists that cover research target object, approach, and objective, and are illustrated in Tab.1. The keywords in different lists are combined in the search, such as “building information modeling, health”, “sensing, building environment”, and “indoor air quality, building control, comfort”. Considering the large volume of literatures from the search, we have narrowed down the literatures to the ones published from 2015 to 2021. We performed initial screening by checking paper title and abstract to ensure the inclusion of most relevant papers. Specific criteria are employed to further exclude irrelevant papers, including 1) conference publications, 2) duplicated papers or papers discussing very similar contents from the same authors, and 3) papers focusing on hardware development. It should be noted that conference papers in fields of computer science and information technology were not excluded considering most of their innovations are first published in conferences. Based on all identified literatures, we reviewed four techniques related to DTs and discussed the potential applications and barriers to promote environmental and human health.
4 Building information modeling
4.1 Background and related studies on BIM for building management
BIM is a digital representation of a facility throughout its lifecycle for all stakeholders involved to exchange information and facilitate decisions (
Wang, 2012). Since the concept of BIM was proposed in the 1970s, it has been viewed as one of the most promising developments of the AEC industries in decades. Many applications have been developed for a facility’s lifecycle, e.g., design visualization, design performance analysis, clash detection, 4D schedule simulation, 5D cost estimation, and maintenance planning. By integrating with other advanced technologies (e.g., AI, virtual reality, 3D printing, and IoT), BIM has evolved as an open platform for facility information integration, exchange, and management (
Pan and Zhang, 2021) and has become an important data source for building DTs (
Boje et al., 2020). Some DT frameworks have been developed in the AEC industry for construction management (
Pan and Zhang, 2021), asset management (
Lu et al., 2020a), and energy management (
Kaewunruen et al., 2019). In these frameworks, BIM typically serves as a database, combining with sensor networks to achieve real-time monitoring for a facility. Based on these data, AI and machine learning techniques are used for knowledge mining, performance prediction, and real-time controls. In the process, industrial foundation classes (IFC)-based BIM models are often used to overcome data interoperability issues among variant software or tools because IFC is a platform-neutral file format for representing AEC data. Since different applications typically have specific data requirements, model view definition (MVD)—a subset of IFC schemas—has been used to clarify specific information requirements and overcome information redundancy issues. With the development of semantic web technologies, ontology-based information exchange methods are also gaining more attention (
Zhong et al., 2018).
A noticeably small amount of research has focused on BIM applications for healthy indoor environment. This area of study is often viewed as a sub-category of design performance or facility management with an emphasis on the management of indoor air quality, daylighting, ventilation, etc. COVID-19 has caused a significant increase in attention paid to using BIM to facilitate healthy indoor environment. Rice (
2021) summarized 14 building health indicators, including thermal comfort, VOCs, and sound insulation, and discussed the potential to use BIM models to measure and assess these indicators through questionnaires. Most of the respondents in the study agreed that BIM models can measure these indicators, though they have argument about the measurement complexity for different indicators. Three complexity levels were identified, listed here from simple to complex: Direct measurement based on BIM models, extra plug-in required, and extra sensors and detectors required. Some studies have focused on specific indicators (e.g., indoor air quality (
Utkucu and Sözer, 2020) and lighting (
Montiel-Santiago et al., 2020)) and have discussed how BIM can be used to assess and monitor them. In addition to human perceptible indicators, studies (e.g.,
Adams et al., 2014;
Li et al., 2021a) have been conducted from the building microbiology perspective to extract building information such as layout and link it with microbial communities acquired through high-throughput sequencing techniques to understand the impacts of design on microbial risks. For instance, Adams et al. (
2014) documented the variations in microbial communities between different room types. A pilot study conducted by Li et al. (
2021a) linked BIM with the microbial transmission model for the first time to assess the infection risks at the room level, paving the way for integration of BIM with microbial risk assessment to promote health. In summary, the current BIM usage for a healthy indoor environment involves two processes (as illustrated in Fig.2): 1) integrating information extracted from BIM with other data to assess health or health-related factors based on simulation or data-driven methods (
Sporr et al., 2019;
Kwok et al., 2020;
Gan et al., 2021); and 2) utilizing visualization capabilities of BIM to represent and communicate health or health-related factors to users (
Alavi et al., 2021;
Valinejadshoubi et al., 2021).
4.2 Knowledge gaps in BIM for a healthy indoor environment
Despite the scarce literature, we must address the two main research gaps to exploit BIM for a healthy indoor environment. First, an integrated healthy indoor environment assessment and simulation are required. Existing results focus on the simulation of individual indicators. However, these health indicators are not independent. For example, wall insulation will impact sound insulation as well as indoor temperature (thermal comfort) (
Yang and Moon, 2019), and indoor temperature alters the rate at which indoor environment inhabitants shed microorganisms (
Horve et al., 2020). Depending on one indicator to optimize design or operation may impact other health conditions. Therefore, figuring out the relationships among these indicators and designing an integrated simulation platform are critical to designing and maintaining a healthy indoor environment. BIM, as an integrated platform, can serve this integration function. However, the connection of BIM models with indoor environment health assessments or simulations may result in data interoperability issues that must be addressed. The interoperability exists at both the semantic level and syntactic level. At the semantic level, the data requirements for various indoor environment simulations are still unclear. BIM is an integrated model with multi-disciplinary information, but simplifying BIM models is necessary for specific simulations. At the syntactic level, multi-software exists for various simulations. To develop a BIM-based DT platform for healthy indoor environment management, it is necessary to enable model transfer between tools to avoid duplicating modeling efforts and maintain information consistency. IFC-based system development can be a potential solution, and this future direction includes identifying information requirements, clarifying information exchange process maps, and developing an MVD for healthy indoor environment.
Second, a closed-loop system is required to facilitate an environment that promotes health. Despite some progress made toward a healthy indoor environment, a significant gap remains in understanding how indoor environment design and operation will mediate occupant health. This knowledge gap stems from the lack of a generalizable framework and tools to quantitatively model the indoor environment and correlate the design and operation with health indicators. To close this gap, instead of using BIM as a pure data repository, it may be beneficial to create and test a computational framework and tools to explore the mechanistic relationships between design features/operation strategies and health indicators. In addition to simulation, with the application of the IoT for indoor environment monitoring, environmental and occupant data can be collected, which can then be linked back to BIM models to maintain a living BIM. The living BIM can serve as a source for knowledge mining (e.g., the relations between operation strategies and healthy indicators) based on data-driven methods and can also be used to calibrate simulations. A feedback loop must also be created based on simulation/data-driven models to guide healthy building design and maintain a healthy indoor environment with predictive operation and control.
5 Internet of Things sensing
The IoT connects massive sensing and actuating devices and allows them to exchange and share information via a communication network. Thus, it can deliver adequate information about environments to users to enable innovative and intelligent applications in various areas, including smart cities (
Gaur et al., 2015;
Singh et al., 2020), smart and connected healthcare (
Wu et al., 2017;
Pramanik et al., 2019), and smart building management (
Verma et al., 2019;
Daissaoui et al., 2020). IoT is a highly interdisciplinary field with various enabling technologies including sensing technologies, network communication, cloud computing, data analytics, and system control. IoT sensing is an indispensable component of digital twinning as it captures the attributes and statuses of physical systems (primarily the environment and occupants of the indoor environment), which are then used to establish the virtual twin and enable various analyses. Fig.3 illustrates the typical applications of IoT sensing in indoor environments, which can be divided into two categories: Environmental sensing and occupancy sensing.
5.1 Environmental sensing
Indoor environmental quality (IEQ) is critical to occupant health, comfort, and productivity (
Lee et al., 2019;
Marques and Pitarma, 2019;
Mujan et al., 2019). IEQ monitoring captures environmental attributes of the indoor environment and is a fundamental component of digital twinning, which can be achieved using IoT sensing technologies for thermal comfort optimization (
Salamone et al., 2018;
Zahid et al., 2021), indoor air quality assessment and control (
Benammar et al., 2018;
Jo et al., 2020), energy conservation (
Khanna et al., 2019;
Wu and Liu, 2020), and many other smart building applications. Various IoT sensors are used to measure IEQ, such as thermal comfort (e.g., temperature and humidity) sensors. The collected data can be used to estimate occupant comfort satisfaction (
Lee et al., 2019;
Alsaleem et al., 2020) and can provide rich information for the control of HVAC systems for energy conversation (
Rajith et al., 2018;
Ruano et al., 2018). Air velocity sensors may also be used to measure the airflow rate in the indoor environment (
Dong et al., 2019), which is another indicator used to assess and control the thermal comfort of occupants (
Uǧursal and Culp, 2013;
Cheng et al., 2015). Furthermore, the collected data could be coupled with computational fluid dynamics (CFD) and BIM to model and assess the transmission of air pollutants and airborne pathogens (
Shi et al., 2019).
Air quality sensors can monitor the concentrations of indoor pollutants such as PM
10, PM
2.5, and VOCs (
Marques et al., 2018;
Palmisani et al., 2021), which are major environmental carcinogens and have significant negative impacts on human health (
World Health Organization, 2021). Therefore, these levels are usually monitored using IoT sensors to assess indoor air quality for healthy environments. Moreover, CO
2 sensors can measure the concentrations of CO
2 in the indoor environment, which is typically used to capture the presence and number of occupants by leveraging the estimated amount of CO
2 exhaled by an individual (
Mumma, 2004;
Nassif, 2012). In addition to air quality, light and noise conditions also affect occupant comfort and health and are monitored using IoT sensors. For instance, light sensors are used to monitor the illumination intensity, and the collected data are commonly used for the intelligent control of lighting systems considering the interplay of energy costs, visual comfort, and occupants’ activities (
Leephakpreeda, 2005;
Navada et al., 2013;
Gentile et al., 2016;
Choi et al., 2016). Sound level sensors are further used to monitor noise in the indoor environment. Environmental noise directly affects occupant health and productivity, as high noise levels can cause various health symptoms, such as high blood pressure and stress. Therefore, sound sensing has been leveraged to detect noise levels to contribute to the assessment and control of acoustic comfort (
Marques and Pitarma, 2020).
5.2 Occupancy sensing
Occupants interact with building systems and indoor environments through their presence and activities, and such occupant behavior affects pathogen transmission and energy consumption in buildings, thus influencing the health outcomes and sustainability of the indoor environment. A variety of sensing technologies have been developed and used for occupant presence detection, counting, and localization. The passive infrared (PIR) sensor is one of the most widely used sensors for occupancy detection given its low cost and ease of deployment. It determines the presence of occupants by detecting the infrared radiation changes caused by an occupant’s movements (
Chen et al., 2018b). The outputs of PIR sensors can also be used to extract occupant movement patterns and estimate the number of occupants using algorithms developed in previous studies, such as distance- and direction-based algorithms (
Wahl et al., 2012) and statistical regression algorithms (
Raykov et al., 2016). Environmental sensors have also been used in occupancy detection. Specifically, as demonstrated by several studies (
Dong et al., 2010;
Candanedo et al., 2017), CO
2 sensors are the most effective environmental sensors as indoor CO
2 concentrations are directly related to the number of occupants. This data can further be fused with data from other environmental sensors, such as temperature and humidity sensors, for occupancy sensing (
Candanedo and Feldheim, 2016). Several radio-based sensing technologies, such as radio frequency identification (RFID), Bluetooth low energy (BLE), and Wi-Fi, have been increasingly used to estimate the presence and number of occupants in the indoor environment (
Filippoupolitis et al., 2017;
Wang et al., 2017a;
Zou et al., 2017). Such approaches rely on radio signals transmitted between receivers and transmitters and typically assume that each occupant carries one smartphone that receives Wi-Fi and Bluetooth signals, which may not always be true. Other studies (
Zhuang et al., 2016;
Topak et al., 2018) have leveraged the strength of received signals as well as changes in the strengths to localize occupants. However, radio-based localization is very sensitive to environmental configurations and is subject to multi-path errors, thus resulting in relatively low accuracy and reliability. With the advances of deep learning and computer vision techniques, cameras have been increasingly used in occupancy sensing considering the rich contextual information they provide. With multi-camera systems, the presence (
Silvestre-Blanes and Pérez-Lloréns, 2011), number (
Liu et al., 2013), and location (
Cai and Cai, 2020) of occupants can be estimated. However, vision-based occupancy sensing suffers from some issues, such as high computational costs, sensitivity to illumination conditions, the requirement of line-of-sight, and privacy concerns.
Occupant activity recognition is a key enabler of user-centered control in building management systems by providing context-aware and occupant-centered information. There are two primary streams of approaches, i.e., sensor-based and vision-based approaches, used to recognize daily activities for occupants in the buildings, such as running, walking, sitting, and jumping. Accelerometers embedded in wearable sensors or smartphones are the main sensors used to recognize activities, where motion signals are pre-processed to reduce noise and then analyzed via statistical or machine learning models to classify activities (
Chen et al., 2017;
2019a). Such approaches require occupants to carry sensors, which may not result in continual sensing of all occupants in the environment. Furthermore, the results are sensitive to the mounting positions of the sensors. Depth and RGB cameras are used to obtain joint positions and appearance information of occupants in the scene, which are then used to recognize human activity via computer vision techniques (
Cippitelli et al., 2016;
Zerrouki et al., 2018;
Zhao et al., 2019b). Vision-based activity recognition is relatively accurate and can capture rich scene contexts, though privacy and security are the main concerns in building applications. Recently, several studies (
Wang et al., 2017b;
Chen et al., 2019b) have also developed methods to leverage Wi-Fi signals that are becoming ubiquitous in the indoor environment for activity recognition. Wi-Fi channel state information (CSI) is exploited because it can measure the multipath variation influenced by occupants’ movements, thus providing fine-grained information on human activity. However, because body motions in a wireless environment can induce fluctuations in the received signal, ensuring the accuracy and robustness of such a system remains challenging.
Occupants’ physiological states, such as heart rate and the temperatures of different body parts, have also been collected using various sensors to assess occupant thermal comfort. Wearable sensors, such as smart wrist bands, are used to acquire wrist temperature, heart rate and other states, which can be correlated with activity levels using embedded internal sensors (
Hasan et al., 2016;
Li et al., 2018). Non-contact and non-invasive sensors, such as infrared cameras (
Pavlin et al., 2017;
Li et al., 2019), are alternatives for capturing the skin temperatures of multiple occupants at a distance. Despite the advantages of non-invasiveness and high efficiency, the accuracy of this method needs significant improvement. The sensed physiological states of occupants provide rich data for thermal comfort models, thus facilitating adaptive control to ensure thermal comfort while mitigating energy costs.
5.3 Knowledge gaps in IoT sensing for environmental and human health monitoring
Despite numerous studies, IoT sensing in buildings for human well-being and health is still superficial because of many issues. First, although various environmental sensors and wearable sensors have been used to monitor IEQ and occupants’ physiological states, the direct measurement of health-related factors, such as pathogens, using sensors remains underdeveloped. Airborne pathogens, such as Influenza, Rhinovirus, and SARS-CoV-2, pose a significant threat to occupant health and may cause infectious diseases in the indoor environment. However, the direct detection of airborne pathogens from the air is extremely challenging considering the heterogeneous characteristics of pathogens and the low concentrations found in aerosol samples. The currently developed biosensors are expensive and their sensitivity is still largely unsatisfactory (
Fronczek and Yoon, 2015). This leaves a critical need for low-cost real-time biosensors for pathogen detection in the indoor environment.
Second, although low-cost environmental sensors could measure IEQ, the accuracy and reliability of their measurements mainly rely on the configuration and placement of the sensors, particularly in complex indoor environments. For instance, it is extremely challenging to place sensors in a way that covers all building areas to measure CO2 concentration. Moreover, photometric sensors are subject to the impacts of sunlight when placed near a window. Considering various sensor configurations and placements, effective methods are also needed to convert point measurements to model the continuous states of the 3D space and to determine optimal sensor configuration considering the trade-offs among multiple factors, such as cost, installation efforts, measurement accuracy, and spatial resolution. Furthermore, data fusion algorithms should be developed to fuse various environmental and occupant sensors to obtain more comprehensive and accurate information.
Third, current occupant sensing can only provide approximate occupancy information. The accurate localization and tracking of multiple occupants in complex indoor environments remain challenging. Moreover, current occupant activity monitoring mainly focuses on daily activities conducted by individuals; the interactions among multiple occupants and those between occupants and the environment are overlooked, hindering the assessment of potential disease transmission and health impacts due to occupant–environment interactions. This will be further elaborated in the next section.
Fourth, IoT sensors, especially cameras and wearable sensors, suffer from security and privacy concerns. Recent advances in edge-computing and privacy-preserving techniques have great potential to mitigate such issues. For instance, depth and thermal images can be used instead of RGB images for video analysis. Edge devices can also be used to allow local and decentralized data analytics without having to store and transmit data. However, more studies are required to ensure privacy and security in IoT sensing for the indoor environment. Furthermore, as the use of IoT sensing in the indoor environment increases, the real-time transmission of big data remains challenging. 5G mobile networks could offer a promising solution through integration with the building sensing network.
6 Data analytics
Data analytics has been used as an important tool to facilitate the operation and maintenance of buildings to improve building performance, such as indoor air quality, thermal comfort, and energy conservation. Data collected through multiple channels, including building design and operation information, IoT sensors, and public domain datasets (e.g., outdoor weather and air quality data and land use data), are processed and analyzed to predict the status and performance of the building and derive information-based control decisions. Fig.4 illustrates the typical approaches of data analytics in various applications of building management, which can be divided into two categories: Simulation-based methods and data-driven methods.
6.1 Simulation-based methods
Over the past decades, physical-based building performance simulation (BPS) has been widely studied and classically applied to predict building performance in terms of energy consumption, carbon emissions, and other occupant comfort-related metrics (
Azar et al., 2020). BPS is based on physical principles and thermodynamical heat and mass transfer, and requires detailed building information, such as building geometry, construction materials, mechanical, electrical and plumbing (MEP) system configuration and operation schedules, and occupancy schedules of the building. From the perspective of a healthy indoor environment, various BPS models, typically integrated with CFD techniques, are used to model, assess, and control indoor air quality (
Li and Wen, 2014;
Zhuang et al., 2014), pathogen transmission (
Peng et al., 2020;
Motamedi et al., 2022), and thermal and visual comfort (
Costanzo and Donn, 2017;
Cheong et al., 2020). A main limitation of BPS models is the simplicity of occupant behavior, which will influence the accuracy and reliability of simulation outputs. Agent-based simulation (ABS) has been used to better model the dynamic interaction between occupants and the indoor environment and can be integrated with BPS to study the influence of such interaction on building performance, such as energy consumption (
Zhang et al., 2011;
Ding et al., 2019) and disease transmission (
Zhou et al., 2021;
Islam et al., 2022), to guide building operation and control. In addition, given the rich building design and operation information (both spatial and semantic information) in BIM, it has become an indispensable tool in data analytics and has been coupled with BPS to generate more accurate and reliable outputs in terms of building performance (
Zhou et al., 2020b).
6.2 Data-driven methods
With the advances in sensing technologies and machine learning algorithms, big data analytics and data-driven approaches have attracted growing interest in predicting building performance and occupancy behavior. With minimal requirements for prior knowledge regarding building and occupancy, such an approach can rapidly learn from data with high computational efficiency. In data-driven analysis, the IoT sensors described in the previous section serve as a major source of real-time environmental and occupant data. Other data sources include publicly available data (e.g., climate and air quality data), and simulated data collected from BPS models for either virtual or real buildings (e.g., light distribution and thermal comfort). Once the data are collected, pre-processing techniques, such as data cleaning, data reduction, data transformation, and feature engineering, are used to organize the data in structured format (
Fan et al., 2021). Then, various models are developed to learn from the data for the prediction of either building performance or occupant behavior. Typical data-driven models in building management include probabilistic models (e.g., Bayesian networks (
He et al., 2016) and the hidden Markov model (
Lam et al., 2009)), statistical and classical machine learning models (e.g., support vector machines (
Jain et al., 2014) and decision trees (
Yu et al., 2010)), and, most recently, deep learning models (e.g., convolutional neural networks (CNNs) and recurrent neural networks (RNNs) (
Zhao et al., 2019a;
Chen et al., 2021)). Data-driven methods are applied to model a wide range of information in the context of a healthy indoor environment, including occupancy detection and activity recognition (
Saha et al., 2019), occupant comfort modeling (
Chen et al., 2015a;
Zhou et al., 2020a), fault detection and diagnosis (
Piscitelli et al., 2021;
Xia et al., 2021), and building performance benchmarking (
Wang, 2015;
Yang et al., 2018b), which provide further insights for occupant-centered building design and control.
6.3 Knowledge gaps in data analytics for assessing environmental and human health
Despite the numerous techniques developed for data analytics, there are significant gaps in processes of linking the data with environmental and human health. First, indoor environment health is a broad concept. Most existing studies focus on indoor air quality and thermal comfort to ensure occupant comfort and well-being. Significantly less attention is paid to the pathogen transmission and exposure risk caused by IEQ and occupant behavior. Considering the current COVID-19 pandemic and other infectious diseases (e.g., seasonal flu) that can easily be transmitted within the indoor environment through indoor airflow and occupant behavior, substantial research efforts are necessary to join the modeling of occupant–environment interactions, building environmental quality, and the microbial (e.g., pathogen) transmission process to estimate spatiotemporal infection risks. New data analysis methods should be developed to monitor human–environment interactions using IoT sensors, with more focus on personalized assessment.
Second, the connection between environmental and human health has not been well exploited. Dose-response models are usually used to quantify the effects of exposure (e.g., bioaerosol concentration) on human health (
US Environmental Protection Agency, 2019), which must be integrated into the data analytics of building performance in order to derive meaningful insights and control decisions to improve occupant health.
Third, due to diverse occupant characteristics and the complex and dynamic interaction between occupants and the indoor environment, individuals may suffer different exposure risks or exhibit different comfort levels. However, personalized assessments are challenging because most sensing and data analysis methods consider occupants at an aggregate level instead of individuals. Although wearable sensors can be used to acquire individual data, privacy and security issues must be considered in the development of data analysis methods. Federated learning could be a promising solution as models are trained locally and the trained model parameters, instead of sensing data, are transmitted to and aggregated in the central server.
Forth, both simulation-based and data-driven methods have their own limitations. For instance, a simulation-based approach relies heavily on assumptions and cannot capture real-time situations or determine when reality deviates from modeled assumptions. Data-driven approaches require a large amount of data for model training and may fail to learn the physical principles of the real world. Methods should be developed to strategically integrate real-time sensing data and physical-based simulation to model building performance at a fine resolution to accurately simulate the real-world indoor environment without losing the high fidelity of the real-time data. In addition to building and occupant data, broader contextual data can be incorporated in building analysis. For example, during the COVID-19 pandemic, contact-tracing data and epidemiology surveillance data could be linked with a pathogen transmission model and building information to assess the infection risks at the building level or room level.
7 Smart building control
7.1 Traditional building control and its limitations
The building operation phase, accompanied by more than 80% of energy consumption and 50% of greenhouse gas emissions, constitutes one of the most important phases in the lifecycle of a building (
Sharma et al., 2011). Smart building control is important to facilitate improved energy efficiency and indoor environment status in building operation such that a healthy and comfortable environment is provided to building occupants in a sustainable manner. In the past, building controls have usually been rule-based: Simplified “if/else” conditions have been defined to operate the building systems (
May-Ostendorp et al., 2011;
Schulze and Eicker, 2013;
de Coninck et al., 2014). Control variables in building systems are adjusted, e.g., based on the proportional-integral-derivative (PID) scheme (
Blasco et al., 2012), to ensure successful implementation of controls and to achieve well-defined objectives in the control process. With explicit control rules and logic, rule-based control is easily understandable and not computationally demanding; hence, it is readily integrated into building management systems. Nevertheless, without any predictive power or reflection on the dynamics of continuously changing building operations, this control strategy is likely to output suboptimal control sequences that result in degraded building operation performance (
Alimohammadisagvand et al., 2018). More advanced techniques are essential to further enhance the development of building physics control.
7.2 IoT and digital twin-enabled smart building control
The advancement of IoT techniques reveals a new opportunity for intelligent decision-making in building operations by introducing sensing data streams that describe building dynamics in real time. DTs of buildings, integrating these IoT sensing data and advanced building modeling, have arisen to empower smart building operation in a predictive way. The model predictive control (MPC) is the most widely used smart building control strategy (see Fig.5). Specifically, MPC techniques employ the central building model to predict building dynamics based on the ambient environment and measured building status (e.g., indoor environment and energy) and then optimize the building operations for different purposes (
Rockett and Hathway, 2017;
Serale et al., 2018). To support the accurate prediction of indoor dynamics and reliable decision-making, the useful sensing data include indoor environment status, building energy consumption, indoor occupant activities, outdoor weather conditions, etc. (
Oldewurtel et al., 2012;
Halhoul Merabet et al., 2021;
Zhan and Chong, 2021). This sensing information is then passed into digital building models capable of simulating building dynamics as outcomes of the proposed control actions.
Specifically, physics-based white-box modeling (
Salakij et al., 2016;
Schirrer et al., 2016;
Ruusu et al., 2019), grey-box modeling (
Li et al., 2021b), and data-driven black-box modeling (
Zhang et al., 2021a) of buildings have all been exploited for building digital replicas of different complexities to aid in building system control decision-making. White-box modeling utilizes the principles of heat transfer and thermodynamics to describe various building dynamics under internal and external disturbance (
Li and Wen, 2014). Modeling engines such as EnergyPlus (
Crawley et al., 2001) and TRNSYS (
TRNSYS, 2019) have been developed to reliably simulate the indoor environment and energy dynamics across diverse circumstances with different fidelities despite the extensive expertise required to establish physics-based building models. Data-driven black-box modeling fully relies on mathematical models to predict building dynamics with model inputs (
Wang and Chen, 2019). Unlike white-box modeling that requires possibly massive numbers of inputs while being computationally intensive, black-box building models are usually lightweight in calculation; hence, they are more readily applied in the iterative optimization process of MPC implementation. However, it is difficult to explicitly interpret the parameters of black-box models, which impacts their practical acceptability in building engineering. As a combination of white-box and black-box modeling, grey-box models are reduced-order physics-based building models (
Harb et al., 2016). Grey-box models have been widely adopted in smart building control due to certain advantages, including being 1) more easily interpretable compared to black-box models with aggregated physical meanings in model parameters, and 2) less computationally intensive compared to high-fidelity and physics-based white-box models. The resistance–capacitance model (
Yang et al., 2018a) is the most widely used grey-box building model for supporting smart building operation.
Overall, building DTs with advanced dynamic modeling capabilities serves as the intelligent brain in smart building control. With inputs and outputs from digital building models, building control is formulated as an optimization problem to generate optimal control sequences of building systems. Different optimization techniques (e.g., mixed integer problem and nonlinear programming) can be employed to efficiently solve the problems (
Xi et al., 2007;
Široký et al., 2011). MPC techniques have been applied in real buildings with demonstrable performance to improve building operation practices (
Široký et al., 2011;
Ferreira et al., 2012;
Ruusu et al., 2019). Regarding the control purpose, DT-enabled smart building controls mainly focus on improving indoor health and environmental comfort with energy efficiency in building operations (
West et al., 2014;
Chen et al., 2015b). Achieving improved building services, including both enhanced occupant health and comfort with energy efficiency, is the most commonly set target for smart building control (
Halhoul Merabet et al., 2021). The health and comfort of indoor occupants are comprised of various qualities, such as indoor air quality, thermal comfort, and visual comfort. By incorporating building modeling and indoor and outdoor air quality monitoring, building DTs have enabled consistently optimal ventilation by switching ventilation modes (
Chen et al., 2018a;
Cao et al., 2020) and adjusting ventilation rates (
Cho et al., 2015;
Ganesh et al., 2019;
Li and Wang, 2020) to reduce the indoor air pollutant concentration (
Kim et al., 2019a) while saving energy in ventilation operation (
Kim et al., 2016). The controlled indoor air pollutants include CO
2 (
Jreijiry et al., 2007;
Hesaraki and Holmberg, 2015), total VOC (TVOC) (
Hesaraki and Holmberg, 2015;
Guyot et al., 2018), respiratory particle matters (
Morgan et al., 2017;
Kim et al., 2019b;
Cao et al., 2020), and ozone (
Walker and Sherman, 2013), among others. Multiple categories of air pollutants are usually simultaneously considered in the control development (
Ganesh et al., 2019;
Afroz et al., 2020;
Saini et al., 2020). Effective indoor environmental sensor deployment is important to sufficiently control indoor air pollutants in building operation (
Zeng et al., 2020). Furthermore, occupant behaviors and modeling could be additionally critical components in effective indoor air pollutant management (
Kim et al., 2019a), especially in demand-controlled ventilation strategies that require accurate occupancy information to determine ventilation rates (
Wang et al., 2018).
With the spread of COVID-19, methods of controlling airborne infection have received increasing attention. Smart ventilation strategies, including both mechanical and natural ventilation (
Qian et al., 2010) along with occupancy adjustments, have been proposed to reduce virus transmission risks in building operations (
Wang et al., 2021;
Xu et al., 2021;
Zhang et al., 2021b). The on/off thresholds of HVAC systems or thermostat set points are typically adjusted to minimize the building’s energy consumption while ensuring the thermal comfort of occupants (
Castilla et al., 2011) based on the Predicted Mean Vote/Predicted Percentage of Dissatisfied or adaptive thermal comfort model (
Enescu, 2017). To ensure satisfactory indoor illuminance conditions without wasting energy, smart lighting control strategies have also been developed to optimize lighting operation schedules (using on/off or dimming controls) based on measured environmental status and occupancy (
Kandasamy et al., 2018;
Zou et al, 2018). Moreover, to broadly improve regional sustainability with reduced greenhouse gas and air pollutant emissions associated with fossil fuel-based power generation, building DTs utilizing system-sensing data are also emerging to facilitate the integration of renewable energy and building operations with demand response support, considering power generation, energy storage (e.g., thermal or battery storage), and building and occupant needs (
Albadi and El-Saadany, 2007;
Biyik and Kahraman, 2019;
Kathirgamanathan et al., 2021).
7.3 Knowledge gaps in building control for healthy building operation
The existing gaps in the research on smart control for healthy building operations are threefold, including lack of sensing data and DT integration, the negligence of health factors in the control loop, and insufficient consideration of social interactions in the control design.
First, with the advancement of sensing techniques, massive amounts of sensing data, related to not only the indoor environment (e.g., temperature and humidity) but also occupancy and indoor air quality, are continuously collected as time-series streams during building operation. These data provide valuable resources to construct building intelligence with smart control for healthy building operations. However, depending on the building size, number of sensing dimensions and deployed sensors, and data collection frequency, effective management and optimal integration of these sensing data with DTs are essential—yet, with room for improvement—to fully uncover the data values for healthy building operation. In practice, this requires improvements to both hardware, e.g., data storage techniques, and software, e.g., computing and optimization algorithms, to sufficiently support smart control applications. Efficient data processing is critical to ensure real-time implementation and enhance the adaptiveness of intelligent building management. To standardize the sensor integration process and better utilize sensing data, the building metadata schema and semantic data descriptions should be further improved. Recent efforts such as Brick and Project Haystack represent important steps toward facilitating the IoT sensing data integration with DTs.
Second, based on the literature review, existing building smart control strategies have largely focused on achieving building energy efficiency and comfort while neglecting health and infection prevention. Due to the lack of digital information regarding the infection risks and different pathogens’ transmission routes, the operation of the physical components of buildings has not been fully linked with health evaluations. Devising bio-informed adaptive control can address this limitation. For instance, ventilation strategies that consider infection risks and pathogen transmission routes in the control loop (
Qian and Zheng, 2018) could be developed to dilute the concentrations of bioaerosols and the pathogens. Furthermore, sunlight could significantly deactivate the pathogens (
Hobday and Dancer, 2013), making adaptive façade design and control of a potential method for disinfection (
Xie et al., 2021). As service robots have grown in popularity, there has been development in their ability to clean and disinfect surfaces (
Hu et al., 2020). It is also possible to control in-duct ultraviolet light and ozone generators more precisely for air decontamination (
Ijaz et al., 2016).
Third, the existing smart control strategies only consider individual human factors while neglecting social interactions between occupants in healthy building operations. The social interactions of occupants may increase virus transmission in buildings. Thus, social interactions must be considered when adapting the physical components of building systems. Incorporating human interaction and the associated infection risks as additional dimensions in smart control development will further improve the practice of healthy building operation, although this integration should be achieved while considering the other goals of building control, such as energy efficiency and comfort, to operate buildings in a smart and sustainable manner.
8 Conclusions and future research directions
The COVID-19 pandemic raised awareness of indoor environmental health, particularly the infection risks within buildings. There is increasing advocacy for smart buildings to create more healthy environments. This paper reviewed the techniques developed for building DTs, i.e., BIM, IoT sensing, data analytics, and smart building control, as well as the numerous applications of such DTs, and confirmed the lack of necessary consideration of environmental and occupant health. This research identified knowledge gaps and discussed future research directions for each technique toward promoting healthy indoor environment. Fig.6 illustrates our vision of digital twinning for a healthy indoor environment, which requires connecting the four important components of a DT for seamless integration. A DT for a healthy indoor environment must include real-time monitoring and integrated modeling of the complex environment–occupant–microbiome interaction, as well as quantitative assessment and adaptive control for both environmental and human health.
• The advance of IoT sensing is expected to provide unprecedented real-time data for every aspect of the indoor environment. To achieve this, new cost-effective sensors are needed for direct detection of microbiome communities (e.g., pathogens), and methods must be developed for optimal sensor configuration to ensure both measurement accuracy and cost efficiency. New technologies, such as edge devices and privacy-preserving sensors, should be used in IoT sensing to ensure security and privacy. Moreover, with the large volume and heterogeneity of data, future research should develop standard metadata schemas and semantic data descriptions for data integration with DTs.
• In the future healthy indoor environment, BIM should serve as an integrated assessment and simulation platform to provide actionable information for building control systems by combining environment–occupant–microbiome information with multi-disciplinary simulations (e.g., energy, air quality, and pathogen transmission simulations). This highlights the challenge of data interoperability. IFC-based system development could be a potential solution, and future directions should include information requirement identification and information exchange process map clarification. For real-time sensory data, future BIM is expected to be continuously updated, thus calibrating the simulation. In addition to the assessment and simulation in support of smart building control, a closed-loop design framework is needed to identify the relationships between indoor environment characteristics and health indicators to guide building design.
• The current smart building control, which mainly emphasizes energy efficiency and comfort, must be augmented with the capability to ensure environmental health and mitigate infection risk. The relationship between building operations (e.g., ventilation and surface disinfection) and infection risk must be quantified to achieve bio-informed adaptive control. Furthermore, occupants are important as active components of the indoor environment, and their behavior will largely influence the infectious disease transmission. Therefore, the interactions among occupants, and between occupants and the environment, must be considered in building control strategies.
• Advanced data analytics is an essential part of digital twinning for a healthy indoor environment, providing valuable information for each of the other three techniques. New methods leveraging rich, real-time sensing data, building design and operation data, and external contextual data are needed to strategically integrate physics-based simulations and data-driven analysis for environmental–occupant–pathogen modeling. Furthermore, methods are needed to establish a quantified relationship between environmental and human health. Another future research direction may include personalized assessments to fulfill diverse requirements for occupants with different characteristics and preferences. To achieve this, it is critical to obtain high-resolution occupant information, which calls for both advanced sensing technology and data analysis algorithms that can preserve occupant privacy. Federated learning could be a potential solution for personalized and privacy-preserving analysis.
Health has multifaceted dimensions and is very difficult to address. As the COVID-19 pandemic raises awareness of pathogenic infection within the indoor environment, we believe that health can be incorporated into the DT paradigm by first digitalizing the infection risks to prevent infections. The paradigm could be further adapted to include other elements of environmental exposure such as VOCs and PM2.5. For other aspects of health and human well-being, such as mental health, productivity, and sick building syndrome, it is important to further understand the influencing factors and link the sensing data, building model, and health performance in DTs.