A novel deep learning framework with variational auto-encoder for indoor air quality prediction
Qiyue Wu, Yun Geng, Xinyuan Wang, Dongsheng Wang, ChangKyoo Yoo, Hongbin Liu
A novel deep learning framework with variational auto-encoder for indoor air quality prediction
● PLS-VAER is proposed for modeling of PM2.5 concentration.
● Data are decomposed by PLS to capture nonlinear feature.
● VAER can improve the predictive performance by variational inference.
● The proposed model provides a novel method for monitoring indoor air quality.
Exposure to poor indoor air conditions poses significant risks to human health, increasing morbidity and mortality rates. Soft measurement modeling is suitable for stable and accurate monitoring of air pollutants and improving air quality. Based on partial least squares (PLS), we propose an indoor air quality prediction model that utilizes variational auto-encoder regression (VAER) algorithm. To reduce the negative effects of noise, latent variables in the original data are extracted by PLS in the first step. Then, the extracted variables are used as inputs to VAER, which improve the accuracy and robustness of the model. Through comparative analysis with traditional methods, we demonstrate the superior performance of our PLS-VAER model, which exhibits improved prediction performance and stability. The root mean square error (RMSE) of PLS-VAER is reduced by 14.71%, 26.47%, and 12.50% compared to single VAER, PLS-SVR, and PLS-ANN, respectively. Additionally, the coefficient of determination (R2) of PLS-VAER improves by 13.70%, 30.09%, and 11.25% compared to single VAER, PLS-SVR, and PLS-ANN, respectively. This research offers an innovative and environmentally-friendly approach to monitor and improve indoor air quality.
Indoor air quality / PM2.5 concentration / Variational auto-encoder / Latent variable / Soft measurement modeling
[1] |
Aljunaid M , Tao Y , Shi H . (2021). A novel mutual information and partial least squares approach for quality-related and quality-unrelated fault detection. Processes (Basel, Switzerland), 9(1): 166
CrossRef
Google scholar
|
[2] |
Alsenan S A , Al-Turaiki I M , Hafez A M . (2020). Feature extraction methods in quantitative structure activity relationship modeling: a comparative study. IEEE Access: Practical Innovations, Open Solutions, 8: 78737–78752
CrossRef
Google scholar
|
[3] |
Ángel de Miguel M , Armingol J M , García F . (2022). Vehicles trajectory prediction using recurrent VAE network. IEEE Access: Practical Innovations, Open Solutions, 10: 32742–32749
CrossRef
Google scholar
|
[4] |
Apsemidis A , Psarakis S , Moguerza J M . (2020). A review of machine learning kernel methods in statistical process monitoring. Computers & Industrial Engineering, 142: 106376
CrossRef
Google scholar
|
[5] |
Challoner A , Pilla F , Gill L . (2015). Prediction of indoor air exposure from outdoor air quality using an artificial neural network model for inner city commercial buildings. International Journal of Environmental Research and Public Health, 12(12): 15233–15253
CrossRef
Google scholar
|
[6] |
Chen R Q , Shi G H , Zhao W L , Liang C H . (2021). A joint model for IT operation series prediction and anomaly detection. Neurocomputing, 448: 130–139
CrossRef
Google scholar
|
[7] |
Chen Y Y , Sung F C , Chen M L , Mao I F , Lu C Y . (2016a). Indoor air quality in the metro system in north Taiwan, China. International Journal of Environmental Research and Public Health, 13(12): 1200
CrossRef
Google scholar
|
[8] |
Chen Z , Ding S X , Zhang K , Li Z , Hu Z . (2016b). Canonical correlation analysis-based fault detection methods with application to alumina evaporation process. Control Engineering Practice, 46: 51–58
CrossRef
Google scholar
|
[9] |
Chen Z , Zhang K , Ding S X , Shardt Y A W , Hu Z . (2016c). Improved canonical correlation analysis-based fault detection methods for industrial processes. Journal of Process Control, 41: 26–34
CrossRef
Google scholar
|
[10] |
Correia C , Martins V , Cunha-Lopes I , Faria T , Diapouli E , Eleftheriadis K , Almeida S M . (2020). Particle exposure and inhaled dose while commuting in Lisbon. Environmental Pollution, 257: 113547
CrossRef
Google scholar
|
[11] |
Diao M , Holloway T , Choi S , O’Neill S M , Al-Hamdan M Z , Van Donkelaar A , Martin R V , Jin X , Fiore A M , Henze D K .
CrossRef
Google scholar
|
[12] |
Feng S , Gao D , Liao F , Zhou F , Wang X . (2016). The health effects of ambient PM2.5 and potential mechanisms. Ecotoxicology and Environmental Safety, 128: 67–74
CrossRef
Google scholar
|
[13] |
Han K , Wen H , Shi J , Lu K H , Zhang Y , Fu D , Liu Z . (2019). Variational autoencoder: an unsupervised model for encoding and decoding fMRI activity in visual cortex. NeuroImage, 198: 125–136
CrossRef
Google scholar
|
[14] |
Hong Y , Hwang U , Yoo J , Yoon S . (2019). How generative adversarial networks and their variants work: an overview. ACM Computing Surveys, 52(1): 3301282
|
[15] |
Ji W , Liu C , Liu Z , Wang C , Li X . (2021). Concentration, composition, and exposure contributions of fine particulate matter on subway concourses in China. Environmental Pollution, 275: 116627
CrossRef
Google scholar
|
[16] |
Jin X B , Gong W T , Kong J L , Bai Y T , Su T L . (2022). PFVAE: a planar flow-based variational auto-encoder prediction model for time series data. Mathematics, 10(4): 610
CrossRef
Google scholar
|
[17] |
Kim M H , Kim Y S , Lim J , Kim J T , Sung S W , Yoo C . (2010). Data-driven prediction model of indoor air quality in an underground space. Korean Journal of Chemical Engineering, 27(6): 1675–1680
CrossRef
Google scholar
|
[18] |
Längkvist M , Karlsson L , Loutfi A . (2014). A review of unsupervised feature learning and deep learning for time-series modeling. Pattern Recognition Letters, 42: 11–24
CrossRef
Google scholar
|
[19] |
Lee S H , Choi S . (2007). Two-dimensional canonical correlation analysis. IEEE Signal Processing Letters, 14(10): 735–738
CrossRef
Google scholar
|
[20] |
Liu H , Yang C , Huang M , Yoo C . (2020). Multivariate statistical monitoring of subway indoor air quality using dynamic concurrent partial least squares. Environmental Science and Pollution Research International, 27(4): 4159–4169
CrossRef
Google scholar
|
[21] |
Loy-Benitez J , Heo S , Yoo C . (2020). Soft sensor validation for monitoring and resilient control of sequential subway indoor air quality through memory-gated recurrent neural networks-based autoencoders. Control Engineering Practice, 97: 104330
CrossRef
Google scholar
|
[22] |
Makarenkov V , Legendre P . (2002). Nonlinear redundancy analysis and canonical correspondence analysis based on polynomial regression. Ecology, 83(4): 1146–1161
CrossRef
Google scholar
|
[23] |
Mannan M , Al-Ghamdi S G . (2021). Indoor air quality in buildings: a comprehensive review on the factors influencing air pollution in residential and commercial structure. International Journal of Environmental Research and Public Health, 18(6): 3276
CrossRef
Google scholar
|
[24] |
Mehmood T , Liland K H , Snipen L , Saebo S . (2012). A review of variable selection methods in partial least squares regression. Chemometrics and Intelligent Laboratory Systems, 118: 62–69
CrossRef
Google scholar
|
[25] |
Melaku Y A , Gill T K , Taylor A W , Adams R , Shi Z . (2018). A comparison of principal component analysis, partial least-squares and reduced-rank regressions in the identification of dietary patterns associated with bone mass in ageing Australians. European Journal of Nutrition, 57(5): 1969–1983
CrossRef
Google scholar
|
[26] |
Memarzadeh M , Matthews B , Avrekh I . (2020). Unsupervised anomaly detection in flight data using convolutional variational auto-encoder. Aerospace (Basel, Switzerland), 7(8): 115
CrossRef
Google scholar
|
[27] |
Ooi S K , Tanny D , Chen J , Wang K . (2021). Developing semi-supervised variational autoencoder-generative adversarial network models to enhance quality prediction performance. Chemometrics and Intelligent Laboratory Systems, 217: 104385
CrossRef
Google scholar
|
[28] |
Pu Z , Yan J , Chen L , Li Z , Tian W , Tao T , Xin K . (2023). A hybrid Wavelet-CNN-LSTM deep learning model for short- term urban water demand forecasting. Frontiers of Environmental Science & Engineering, 17(2): 22
CrossRef
Google scholar
|
[29] |
Qian J , Song Z , Yao Y , Zhu Z , Zhang X . (2022). A review on autoencoder based representation learning for fault detection and diagnosis in industrial processes. Chemometrics and Intelligent Laboratory Systems, 231: 104711
CrossRef
Google scholar
|
[30] |
Qin Y , Lou Z , Wang Y , Lu S , Sun P . (2022). An analytical partial least squares method for process monitoring. Control Engineering Practice, 124: 105182
CrossRef
Google scholar
|
[31] |
Ran X , Chen W , Yvert B , Zhang S . (2022). A hybrid autoencoder framework of dimensionality reduction for brain-computer interface decoding. Computers in Biology and Medicine, 148: 105871
CrossRef
Google scholar
|
[32] |
RasiwasiaNMahajanDMahadevanVAggarwalG (2014). Cluster canonical correlation analysis. Reykjavik, ICELAND, 823–831
|
[33] |
Reche C , Moreno T , Martins V , Minguillon M C , Jones T , de Miguel E , Capdevila M , Centelles S , Querol X . (2017). Factors controlling particle number concentration and size at metro stations. Atmospheric Environment, 156: 169–181
CrossRef
Google scholar
|
[34] |
San Martin G , Lopez Droguett E , Meruane V , das Chagas Moura M . (2019). Deep variational auto-encoders: a promising tool for dimensionality reduction and ball bearing elements fault diagnosis. Structural Health Monitoring, 18(4): 1092–1128
CrossRef
Google scholar
|
[35] |
Shu X , Bao T , Li Y , Gong J , Zhang K . (2022). VAE-TALSTM: a temporal attention and variational autoencoder-based long short-term memory framework for dam displacement prediction. Engineering with Computers, 38(4): 3497–3512
CrossRef
Google scholar
|
[36] |
Śmiełowska M , Marc M , Zabiegala B . (2017). Indoor air quality in public utility environments: a review. Environmental Science and Pollution Research International, 24(12): 11166–11176
CrossRef
Google scholar
|
[37] |
SouzaFAraujoRMendesJ (2016). Review of soft sensor methods or regression applications. Chemometrics and Intelligent Laboratory Systems 152: 69-79 doi: 10.1016/j.chemolab.2015.12.011
|
[38] |
Su X , Sutarlie L , Loh X J . (2020). Sensors and analytical technologies for air quality: particulate matters and bioaerosols. Chemistry, an Asian Journal, 15(24): 4241–4255
CrossRef
Google scholar
|
[39] |
Sun J , Wang X , Xiong N , Shao J . (2018). Learning sparse representation with variational auto-encoder for anomaly detection. IEEE Access: Practical Innovations, Open Solutions, 6: 33353–33361
CrossRef
Google scholar
|
[40] |
Vallejo M , de La Espriella C , Gómez-Santamaría J , Ramírez-Barrera A F , Delgado-Trejos E . (2020). Soft metrology based on machine learning: a review. Measurement Science & Technology, 31(3): 032001
CrossRef
Google scholar
|
[41] |
Wang B , Li Z , Dai Z , Lawrence N , Yan X . (2020). Data-driven mode identification and unsupervised fault detection for nonlinear multimode processes. IEEE Transactions on Industrial Informatics, 16(6): 3651–3661
CrossRef
Google scholar
|
[42] |
Wang J , Lu Y , Xin C , Yoo C , Liu H . (2022). Kernel PLS with AdaBoost ensemble learning for particulate matters forecasting in subway environment. Measurement, 204: 111974
CrossRef
Google scholar
|
[43] |
Wei W , Ramalho O , Malingre L , Sivanantham S , Little J C , Mandin C . (2019). Machine learning and statistical models for predicting indoor air quality. Indoor Air, 29(5): 704–726
CrossRef
Google scholar
|
[44] |
Xie W , You J , Zhi C , Li L . (2021). The toxicity of ambient fine particulate matter (PM2.5) to vascular endothelial cells. Journal of Applied Toxicology, 41(5): 713–723
CrossRef
Google scholar
|
[45] |
Xu B , Hao J L . (2017). Air quality inside subway metro indoor environment worldwide: a review. Environment International, 107: 33–46
CrossRef
Google scholar
|
[46] |
Xu Q S , Liang Y Z , Shen H L . (2001). Generalized PLS regression. Journal of Chemometrics, 15(3): 135–148
CrossRef
Google scholar
|
[47] |
Yan X , Xu Y , She D , Zhang W . (2022). Reliable fault diagnosis of bearings using an optimized stacked variational denoising auto-encoder. Entropy (Basel, Switzerland), 24(1): 24010036
|
[48] |
Zhang K , Yang J , Sha J , Liu H . (2022). Dynamic slow feature analysis and random forest for subway indoor air quality modeling. Building and Environment, 213: 108876
CrossRef
Google scholar
|
[49] |
Zhang M H , Xu Q S , Massart D L . (2004). Averaged and weighted average partial least squares. Analytica Chimica Acta, 504(2): 279–289
CrossRef
Google scholar
|
[50] |
Zhang Y , Li F , Ni C , Gao S , Zhang S , Xue J , Ning Z , Wei C , Fang F , Nie Y .
CrossRef
Google scholar
|
[51] |
Zhu J , Shi H , Song B , Tao Y , Tan S . (2020). Information concentrated variational auto-encoder for quality-related nonlinear process monitoring. Journal of Process Control, 94: 12–25
CrossRef
Google scholar
|
/
〈 | 〉 |