Back to Search Start Over

Multiple PM Low-Cost Sensors, Multiple Seasons’ Data, and Multiple Calibration Models

Authors :
S Srishti
Pratyush Agrawal
Padmavati Kulkarni
Hrishikesh Chandra Gautam
Meenakshi Kushwaha
V. Sreekanth
Source :
Aerosol and Air Quality Research, Vol 23, Iss 3, Pp 1-15 (2023)
Publication Year :
2023
Publisher :
Springer, 2023.

Abstract

Abstract In this study, we combined state-of-the-art data modelling techniques (machine learning [ML] methods) and data from state-of-the-art low-cost particulate matter (PM) sensors (LCSs) to improve the accuracy of LCS-measured PM2.5 (PM with aerodynamic diameter less than 2.5 microns) mass concentrations. We collocated nine LCSs and a reference PM2.5 instrument for 9 months, covering all local seasons, in Bengaluru, India. Using the collocation data, we evaluated the performance of the LCSs and trained around 170 ML models to reduce the observed bias in the LCS-measured PM2.5. The ML models included (i) Decision Tree, (ii) Random Forest (RF), (iii) eXtreme Gradient Boosting, and (iv) Support Vector Regression (SVR). A hold-out validation was performed to assess the model performance. Model performance metrics included (i) coefficient of determination (R2), (ii) root mean square error (RMSE), (iii) normalised RMSE, and (iv) mean absolute error. We found that the bias in the LCS PM2.5 measurements varied across different LCS types (RMSE = 8–29 µg m−3) and that SVR models performed best in correcting the LCS PM2.5 measurements. Hyperparameter tuning improved the performance of the ML models (except for RF). The performance of ML models trained with significant predictors (fewer in number than the number of all predictors, chosen based on recursive feature elimination algorithm) was comparable to that of the ‘all predictors’ trained models (except for RF). The performance of most ML models was better than that of the linear models. Finally, as a research objective, we introduced the collocated black carbon mass concentration measurements into the ML models but found no significant improvement in the model performance.

Details

Language :
English
ISSN :
16808584 and 20711409
Volume :
23
Issue :
3
Database :
Directory of Open Access Journals
Journal :
Aerosol and Air Quality Research
Publication Type :
Academic Journal
Accession number :
edsdoj.632c88894b0f40d88a6c636401a0627c
Document Type :
article
Full Text :
https://doi.org/10.4209/aaqr.220428