Back to Search
Start Over
Multiple PM Low-Cost Sensors, Multiple Seasons’ Data, and Multiple Calibration Models
- Source :
- Aerosol and Air Quality Research, Vol 23, Iss 3, Pp 1-15 (2023)
- Publication Year :
- 2023
- Publisher :
- Springer, 2023.
-
Abstract
- Abstract In this study, we combined state-of-the-art data modelling techniques (machine learning [ML] methods) and data from state-of-the-art low-cost particulate matter (PM) sensors (LCSs) to improve the accuracy of LCS-measured PM2.5 (PM with aerodynamic diameter less than 2.5 microns) mass concentrations. We collocated nine LCSs and a reference PM2.5 instrument for 9 months, covering all local seasons, in Bengaluru, India. Using the collocation data, we evaluated the performance of the LCSs and trained around 170 ML models to reduce the observed bias in the LCS-measured PM2.5. The ML models included (i) Decision Tree, (ii) Random Forest (RF), (iii) eXtreme Gradient Boosting, and (iv) Support Vector Regression (SVR). A hold-out validation was performed to assess the model performance. Model performance metrics included (i) coefficient of determination (R2), (ii) root mean square error (RMSE), (iii) normalised RMSE, and (iv) mean absolute error. We found that the bias in the LCS PM2.5 measurements varied across different LCS types (RMSE = 8–29 µg m−3) and that SVR models performed best in correcting the LCS PM2.5 measurements. Hyperparameter tuning improved the performance of the ML models (except for RF). The performance of ML models trained with significant predictors (fewer in number than the number of all predictors, chosen based on recursive feature elimination algorithm) was comparable to that of the ‘all predictors’ trained models (except for RF). The performance of most ML models was better than that of the linear models. Finally, as a research objective, we introduced the collocated black carbon mass concentration measurements into the ML models but found no significant improvement in the model performance.
- Subjects :
- Plantower
Beta attenuation monitor
Support vector regression
Science
Subjects
Details
- Language :
- English
- ISSN :
- 16808584 and 20711409
- Volume :
- 23
- Issue :
- 3
- Database :
- Directory of Open Access Journals
- Journal :
- Aerosol and Air Quality Research
- Publication Type :
- Academic Journal
- Accession number :
- edsdoj.632c88894b0f40d88a6c636401a0627c
- Document Type :
- article
- Full Text :
- https://doi.org/10.4209/aaqr.220428