1. A machine learning model to estimate ambient PM2.5 concentrations in industrialized highveld region of South Africa.
- Author
-
Zhang, Danlu, Du, Linlin, Wang, Wenhao, Zhu, Qingyang, Bi, Jianzhao, Scovronick, Noah, Naidoo, Mogesh, Garland, Rebecca M., and Liu, Yang
- Subjects
- *
AIR quality standards , *MACHINE learning , *STANDARD deviations , *SPATIAL resolution , *PARTICULATE matter , *RANDOM forest algorithms - Abstract
Exposure to fine particulate matter (PM 2.5) has been linked to a substantial disease burden globally, yet little has been done to estimate the population health risks of PM 2.5 in South Africa due to the lack of high-resolution PM 2.5 exposure estimates. We developed a random forest model to estimate daily PM 2.5 concentrations at 1 km2 resolution in and around industrialized Gauteng Province, South Africa, by combining satellite aerosol optical depth (AOD), meteorology, land use, and socioeconomic data. We then compared PM 2.5 concentrations in the study domain before and after the implementation of the new national air quality standards. We aimed to test whether machine learning models are suitable for regions with sparse ground observations such as South Africa and which predictors played important roles in PM 2.5 modeling. The cross-validation R2 and Root Mean Square Error of our model was 0.80 and 9.40 μg/m3, respectively. Satellite AOD, seasonal indicator, total precipitation, and population were among the most important predictors. Model-estimated PM 2.5 levels successfully captured the temporal pattern recorded by ground observations. Spatially, the highest annual PM 2.5 concentration appeared in central and northern Gauteng, including northern Johannesburg and the city of Tshwane. Since the 2016 changes in national PM 2.5 standards, PM 2.5 concentrations have decreased in most of our study region, although levels in Johannesburg and its surrounding areas have remained relatively constant. This is anadvanced PM 2.5 model for South Africa with high prediction accuracy at the daily level and at a relatively high spatial resolution. Our study provided a reference for predictor selection, and our results can be used for a variety of purposes, including epidemiological research, burden of disease assessments, and policy evaluation. • We developed a random forest model to estimate daily PM 2.5 concentrations at 1 km2 resolution in South Africa. • Our model captured seasonal trends and spatial patterns of PM 2.5 with relatively high accuracy. • High PM 2.5 levels were identified in low-income settlements and industrial areas in western Mpumalanga. • PM2.5 levels decreased in north of Gauteng province after the implementation of new air quality standard. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF