Back to Search Start Over

Using multiple linear regression and random forests to identify spatial poverty determinants in rural China.

Authors :
Liu, Mengxiao
Hu, Shan
Ge, Yong
Heuvelink, Gerard B.M.
Ren, Zhoupeng
Huang, Xiaoran
Source :
Spatial Statistics (2211-6753); Apr2021, Vol. 42, pN.PAG-N.PAG, 1p
Publication Year :
2021

Abstract

Identifying poverty determinants in a region is crucial for taking effective poverty reduction measures. This paper utilizes two variable importance analysis methods to identify the relative importance of different geographic factors to explain the spatial distribution of poverty: the Lindeman, Merenda, and Gold (LMG) method used in multiple linear regression (MLR) and variable importance used in random forest (RF) machine learning. A case study was conducted in Yunyang, a poverty-stricken county in China, to evaluate the performances of the two methods for identifying village-level poverty determinants. The results indicated that: (1) MLR and RF had similar explanation accuracy; (2) LMG and RF were consistent in the three main determinants of poverty; (3) LMG better identified the importance of variables that were highly related to poverty but correlated with other variables, while RF better identified the non-linear relationships between poverty and explanatory variables; (4) accessibility metrics are the most important variables influencing poverty in Yunyang and have a linear relationship with poverty. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
22116753
Volume :
42
Database :
Supplemental Index
Journal :
Spatial Statistics (2211-6753)
Publication Type :
Academic Journal
Accession number :
149367252
Full Text :
https://doi.org/10.1016/j.spasta.2020.100461