A data driven methodology for social science research with left-behind children as a case study.

Authors :
Wu C
Wang G
Hu S
Liu Y
Mi H
Zhou Y
Guo YK
Song T
Source :
PloS one [PLoS One] 2020 Nov 20; Vol. 15 (11), pp. e0242483. Date of Electronic Publication: 2020 Nov 20 (Print Publication: 2020).
Publication Year :
2020

Abstract

For decades, social science research has relied on traditional correlation analysis and regression models. Advances in machine learning algorithms now make it possible to apply machine learning techniques to social science research and social issues, where they may outperform standard regression methods in some cases. This article therefore proposes a methodological workflow for data analysis with machine learning techniques that could be widely applied to social issues. Specifically, the workflow tries to uncover the natural mechanisms behind social issues from a data-driven perspective, from feature selection through model building. The advantage of data-driven feature selection is that the workflow can be built without relying heavily on prior knowledge and theory from social science. The advantage of machine learning techniques in modelling is that they can uncover non-linear and complex relationships behind social issues. The main purpose of the workflow is to find the fields most relevant to the target and to provide appropriate predictions. Explaining the results, however, still requires theory and knowledge from social science. In this paper, we apply the workflow to left-behind children as the social issue case, and all steps and full results are included.

Competing Interests: The authors have declared that no competing interests exist.

Details

Language :
English
ISSN :
1932-6203
Volume :
15
Issue :
11
Database :
MEDLINE
Journal :
PloS one
Publication Type :
Academic Journal
Accession number :
33216786
Full Text :
https://doi.org/10.1371/journal.pone.0242483