Cargando…

A data driven methodology for social science research with left-behind children as a case study

For decades, traditional correlation analysis and regression models have been used in social science research. However, the development of machine learning algorithms makes it possible to apply machine learning techniques for social science research and social issues, which may outperform standard r...

Descripción completa

Detalles Bibliográficos
Autores principales: Wu, Chao, Wang, Guolong, Hu, Simon, Liu, Yue, Mi, Hong, Zhou, Ye, Guo, Yi-ke, Song, Tongtong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7678991/
https://www.ncbi.nlm.nih.gov/pubmed/33216786
http://dx.doi.org/10.1371/journal.pone.0242483
_version_ 1783612261429936128
author Wu, Chao
Wang, Guolong
Hu, Simon
Liu, Yue
Mi, Hong
Zhou, Ye
Guo, Yi-ke
Song, Tongtong
author_facet Wu, Chao
Wang, Guolong
Hu, Simon
Liu, Yue
Mi, Hong
Zhou, Ye
Guo, Yi-ke
Song, Tongtong
author_sort Wu, Chao
collection PubMed
description For decades, traditional correlation analysis and regression models have been used in social science research. However, the development of machine learning algorithms makes it possible to apply machine learning techniques for social science research and social issues, which may outperform standard regression methods in some cases. Under the circumstances, this article proposes a methodological workflow for data analysis by machine learning techniques that have the possibility to be widely applied in social issues. Specifically, the workflow tries to uncover the natural mechanisms behind the social issues through a data-driven perspective from feature selection to model building. The advantage of data-driven techniques in feature selection is that the workflow can be built without so much restriction of related knowledge and theory in social science. The advantage of using machine learning techniques in modelling is to uncover non-linear and complex relationships behind social issues. The main purpose of our methodological workflow is to find important fields relevant to the target and provide appropriate predictions. However, to explain the result still needs theory and knowledge from social science. In this paper, we trained a methodological workflow with left-behind children as the social issue case, and all steps and full results are included.
format Online
Article
Text
id pubmed-7678991
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-76789912020-12-02 A data driven methodology for social science research with left-behind children as a case study Wu, Chao Wang, Guolong Hu, Simon Liu, Yue Mi, Hong Zhou, Ye Guo, Yi-ke Song, Tongtong PLoS One Research Article For decades, traditional correlation analysis and regression models have been used in social science research. However, the development of machine learning algorithms makes it possible to apply machine learning techniques for social science research and social issues, which may outperform standard regression methods in some cases. Under the circumstances, this article proposes a methodological workflow for data analysis by machine learning techniques that have the possibility to be widely applied in social issues. Specifically, the workflow tries to uncover the natural mechanisms behind the social issues through a data-driven perspective from feature selection to model building. The advantage of data-driven techniques in feature selection is that the workflow can be built without so much restriction of related knowledge and theory in social science. The advantage of using machine learning techniques in modelling is to uncover non-linear and complex relationships behind social issues. The main purpose of our methodological workflow is to find important fields relevant to the target and provide appropriate predictions. However, to explain the result still needs theory and knowledge from social science. In this paper, we trained a methodological workflow with left-behind children as the social issue case, and all steps and full results are included. Public Library of Science 2020-11-20 /pmc/articles/PMC7678991/ /pubmed/33216786 http://dx.doi.org/10.1371/journal.pone.0242483 Text en © 2020 Wu et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Wu, Chao
Wang, Guolong
Hu, Simon
Liu, Yue
Mi, Hong
Zhou, Ye
Guo, Yi-ke
Song, Tongtong
A data driven methodology for social science research with left-behind children as a case study
title A data driven methodology for social science research with left-behind children as a case study
title_full A data driven methodology for social science research with left-behind children as a case study
title_fullStr A data driven methodology for social science research with left-behind children as a case study
title_full_unstemmed A data driven methodology for social science research with left-behind children as a case study
title_short A data driven methodology for social science research with left-behind children as a case study
title_sort data driven methodology for social science research with left-behind children as a case study
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7678991/
https://www.ncbi.nlm.nih.gov/pubmed/33216786
http://dx.doi.org/10.1371/journal.pone.0242483
work_keys_str_mv AT wuchao adatadrivenmethodologyforsocialscienceresearchwithleftbehindchildrenasacasestudy
AT wangguolong adatadrivenmethodologyforsocialscienceresearchwithleftbehindchildrenasacasestudy
AT husimon adatadrivenmethodologyforsocialscienceresearchwithleftbehindchildrenasacasestudy
AT liuyue adatadrivenmethodologyforsocialscienceresearchwithleftbehindchildrenasacasestudy
AT mihong adatadrivenmethodologyforsocialscienceresearchwithleftbehindchildrenasacasestudy
AT zhouye adatadrivenmethodologyforsocialscienceresearchwithleftbehindchildrenasacasestudy
AT guoyike adatadrivenmethodologyforsocialscienceresearchwithleftbehindchildrenasacasestudy
AT songtongtong adatadrivenmethodologyforsocialscienceresearchwithleftbehindchildrenasacasestudy
AT wuchao datadrivenmethodologyforsocialscienceresearchwithleftbehindchildrenasacasestudy
AT wangguolong datadrivenmethodologyforsocialscienceresearchwithleftbehindchildrenasacasestudy
AT husimon datadrivenmethodologyforsocialscienceresearchwithleftbehindchildrenasacasestudy
AT liuyue datadrivenmethodologyforsocialscienceresearchwithleftbehindchildrenasacasestudy
AT mihong datadrivenmethodologyforsocialscienceresearchwithleftbehindchildrenasacasestudy
AT zhouye datadrivenmethodologyforsocialscienceresearchwithleftbehindchildrenasacasestudy
AT guoyike datadrivenmethodologyforsocialscienceresearchwithleftbehindchildrenasacasestudy
AT songtongtong datadrivenmethodologyforsocialscienceresearchwithleftbehindchildrenasacasestudy