Cargando…
A data driven methodology for social science research with left-behind children as a case study
For decades, traditional correlation analysis and regression models have been used in social science research. However, the development of machine learning algorithms makes it possible to apply machine learning techniques for social science research and social issues, which may outperform standard r...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7678991/ https://www.ncbi.nlm.nih.gov/pubmed/33216786 http://dx.doi.org/10.1371/journal.pone.0242483 |
_version_ | 1783612261429936128 |
---|---|
author | Wu, Chao Wang, Guolong Hu, Simon Liu, Yue Mi, Hong Zhou, Ye Guo, Yi-ke Song, Tongtong |
author_facet | Wu, Chao Wang, Guolong Hu, Simon Liu, Yue Mi, Hong Zhou, Ye Guo, Yi-ke Song, Tongtong |
author_sort | Wu, Chao |
collection | PubMed |
description | For decades, traditional correlation analysis and regression models have been used in social science research. However, the development of machine learning algorithms makes it possible to apply machine learning techniques for social science research and social issues, which may outperform standard regression methods in some cases. Under the circumstances, this article proposes a methodological workflow for data analysis by machine learning techniques that have the possibility to be widely applied in social issues. Specifically, the workflow tries to uncover the natural mechanisms behind the social issues through a data-driven perspective from feature selection to model building. The advantage of data-driven techniques in feature selection is that the workflow can be built without so much restriction of related knowledge and theory in social science. The advantage of using machine learning techniques in modelling is to uncover non-linear and complex relationships behind social issues. The main purpose of our methodological workflow is to find important fields relevant to the target and provide appropriate predictions. However, to explain the result still needs theory and knowledge from social science. In this paper, we trained a methodological workflow with left-behind children as the social issue case, and all steps and full results are included. |
format | Online Article Text |
id | pubmed-7678991 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-76789912020-12-02 A data driven methodology for social science research with left-behind children as a case study Wu, Chao Wang, Guolong Hu, Simon Liu, Yue Mi, Hong Zhou, Ye Guo, Yi-ke Song, Tongtong PLoS One Research Article For decades, traditional correlation analysis and regression models have been used in social science research. However, the development of machine learning algorithms makes it possible to apply machine learning techniques for social science research and social issues, which may outperform standard regression methods in some cases. Under the circumstances, this article proposes a methodological workflow for data analysis by machine learning techniques that have the possibility to be widely applied in social issues. Specifically, the workflow tries to uncover the natural mechanisms behind the social issues through a data-driven perspective from feature selection to model building. The advantage of data-driven techniques in feature selection is that the workflow can be built without so much restriction of related knowledge and theory in social science. The advantage of using machine learning techniques in modelling is to uncover non-linear and complex relationships behind social issues. The main purpose of our methodological workflow is to find important fields relevant to the target and provide appropriate predictions. However, to explain the result still needs theory and knowledge from social science. In this paper, we trained a methodological workflow with left-behind children as the social issue case, and all steps and full results are included. Public Library of Science 2020-11-20 /pmc/articles/PMC7678991/ /pubmed/33216786 http://dx.doi.org/10.1371/journal.pone.0242483 Text en © 2020 Wu et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Wu, Chao Wang, Guolong Hu, Simon Liu, Yue Mi, Hong Zhou, Ye Guo, Yi-ke Song, Tongtong A data driven methodology for social science research with left-behind children as a case study |
title | A data driven methodology for social science research with left-behind children as a case study |
title_full | A data driven methodology for social science research with left-behind children as a case study |
title_fullStr | A data driven methodology for social science research with left-behind children as a case study |
title_full_unstemmed | A data driven methodology for social science research with left-behind children as a case study |
title_short | A data driven methodology for social science research with left-behind children as a case study |
title_sort | data driven methodology for social science research with left-behind children as a case study |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7678991/ https://www.ncbi.nlm.nih.gov/pubmed/33216786 http://dx.doi.org/10.1371/journal.pone.0242483 |
work_keys_str_mv | AT wuchao adatadrivenmethodologyforsocialscienceresearchwithleftbehindchildrenasacasestudy AT wangguolong adatadrivenmethodologyforsocialscienceresearchwithleftbehindchildrenasacasestudy AT husimon adatadrivenmethodologyforsocialscienceresearchwithleftbehindchildrenasacasestudy AT liuyue adatadrivenmethodologyforsocialscienceresearchwithleftbehindchildrenasacasestudy AT mihong adatadrivenmethodologyforsocialscienceresearchwithleftbehindchildrenasacasestudy AT zhouye adatadrivenmethodologyforsocialscienceresearchwithleftbehindchildrenasacasestudy AT guoyike adatadrivenmethodologyforsocialscienceresearchwithleftbehindchildrenasacasestudy AT songtongtong adatadrivenmethodologyforsocialscienceresearchwithleftbehindchildrenasacasestudy AT wuchao datadrivenmethodologyforsocialscienceresearchwithleftbehindchildrenasacasestudy AT wangguolong datadrivenmethodologyforsocialscienceresearchwithleftbehindchildrenasacasestudy AT husimon datadrivenmethodologyforsocialscienceresearchwithleftbehindchildrenasacasestudy AT liuyue datadrivenmethodologyforsocialscienceresearchwithleftbehindchildrenasacasestudy AT mihong datadrivenmethodologyforsocialscienceresearchwithleftbehindchildrenasacasestudy AT zhouye datadrivenmethodologyforsocialscienceresearchwithleftbehindchildrenasacasestudy AT guoyike datadrivenmethodologyforsocialscienceresearchwithleftbehindchildrenasacasestudy AT songtongtong datadrivenmethodologyforsocialscienceresearchwithleftbehindchildrenasacasestudy |