Cargando…

Data analysis and modeling pipelines for controlled networked social science experiments

There is large interest in networked social science experiments for understanding human behavior at-scale. Significant effort is required to perform data analytics on experimental outputs and for computational modeling of custom experiments. Moreover, experiments and modeling are often performed in...

Descripción completa

Detalles Bibliográficos
Autores principales: Cedeno-Mieles, Vanessa, Hu, Zhihao, Ren, Yihui, Deng, Xinwei, Contractor, Noshir, Ekanayake, Saliya, Epstein, Joshua M., Goode, Brian J., Korkmaz, Gizem, Kuhlman, Chris J., Machi, Dustin, Macy, Michael, Marathe, Madhav V., Ramakrishnan, Naren, Saraf, Parang, Self, Nathan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7685486/
https://www.ncbi.nlm.nih.gov/pubmed/33232347
http://dx.doi.org/10.1371/journal.pone.0242453
_version_ 1783613192500412416
author Cedeno-Mieles, Vanessa
Hu, Zhihao
Ren, Yihui
Deng, Xinwei
Contractor, Noshir
Ekanayake, Saliya
Epstein, Joshua M.
Goode, Brian J.
Korkmaz, Gizem
Kuhlman, Chris J.
Machi, Dustin
Macy, Michael
Marathe, Madhav V.
Ramakrishnan, Naren
Saraf, Parang
Self, Nathan
author_facet Cedeno-Mieles, Vanessa
Hu, Zhihao
Ren, Yihui
Deng, Xinwei
Contractor, Noshir
Ekanayake, Saliya
Epstein, Joshua M.
Goode, Brian J.
Korkmaz, Gizem
Kuhlman, Chris J.
Machi, Dustin
Macy, Michael
Marathe, Madhav V.
Ramakrishnan, Naren
Saraf, Parang
Self, Nathan
author_sort Cedeno-Mieles, Vanessa
collection PubMed
description There is large interest in networked social science experiments for understanding human behavior at-scale. Significant effort is required to perform data analytics on experimental outputs and for computational modeling of custom experiments. Moreover, experiments and modeling are often performed in a cycle, enabling iterative experimental refinement and data modeling to uncover interesting insights and to generate/refute hypotheses about social behaviors. The current practice for social analysts is to develop tailor-made computer programs and analytical scripts for experiments and modeling. This often leads to inefficiencies and duplication of effort. In this work, we propose a pipeline framework to take a significant step towards overcoming these challenges. Our contribution is to describe the design and implementation of a software system to automate many of the steps involved in analyzing social science experimental data, building models to capture the behavior of human subjects, and providing data to test hypotheses. The proposed pipeline framework consists of formal models, formal algorithms, and theoretical models as the basis for the design and implementation. We propose a formal data model, such that if an experiment can be described in terms of this model, then our pipeline software can be used to analyze data efficiently. The merits of the proposed pipeline framework is elaborated by several case studies of networked social science experiments.
format Online
Article
Text
id pubmed-7685486
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-76854862020-12-02 Data analysis and modeling pipelines for controlled networked social science experiments Cedeno-Mieles, Vanessa Hu, Zhihao Ren, Yihui Deng, Xinwei Contractor, Noshir Ekanayake, Saliya Epstein, Joshua M. Goode, Brian J. Korkmaz, Gizem Kuhlman, Chris J. Machi, Dustin Macy, Michael Marathe, Madhav V. Ramakrishnan, Naren Saraf, Parang Self, Nathan PLoS One Research Article There is large interest in networked social science experiments for understanding human behavior at-scale. Significant effort is required to perform data analytics on experimental outputs and for computational modeling of custom experiments. Moreover, experiments and modeling are often performed in a cycle, enabling iterative experimental refinement and data modeling to uncover interesting insights and to generate/refute hypotheses about social behaviors. The current practice for social analysts is to develop tailor-made computer programs and analytical scripts for experiments and modeling. This often leads to inefficiencies and duplication of effort. In this work, we propose a pipeline framework to take a significant step towards overcoming these challenges. Our contribution is to describe the design and implementation of a software system to automate many of the steps involved in analyzing social science experimental data, building models to capture the behavior of human subjects, and providing data to test hypotheses. The proposed pipeline framework consists of formal models, formal algorithms, and theoretical models as the basis for the design and implementation. We propose a formal data model, such that if an experiment can be described in terms of this model, then our pipeline software can be used to analyze data efficiently. The merits of the proposed pipeline framework is elaborated by several case studies of networked social science experiments. Public Library of Science 2020-11-24 /pmc/articles/PMC7685486/ /pubmed/33232347 http://dx.doi.org/10.1371/journal.pone.0242453 Text en © 2020 Cedeno-Mieles et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Cedeno-Mieles, Vanessa
Hu, Zhihao
Ren, Yihui
Deng, Xinwei
Contractor, Noshir
Ekanayake, Saliya
Epstein, Joshua M.
Goode, Brian J.
Korkmaz, Gizem
Kuhlman, Chris J.
Machi, Dustin
Macy, Michael
Marathe, Madhav V.
Ramakrishnan, Naren
Saraf, Parang
Self, Nathan
Data analysis and modeling pipelines for controlled networked social science experiments
title Data analysis and modeling pipelines for controlled networked social science experiments
title_full Data analysis and modeling pipelines for controlled networked social science experiments
title_fullStr Data analysis and modeling pipelines for controlled networked social science experiments
title_full_unstemmed Data analysis and modeling pipelines for controlled networked social science experiments
title_short Data analysis and modeling pipelines for controlled networked social science experiments
title_sort data analysis and modeling pipelines for controlled networked social science experiments
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7685486/
https://www.ncbi.nlm.nih.gov/pubmed/33232347
http://dx.doi.org/10.1371/journal.pone.0242453
work_keys_str_mv AT cedenomielesvanessa dataanalysisandmodelingpipelinesforcontrollednetworkedsocialscienceexperiments
AT huzhihao dataanalysisandmodelingpipelinesforcontrollednetworkedsocialscienceexperiments
AT renyihui dataanalysisandmodelingpipelinesforcontrollednetworkedsocialscienceexperiments
AT dengxinwei dataanalysisandmodelingpipelinesforcontrollednetworkedsocialscienceexperiments
AT contractornoshir dataanalysisandmodelingpipelinesforcontrollednetworkedsocialscienceexperiments
AT ekanayakesaliya dataanalysisandmodelingpipelinesforcontrollednetworkedsocialscienceexperiments
AT epsteinjoshuam dataanalysisandmodelingpipelinesforcontrollednetworkedsocialscienceexperiments
AT goodebrianj dataanalysisandmodelingpipelinesforcontrollednetworkedsocialscienceexperiments
AT korkmazgizem dataanalysisandmodelingpipelinesforcontrollednetworkedsocialscienceexperiments
AT kuhlmanchrisj dataanalysisandmodelingpipelinesforcontrollednetworkedsocialscienceexperiments
AT machidustin dataanalysisandmodelingpipelinesforcontrollednetworkedsocialscienceexperiments
AT macymichael dataanalysisandmodelingpipelinesforcontrollednetworkedsocialscienceexperiments
AT marathemadhavv dataanalysisandmodelingpipelinesforcontrollednetworkedsocialscienceexperiments
AT ramakrishnannaren dataanalysisandmodelingpipelinesforcontrollednetworkedsocialscienceexperiments
AT sarafparang dataanalysisandmodelingpipelinesforcontrollednetworkedsocialscienceexperiments
AT selfnathan dataanalysisandmodelingpipelinesforcontrollednetworkedsocialscienceexperiments