Cargando…

Datathons and Software to Promote Reproducible Research

BACKGROUND: Datathons facilitate collaboration between clinicians, statisticians, and data scientists in order to answer important clinical questions. Previous datathons have resulted in numerous publications of interest to the critical care community and serve as a viable model for interdisciplinar...

Descripción completa

Detalles Bibliográficos
Autores principales: Celi, Leo Anthony, Lokhandwala, Sharukh, Montgomery, Robert, Moses, Christopher, Naumann, Tristan, Pollard, Tom, Spitz, Daniel, Stretch, Robert
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5014984/
https://www.ncbi.nlm.nih.gov/pubmed/27558834
http://dx.doi.org/10.2196/jmir.6365
_version_ 1782452358765084672
author Celi, Leo Anthony
Lokhandwala, Sharukh
Montgomery, Robert
Moses, Christopher
Naumann, Tristan
Pollard, Tom
Spitz, Daniel
Stretch, Robert
author_facet Celi, Leo Anthony
Lokhandwala, Sharukh
Montgomery, Robert
Moses, Christopher
Naumann, Tristan
Pollard, Tom
Spitz, Daniel
Stretch, Robert
author_sort Celi, Leo Anthony
collection PubMed
description BACKGROUND: Datathons facilitate collaboration between clinicians, statisticians, and data scientists in order to answer important clinical questions. Previous datathons have resulted in numerous publications of interest to the critical care community and serve as a viable model for interdisciplinary collaboration. OBJECTIVE: We report on an open-source software called Chatto that was created by members of our group, in the context of the second international Critical Care Datathon, held in September 2015. METHODS: Datathon participants formed teams to discuss potential research questions and the methods required to address them. They were provided with the Chatto suite of tools to facilitate their teamwork. Each multidisciplinary team spent the next 2 days with clinicians working alongside data scientists to write code, extract and analyze data, and reformulate their queries in real time as needed. All projects were then presented on the last day of the datathon to a panel of judges that consisted of clinicians and scientists. RESULTS: Use of Chatto was particularly effective in the datathon setting, enabling teams to reduce the time spent configuring their research environments to just a few minutes—a process that would normally take hours to days. Chatto continued to serve as a useful research tool after the conclusion of the datathon. CONCLUSIONS: This suite of tools fulfills two purposes: (1) facilitation of interdisciplinary teamwork through archiving and version control of datasets, analytical code, and team discussions, and (2) advancement of research reproducibility by functioning postpublication as an online environment in which independent investigators can rerun or modify analyses with relative ease. With the introduction of Chatto, we hope to solve a variety of challenges presented by collaborative data mining projects while improving research reproducibility.
format Online
Article
Text
id pubmed-5014984
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-50149842016-09-20 Datathons and Software to Promote Reproducible Research Celi, Leo Anthony Lokhandwala, Sharukh Montgomery, Robert Moses, Christopher Naumann, Tristan Pollard, Tom Spitz, Daniel Stretch, Robert J Med Internet Res Original Paper BACKGROUND: Datathons facilitate collaboration between clinicians, statisticians, and data scientists in order to answer important clinical questions. Previous datathons have resulted in numerous publications of interest to the critical care community and serve as a viable model for interdisciplinary collaboration. OBJECTIVE: We report on an open-source software called Chatto that was created by members of our group, in the context of the second international Critical Care Datathon, held in September 2015. METHODS: Datathon participants formed teams to discuss potential research questions and the methods required to address them. They were provided with the Chatto suite of tools to facilitate their teamwork. Each multidisciplinary team spent the next 2 days with clinicians working alongside data scientists to write code, extract and analyze data, and reformulate their queries in real time as needed. All projects were then presented on the last day of the datathon to a panel of judges that consisted of clinicians and scientists. RESULTS: Use of Chatto was particularly effective in the datathon setting, enabling teams to reduce the time spent configuring their research environments to just a few minutes—a process that would normally take hours to days. Chatto continued to serve as a useful research tool after the conclusion of the datathon. CONCLUSIONS: This suite of tools fulfills two purposes: (1) facilitation of interdisciplinary teamwork through archiving and version control of datasets, analytical code, and team discussions, and (2) advancement of research reproducibility by functioning postpublication as an online environment in which independent investigators can rerun or modify analyses with relative ease. With the introduction of Chatto, we hope to solve a variety of challenges presented by collaborative data mining projects while improving research reproducibility. JMIR Publications 2016-08-24 /pmc/articles/PMC5014984/ /pubmed/27558834 http://dx.doi.org/10.2196/jmir.6365 Text en ©Leo Anthony Celi, Sharukh Lokhandwala, Robert Montgomery, Christopher Moses, Tristan Naumann, Tom Pollard, Daniel Spitz, Robert Stretch. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 24.08.2016. https://creativecommons.org/licenses/by/2.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0/ (https://creativecommons.org/licenses/by/2.0/) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Celi, Leo Anthony
Lokhandwala, Sharukh
Montgomery, Robert
Moses, Christopher
Naumann, Tristan
Pollard, Tom
Spitz, Daniel
Stretch, Robert
Datathons and Software to Promote Reproducible Research
title Datathons and Software to Promote Reproducible Research
title_full Datathons and Software to Promote Reproducible Research
title_fullStr Datathons and Software to Promote Reproducible Research
title_full_unstemmed Datathons and Software to Promote Reproducible Research
title_short Datathons and Software to Promote Reproducible Research
title_sort datathons and software to promote reproducible research
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5014984/
https://www.ncbi.nlm.nih.gov/pubmed/27558834
http://dx.doi.org/10.2196/jmir.6365
work_keys_str_mv AT celileoanthony datathonsandsoftwaretopromotereproducibleresearch
AT lokhandwalasharukh datathonsandsoftwaretopromotereproducibleresearch
AT montgomeryrobert datathonsandsoftwaretopromotereproducibleresearch
AT moseschristopher datathonsandsoftwaretopromotereproducibleresearch
AT naumanntristan datathonsandsoftwaretopromotereproducibleresearch
AT pollardtom datathonsandsoftwaretopromotereproducibleresearch
AT spitzdaniel datathonsandsoftwaretopromotereproducibleresearch
AT stretchrobert datathonsandsoftwaretopromotereproducibleresearch