Maximizing the utility of public data

The human genome project galvanized the scientific community around an ambitious goal. Upon completion, the project delivered several discoveries, and a new era of research commenced. More importantly, novel technologies and analysis methods materialized during the project period. The cost reduction...

Bibliographic Details
Main authors: Ahmed, Mahmoud; Kim, Hyun Joon; Kim, Deok Ryong
Format: Online Article Text
Language: English
Published: Frontiers Media S.A. 2023
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10102460/
https://www.ncbi.nlm.nih.gov/pubmed/37065493
http://dx.doi.org/10.3389/fgene.2023.1106631
_version_ 1785025693398073344
author Ahmed, Mahmoud
Kim, Hyun Joon
Kim, Deok Ryong
author_facet Ahmed, Mahmoud
Kim, Hyun Joon
Kim, Deok Ryong
author_sort Ahmed, Mahmoud
collection PubMed
description The Human Genome Project galvanized the scientific community around an ambitious goal. Upon completion, the project delivered several discoveries, and a new era of research commenced. More importantly, novel technologies and analysis methods materialized during the project period. The cost reduction allowed many more labs to generate high-throughput datasets. The project also served as a model for other extensive collaborations that generated large datasets. These datasets were made public and continue to accumulate in repositories. As a result, the scientific community should consider how these data can be utilized effectively for the purposes of research and the public good. A dataset can be re-analyzed, curated, or integrated with other forms of data to enhance its utility. We highlight three important areas to achieve this goal in this brief perspective. We also emphasize the critical requirements for these strategies to be successful. We draw on our own experience and that of others in using publicly available datasets to support, develop, and extend our research interests. Finally, we underline the beneficiaries and discuss some risks involved in data reuse.
format Online
Article
Text
id pubmed-10102460
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-101024602023-04-15 Maximizing the utility of public data Ahmed, Mahmoud Kim, Hyun Joon Kim, Deok Ryong Front Genet Genetics The human genome project galvanized the scientific community around an ambitious goal. Upon completion, the project delivered several discoveries, and a new era of research commenced. More importantly, novel technologies and analysis methods materialized during the project period. The cost reduction allowed many more labs to generate high-throughput datasets. The project also served as a model for other extensive collaborations that generated large datasets. These datasets were made public and continue to accumulate in repositories. As a result, the scientific community should consider how these data can be utilized effectively for the purposes of research and the public good. A dataset can be re-analyzed, curated, or integrated with other forms of data to enhance its utility. We highlight three important areas to achieve this goal in this brief perspective. We also emphasize the critical requirements for these strategies to be successful. We draw on our own experience and others in using publicly available datasets to support, develop, and extend our research interest. Finally, we underline the beneficiaries and discuss some risks involved in data reuse. Frontiers Media S.A. 2023-03-31 /pmc/articles/PMC10102460/ /pubmed/37065493 http://dx.doi.org/10.3389/fgene.2023.1106631 Text en Copyright © 2023 Ahmed, Kim and Kim. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Ahmed, Mahmoud
Kim, Hyun Joon
Kim, Deok Ryong
Maximizing the utility of public data
title Maximizing the utility of public data
title_full Maximizing the utility of public data
title_fullStr Maximizing the utility of public data
title_full_unstemmed Maximizing the utility of public data
title_short Maximizing the utility of public data
title_sort maximizing the utility of public data
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10102460/
https://www.ncbi.nlm.nih.gov/pubmed/37065493
http://dx.doi.org/10.3389/fgene.2023.1106631
work_keys_str_mv AT ahmedmahmoud maximizingtheutilityofpublicdata
AT kimhyunjoon maximizingtheutilityofpublicdata
AT kimdeokryong maximizingtheutilityofpublicdata