Maximizing the utility of public data
The human genome project galvanized the scientific community around an ambitious goal. Upon completion, the project delivered several discoveries, and a new era of research commenced. More importantly, novel technologies and analysis methods materialized during the project period. The cost reduction...
Main authors: | Ahmed, Mahmoud; Kim, Hyun Joon; Kim, Deok Ryong |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Frontiers Media S.A. 2023 |
Subjects: | Genetics |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10102460/ https://www.ncbi.nlm.nih.gov/pubmed/37065493 http://dx.doi.org/10.3389/fgene.2023.1106631 |
_version_ | 1785025693398073344 |
---|---|
author | Ahmed, Mahmoud Kim, Hyun Joon Kim, Deok Ryong |
author_facet | Ahmed, Mahmoud Kim, Hyun Joon Kim, Deok Ryong |
author_sort | Ahmed, Mahmoud |
collection | PubMed |
description | The human genome project galvanized the scientific community around an ambitious goal. Upon completion, the project delivered several discoveries, and a new era of research commenced. More importantly, novel technologies and analysis methods materialized during the project period. The cost reduction allowed many more labs to generate high-throughput datasets. The project also served as a model for other extensive collaborations that generated large datasets. These datasets were made public and continue to accumulate in repositories. As a result, the scientific community should consider how these data can be utilized effectively for the purposes of research and the public good. A dataset can be re-analyzed, curated, or integrated with other forms of data to enhance its utility. We highlight three important areas to achieve this goal in this brief perspective. We also emphasize the critical requirements for these strategies to be successful. We draw on our own experience and others in using publicly available datasets to support, develop, and extend our research interest. Finally, we underline the beneficiaries and discuss some risks involved in data reuse. |
format | Online Article Text |
id | pubmed-10102460 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-10102460 2023-04-15 Maximizing the utility of public data Ahmed, Mahmoud Kim, Hyun Joon Kim, Deok Ryong Front Genet Genetics The human genome project galvanized the scientific community around an ambitious goal. Upon completion, the project delivered several discoveries, and a new era of research commenced. More importantly, novel technologies and analysis methods materialized during the project period. The cost reduction allowed many more labs to generate high-throughput datasets. The project also served as a model for other extensive collaborations that generated large datasets. These datasets were made public and continue to accumulate in repositories. As a result, the scientific community should consider how these data can be utilized effectively for the purposes of research and the public good. A dataset can be re-analyzed, curated, or integrated with other forms of data to enhance its utility. We highlight three important areas to achieve this goal in this brief perspective. We also emphasize the critical requirements for these strategies to be successful. We draw on our own experience and others in using publicly available datasets to support, develop, and extend our research interest. Finally, we underline the beneficiaries and discuss some risks involved in data reuse. Frontiers Media S.A. 2023-03-31 /pmc/articles/PMC10102460/ /pubmed/37065493 http://dx.doi.org/10.3389/fgene.2023.1106631 Text en Copyright © 2023 Ahmed, Kim and Kim. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Ahmed, Mahmoud Kim, Hyun Joon Kim, Deok Ryong Maximizing the utility of public data |
title | Maximizing the utility of public data |
title_full | Maximizing the utility of public data |
title_fullStr | Maximizing the utility of public data |
title_full_unstemmed | Maximizing the utility of public data |
title_short | Maximizing the utility of public data |
title_sort | maximizing the utility of public data |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10102460/ https://www.ncbi.nlm.nih.gov/pubmed/37065493 http://dx.doi.org/10.3389/fgene.2023.1106631 |
work_keys_str_mv | AT ahmedmahmoud maximizingtheutilityofpublicdata AT kimhyunjoon maximizingtheutilityofpublicdata AT kimdeokryong maximizingtheutilityofpublicdata |