Cargando…

Long-term preservation of biomedical research data

Genomics and molecular imaging, along with clinical and translational research have transformed biomedical science into a data-intensive scientific endeavor. For researchers to benefit from Big Data sets, developing long-term biomedical digital data preservation strategy is very important. In this o...

Descripción completa

Detalles Bibliográficos
Autores principales: Navale, Vivek, McAuliffe, Matthew
Formato: Online Artículo Texto
Lenguaje:English
Publicado: F1000 Research Limited 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6144948/
https://www.ncbi.nlm.nih.gov/pubmed/30356367
http://dx.doi.org/10.12688/f1000research.16015.1
_version_ 1783356175175122944
author Navale, Vivek
McAuliffe, Matthew
author_facet Navale, Vivek
McAuliffe, Matthew
author_sort Navale, Vivek
collection PubMed
description Genomics and molecular imaging, along with clinical and translational research have transformed biomedical science into a data-intensive scientific endeavor. For researchers to benefit from Big Data sets, developing long-term biomedical digital data preservation strategy is very important. In this opinion article, we discuss specific actions that researchers and institutions can take to make research data a continued resource even after research projects have reached the end of their lifecycle. The actions involve utilizing an Open Archival Information System model comprised of six functional entities: Ingest, Access, Data Management, Archival Storage, Administration and Preservation Planning. We believe that involvement of data stewards early in the digital data life-cycle management process can significantly contribute towards long term preservation of biomedical data. Developing data collection strategies consistent with institutional policies, and encouraging the use of common data elements in clinical research, patient registries and other human subject research can be advantageous for data sharing and integration purposes. Specifically, data stewards at the onset of research program should engage with established repositories and curators to develop data sustainability plans for research data. Placing equal importance on the requirements for initial activities (e.g., collection, processing, storage) with subsequent activities (data analysis, sharing) can improve data quality, provide traceability and support reproducibility. Preparing and tracking data provenance, using common data elements and biomedical ontologies are important for standardizing the data description, making the interpretation and reuse of data easier. The Big Data biomedical community requires scalable platform that can support the diversity and complexity of data ingest modes (e.g. machine, software or human entry modes). Secure virtual workspaces to integrate and manipulate data, with shared software programs (e.g., bioinformatics tools), can facilitate the FAIR (Findable, Accessible, Interoperable and Reusable) use of data for near- and long-term research needs.
format Online
Article
Text
id pubmed-6144948
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher F1000 Research Limited
record_format MEDLINE/PubMed
spelling pubmed-61449482018-10-22 Long-term preservation of biomedical research data Navale, Vivek McAuliffe, Matthew F1000Res Opinion Article Genomics and molecular imaging, along with clinical and translational research have transformed biomedical science into a data-intensive scientific endeavor. For researchers to benefit from Big Data sets, developing long-term biomedical digital data preservation strategy is very important. In this opinion article, we discuss specific actions that researchers and institutions can take to make research data a continued resource even after research projects have reached the end of their lifecycle. The actions involve utilizing an Open Archival Information System model comprised of six functional entities: Ingest, Access, Data Management, Archival Storage, Administration and Preservation Planning. We believe that involvement of data stewards early in the digital data life-cycle management process can significantly contribute towards long term preservation of biomedical data. Developing data collection strategies consistent with institutional policies, and encouraging the use of common data elements in clinical research, patient registries and other human subject research can be advantageous for data sharing and integration purposes. Specifically, data stewards at the onset of research program should engage with established repositories and curators to develop data sustainability plans for research data. Placing equal importance on the requirements for initial activities (e.g., collection, processing, storage) with subsequent activities (data analysis, sharing) can improve data quality, provide traceability and support reproducibility. Preparing and tracking data provenance, using common data elements and biomedical ontologies are important for standardizing the data description, making the interpretation and reuse of data easier. The Big Data biomedical community requires scalable platform that can support the diversity and complexity of data ingest modes (e.g. machine, software or human entry modes). Secure virtual workspaces to integrate and manipulate data, with shared software programs (e.g., bioinformatics tools), can facilitate the FAIR (Findable, Accessible, Interoperable and Reusable) use of data for near- and long-term research needs. F1000 Research Limited 2018-08-29 /pmc/articles/PMC6144948/ /pubmed/30356367 http://dx.doi.org/10.12688/f1000research.16015.1 Text en Copyright: © 2018 Navale V and McAuliffe M http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The author(s) is/are employees of the US Government and therefore domestic copyright protection in USA does not apply to this work. The work may be protected under the copyright laws of other jurisdictions when used in those jurisdictions.
spellingShingle Opinion Article
Navale, Vivek
McAuliffe, Matthew
Long-term preservation of biomedical research data
title Long-term preservation of biomedical research data
title_full Long-term preservation of biomedical research data
title_fullStr Long-term preservation of biomedical research data
title_full_unstemmed Long-term preservation of biomedical research data
title_short Long-term preservation of biomedical research data
title_sort long-term preservation of biomedical research data
topic Opinion Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6144948/
https://www.ncbi.nlm.nih.gov/pubmed/30356367
http://dx.doi.org/10.12688/f1000research.16015.1
work_keys_str_mv AT navalevivek longtermpreservationofbiomedicalresearchdata
AT mcauliffematthew longtermpreservationofbiomedicalresearchdata