Cargando…

Where in the genome are we? A cautionary tale of database use in genomics research

With the advent of high throughput data genomic technologies the volume of available data is now staggering. In addition databases that provide resources to annotate, translate, and connect biological data have grown exponentially in content and use. The availability of such data emphasizes the impo...

Descripción completa

Detalles Bibliográficos
Autores principales: Vaughan, Laura K., Srinivasasainagendra, Vinodh
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3604632/
https://www.ncbi.nlm.nih.gov/pubmed/23519237
http://dx.doi.org/10.3389/fgene.2013.00038
_version_ 1782263773938057216
author Vaughan, Laura K.
Srinivasasainagendra, Vinodh
author_facet Vaughan, Laura K.
Srinivasasainagendra, Vinodh
author_sort Vaughan, Laura K.
collection PubMed
description With the advent of high throughput data genomic technologies the volume of available data is now staggering. In addition databases that provide resources to annotate, translate, and connect biological data have grown exponentially in content and use. The availability of such data emphasizes the importance of bioinformatics and computational biology in genomics research and has led to the development of thousands of tools to integrate and utilize these resources. When utilizing such resources, the principles of reproducible research are often overlooked. In this manuscript we provide selected case studies illustrating issues that may arise while working with genes and genetic polymorphisms. These case studies illustrate potential sources of error which can be introduced if the practices of reproducible research are not employed and non-concurrent databases are used. We also show examples of a lack of transparency when these databases are concerned when using popular bioinformatics tools. These examples highlight that resources are constantly evolving, and in order to provide reproducible results, research should be aware of and connected to the correct release of the data, particularly when implementing computational tools.
format Online
Article
Text
id pubmed-3604632
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-36046322013-03-21 Where in the genome are we? A cautionary tale of database use in genomics research Vaughan, Laura K. Srinivasasainagendra, Vinodh Front Genet Genetics With the advent of high throughput data genomic technologies the volume of available data is now staggering. In addition databases that provide resources to annotate, translate, and connect biological data have grown exponentially in content and use. The availability of such data emphasizes the importance of bioinformatics and computational biology in genomics research and has led to the development of thousands of tools to integrate and utilize these resources. When utilizing such resources, the principles of reproducible research are often overlooked. In this manuscript we provide selected case studies illustrating issues that may arise while working with genes and genetic polymorphisms. These case studies illustrate potential sources of error which can be introduced if the practices of reproducible research are not employed and non-concurrent databases are used. We also show examples of a lack of transparency when these databases are concerned when using popular bioinformatics tools. These examples highlight that resources are constantly evolving, and in order to provide reproducible results, research should be aware of and connected to the correct release of the data, particularly when implementing computational tools. Frontiers Media S.A. 2013-03-21 /pmc/articles/PMC3604632/ /pubmed/23519237 http://dx.doi.org/10.3389/fgene.2013.00038 Text en Copyright © 2013 Vaughan and Srinivasasainagendra. http://creativecommons.org/licenses/by/3.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in other forums, provided the original authors and source are credited and subject to any copyright notices concerning any third-party graphics etc.
spellingShingle Genetics
Vaughan, Laura K.
Srinivasasainagendra, Vinodh
Where in the genome are we? A cautionary tale of database use in genomics research
title Where in the genome are we? A cautionary tale of database use in genomics research
title_full Where in the genome are we? A cautionary tale of database use in genomics research
title_fullStr Where in the genome are we? A cautionary tale of database use in genomics research
title_full_unstemmed Where in the genome are we? A cautionary tale of database use in genomics research
title_short Where in the genome are we? A cautionary tale of database use in genomics research
title_sort where in the genome are we? a cautionary tale of database use in genomics research
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3604632/
https://www.ncbi.nlm.nih.gov/pubmed/23519237
http://dx.doi.org/10.3389/fgene.2013.00038
work_keys_str_mv AT vaughanlaurak whereinthegenomeareweacautionarytaleofdatabaseuseingenomicsresearch
AT srinivasasainagendravinodh whereinthegenomeareweacautionarytaleofdatabaseuseingenomicsresearch