Cargando…
Where in the genome are we? A cautionary tale of database use in genomics research
With the advent of high throughput data genomic technologies the volume of available data is now staggering. In addition databases that provide resources to annotate, translate, and connect biological data have grown exponentially in content and use. The availability of such data emphasizes the impo...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3604632/ https://www.ncbi.nlm.nih.gov/pubmed/23519237 http://dx.doi.org/10.3389/fgene.2013.00038 |
_version_ | 1782263773938057216 |
---|---|
author | Vaughan, Laura K. Srinivasasainagendra, Vinodh |
author_facet | Vaughan, Laura K. Srinivasasainagendra, Vinodh |
author_sort | Vaughan, Laura K. |
collection | PubMed |
description | With the advent of high throughput data genomic technologies the volume of available data is now staggering. In addition databases that provide resources to annotate, translate, and connect biological data have grown exponentially in content and use. The availability of such data emphasizes the importance of bioinformatics and computational biology in genomics research and has led to the development of thousands of tools to integrate and utilize these resources. When utilizing such resources, the principles of reproducible research are often overlooked. In this manuscript we provide selected case studies illustrating issues that may arise while working with genes and genetic polymorphisms. These case studies illustrate potential sources of error which can be introduced if the practices of reproducible research are not employed and non-concurrent databases are used. We also show examples of a lack of transparency when these databases are concerned when using popular bioinformatics tools. These examples highlight that resources are constantly evolving, and in order to provide reproducible results, research should be aware of and connected to the correct release of the data, particularly when implementing computational tools. |
format | Online Article Text |
id | pubmed-3604632 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-36046322013-03-21 Where in the genome are we? A cautionary tale of database use in genomics research Vaughan, Laura K. Srinivasasainagendra, Vinodh Front Genet Genetics With the advent of high throughput data genomic technologies the volume of available data is now staggering. In addition databases that provide resources to annotate, translate, and connect biological data have grown exponentially in content and use. The availability of such data emphasizes the importance of bioinformatics and computational biology in genomics research and has led to the development of thousands of tools to integrate and utilize these resources. When utilizing such resources, the principles of reproducible research are often overlooked. In this manuscript we provide selected case studies illustrating issues that may arise while working with genes and genetic polymorphisms. These case studies illustrate potential sources of error which can be introduced if the practices of reproducible research are not employed and non-concurrent databases are used. We also show examples of a lack of transparency when these databases are concerned when using popular bioinformatics tools. These examples highlight that resources are constantly evolving, and in order to provide reproducible results, research should be aware of and connected to the correct release of the data, particularly when implementing computational tools. Frontiers Media S.A. 2013-03-21 /pmc/articles/PMC3604632/ /pubmed/23519237 http://dx.doi.org/10.3389/fgene.2013.00038 Text en Copyright © 2013 Vaughan and Srinivasasainagendra. http://creativecommons.org/licenses/by/3.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in other forums, provided the original authors and source are credited and subject to any copyright notices concerning any third-party graphics etc. |
spellingShingle | Genetics Vaughan, Laura K. Srinivasasainagendra, Vinodh Where in the genome are we? A cautionary tale of database use in genomics research |
title | Where in the genome are we? A cautionary tale of database use in genomics research |
title_full | Where in the genome are we? A cautionary tale of database use in genomics research |
title_fullStr | Where in the genome are we? A cautionary tale of database use in genomics research |
title_full_unstemmed | Where in the genome are we? A cautionary tale of database use in genomics research |
title_short | Where in the genome are we? A cautionary tale of database use in genomics research |
title_sort | where in the genome are we? a cautionary tale of database use in genomics research |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3604632/ https://www.ncbi.nlm.nih.gov/pubmed/23519237 http://dx.doi.org/10.3389/fgene.2013.00038 |
work_keys_str_mv | AT vaughanlaurak whereinthegenomeareweacautionarytaleofdatabaseuseingenomicsresearch AT srinivasasainagendravinodh whereinthegenomeareweacautionarytaleofdatabaseuseingenomicsresearch |