Cargando…

AbsIDconvert: An absolute approach for converting genetic identifiers at different granularities

BACKGROUND: High-throughput molecular biology techniques yield vast amounts of data, often by detecting small portions of ribonucleotides corresponding to specific identifiers. Existing bioinformatic methodologies categorize and compare these elements using inferred descriptive annotation given this...

Descripción completa

Detalles Bibliográficos
Autores principales: Mohammad, Fahim, Flight, Robert M, Harrison, Benjamin J, Petruska, Jeffrey C, Rouchka, Eric C
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3554462/
https://www.ncbi.nlm.nih.gov/pubmed/22967011
http://dx.doi.org/10.1186/1471-2105-13-229
_version_ 1782256896379453440
author Mohammad, Fahim
Flight, Robert M
Harrison, Benjamin J
Petruska, Jeffrey C
Rouchka, Eric C
author_facet Mohammad, Fahim
Flight, Robert M
Harrison, Benjamin J
Petruska, Jeffrey C
Rouchka, Eric C
author_sort Mohammad, Fahim
collection PubMed
description BACKGROUND: High-throughput molecular biology techniques yield vast amounts of data, often by detecting small portions of ribonucleotides corresponding to specific identifiers. Existing bioinformatic methodologies categorize and compare these elements using inferred descriptive annotation given this sequence information irrespective of the fact that it may not be representative of the identifier as a whole. RESULTS: All annotations, no matter the granularity, can be aligned to genomic sequences and therefore annotated by genomic intervals. We have developed AbsIDconvert, a methodology for converting between genomic identifiers by first mapping them onto a common universal coordinate system using an interval tree which is subsequently queried for overlapping identifiers. AbsIDconvert has many potential uses, including gene identifier conversion, identification of features within a genomic region, and cross-species comparisons. The utility is demonstrated in three case studies: 1) comparative genomic study mapping plasmodium gene sequences to corresponding human and mosquito transcriptional regions; 2) cross-species study of Incyte clone sequences; and 3) analysis of human Ensembl transcripts mapped by Affymetrix(®;) and Agilent microarray probes. AbsIDconvert currently supports ID conversion of 53 species for a given list of input identifiers, genomic sequence, or genome intervals. CONCLUSION: AbsIDconvert provides an efficient and reliable mechanism for conversion between identifier domains of interest. The flexibility of this tool allows for custom definition identifier domains contingent upon the availability and determination of a genomic mapping interval. As the genomes and the sequences for genetic elements are further refined, this tool will become increasingly useful and accurate. AbsIDconvert is freely available as a web application or downloadable as a virtual machine at: http://bioinformatics.louisville.edu/abid/.
format Online
Article
Text
id pubmed-3554462
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-35544622013-01-29 AbsIDconvert: An absolute approach for converting genetic identifiers at different granularities Mohammad, Fahim Flight, Robert M Harrison, Benjamin J Petruska, Jeffrey C Rouchka, Eric C BMC Bioinformatics Methodology Article BACKGROUND: High-throughput molecular biology techniques yield vast amounts of data, often by detecting small portions of ribonucleotides corresponding to specific identifiers. Existing bioinformatic methodologies categorize and compare these elements using inferred descriptive annotation given this sequence information irrespective of the fact that it may not be representative of the identifier as a whole. RESULTS: All annotations, no matter the granularity, can be aligned to genomic sequences and therefore annotated by genomic intervals. We have developed AbsIDconvert, a methodology for converting between genomic identifiers by first mapping them onto a common universal coordinate system using an interval tree which is subsequently queried for overlapping identifiers. AbsIDconvert has many potential uses, including gene identifier conversion, identification of features within a genomic region, and cross-species comparisons. The utility is demonstrated in three case studies: 1) comparative genomic study mapping plasmodium gene sequences to corresponding human and mosquito transcriptional regions; 2) cross-species study of Incyte clone sequences; and 3) analysis of human Ensembl transcripts mapped by Affymetrix(®;) and Agilent microarray probes. AbsIDconvert currently supports ID conversion of 53 species for a given list of input identifiers, genomic sequence, or genome intervals. CONCLUSION: AbsIDconvert provides an efficient and reliable mechanism for conversion between identifier domains of interest. The flexibility of this tool allows for custom definition identifier domains contingent upon the availability and determination of a genomic mapping interval. As the genomes and the sequences for genetic elements are further refined, this tool will become increasingly useful and accurate. AbsIDconvert is freely available as a web application or downloadable as a virtual machine at: http://bioinformatics.louisville.edu/abid/. BioMed Central 2012-09-12 /pmc/articles/PMC3554462/ /pubmed/22967011 http://dx.doi.org/10.1186/1471-2105-13-229 Text en Copyright ©2012 Mohammad et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Mohammad, Fahim
Flight, Robert M
Harrison, Benjamin J
Petruska, Jeffrey C
Rouchka, Eric C
AbsIDconvert: An absolute approach for converting genetic identifiers at different granularities
title AbsIDconvert: An absolute approach for converting genetic identifiers at different granularities
title_full AbsIDconvert: An absolute approach for converting genetic identifiers at different granularities
title_fullStr AbsIDconvert: An absolute approach for converting genetic identifiers at different granularities
title_full_unstemmed AbsIDconvert: An absolute approach for converting genetic identifiers at different granularities
title_short AbsIDconvert: An absolute approach for converting genetic identifiers at different granularities
title_sort absidconvert: an absolute approach for converting genetic identifiers at different granularities
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3554462/
https://www.ncbi.nlm.nih.gov/pubmed/22967011
http://dx.doi.org/10.1186/1471-2105-13-229
work_keys_str_mv AT mohammadfahim absidconvertanabsoluteapproachforconvertinggeneticidentifiersatdifferentgranularities
AT flightrobertm absidconvertanabsoluteapproachforconvertinggeneticidentifiersatdifferentgranularities
AT harrisonbenjaminj absidconvertanabsoluteapproachforconvertinggeneticidentifiersatdifferentgranularities
AT petruskajeffreyc absidconvertanabsoluteapproachforconvertinggeneticidentifiersatdifferentgranularities
AT rouchkaericc absidconvertanabsoluteapproachforconvertinggeneticidentifiersatdifferentgranularities