Cargando…

Accurate and unambiguous tag-to-gene mapping in serial analysis of gene expression

BACKGROUND: In this study, we present a robust and reliable computational method for tag-to-gene assignment in serial analysis of gene expression (SAGE). The method relies on current genome information and annotation, incorporation of several new features, and key improvements over alternative metho...

Descripción completa

Detalles Bibliográficos
Autores principales: Malig, Rodrigo, Varela, Cristian, Agosin, Eduardo, Melo, Francisco
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1637119/
https://www.ncbi.nlm.nih.gov/pubmed/17083742
http://dx.doi.org/10.1186/1471-2105-7-487
_version_ 1782130790063144960
author Malig, Rodrigo
Varela, Cristian
Agosin, Eduardo
Melo, Francisco
author_facet Malig, Rodrigo
Varela, Cristian
Agosin, Eduardo
Melo, Francisco
author_sort Malig, Rodrigo
collection PubMed
description BACKGROUND: In this study, we present a robust and reliable computational method for tag-to-gene assignment in serial analysis of gene expression (SAGE). The method relies on current genome information and annotation, incorporation of several new features, and key improvements over alternative methods, all of which are important to determine gene expression levels more accurately. The method provides a complete annotation of potential virtual SAGE tags within a genome, along with an estimation of their confidence for experimental observation that ranks tags that present multiple matches in the genome. RESULTS: We applied this method to the Saccharomyces cerevisiae genome, producing the most thorough and accurate annotation of potential virtual SAGE tags that is available today for this organism. The usefulness of this method is exemplified by the significant reduction of ambiguous cases in existing experimental SAGE data. In addition, we report new insights from the analysis of existing SAGE data. First, we found that experimental SAGE tags mapping onto introns, intron-exon boundaries, and non-coding RNA elements are observed in all available SAGE data. Second, a significant fraction of experimental SAGE tags was found to map onto genomic regions currently annotated as intergenic. Third, a significant number of existing experimental SAGE tags for yeast has been derived from truncated cDNAs, which are synthesized through oligo-d(T) priming to internal poly-(A) regions during reverse transcription. CONCLUSION: We conclude that an accurate and unambiguous tag mapping process is essential to increase the quality and the amount of information that can be extracted from SAGE experiments. This is supported by the results obtained here and also by the large impact that the erroneous interpretation of these data could have on downstream applications.
format Text
id pubmed-1637119
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-16371192006-11-17 Accurate and unambiguous tag-to-gene mapping in serial analysis of gene expression Malig, Rodrigo Varela, Cristian Agosin, Eduardo Melo, Francisco BMC Bioinformatics Methodology Article BACKGROUND: In this study, we present a robust and reliable computational method for tag-to-gene assignment in serial analysis of gene expression (SAGE). The method relies on current genome information and annotation, incorporation of several new features, and key improvements over alternative methods, all of which are important to determine gene expression levels more accurately. The method provides a complete annotation of potential virtual SAGE tags within a genome, along with an estimation of their confidence for experimental observation that ranks tags that present multiple matches in the genome. RESULTS: We applied this method to the Saccharomyces cerevisiae genome, producing the most thorough and accurate annotation of potential virtual SAGE tags that is available today for this organism. The usefulness of this method is exemplified by the significant reduction of ambiguous cases in existing experimental SAGE data. In addition, we report new insights from the analysis of existing SAGE data. First, we found that experimental SAGE tags mapping onto introns, intron-exon boundaries, and non-coding RNA elements are observed in all available SAGE data. Second, a significant fraction of experimental SAGE tags was found to map onto genomic regions currently annotated as intergenic. Third, a significant number of existing experimental SAGE tags for yeast has been derived from truncated cDNAs, which are synthesized through oligo-d(T) priming to internal poly-(A) regions during reverse transcription. CONCLUSION: We conclude that an accurate and unambiguous tag mapping process is essential to increase the quality and the amount of information that can be extracted from SAGE experiments. This is supported by the results obtained here and also by the large impact that the erroneous interpretation of these data could have on downstream applications. BioMed Central 2006-11-04 /pmc/articles/PMC1637119/ /pubmed/17083742 http://dx.doi.org/10.1186/1471-2105-7-487 Text en Copyright © 2006 Malig et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Malig, Rodrigo
Varela, Cristian
Agosin, Eduardo
Melo, Francisco
Accurate and unambiguous tag-to-gene mapping in serial analysis of gene expression
title Accurate and unambiguous tag-to-gene mapping in serial analysis of gene expression
title_full Accurate and unambiguous tag-to-gene mapping in serial analysis of gene expression
title_fullStr Accurate and unambiguous tag-to-gene mapping in serial analysis of gene expression
title_full_unstemmed Accurate and unambiguous tag-to-gene mapping in serial analysis of gene expression
title_short Accurate and unambiguous tag-to-gene mapping in serial analysis of gene expression
title_sort accurate and unambiguous tag-to-gene mapping in serial analysis of gene expression
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1637119/
https://www.ncbi.nlm.nih.gov/pubmed/17083742
http://dx.doi.org/10.1186/1471-2105-7-487
work_keys_str_mv AT maligrodrigo accurateandunambiguoustagtogenemappinginserialanalysisofgeneexpression
AT varelacristian accurateandunambiguoustagtogenemappinginserialanalysisofgeneexpression
AT agosineduardo accurateandunambiguoustagtogenemappinginserialanalysisofgeneexpression
AT melofrancisco accurateandunambiguoustagtogenemappinginserialanalysisofgeneexpression