Cargando…
Accurate and unambiguous tag-to-gene mapping in serial analysis of gene expression
BACKGROUND: In this study, we present a robust and reliable computational method for tag-to-gene assignment in serial analysis of gene expression (SAGE). The method relies on current genome information and annotation, incorporation of several new features, and key improvements over alternative metho...
Autores principales: | , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2006
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1637119/ https://www.ncbi.nlm.nih.gov/pubmed/17083742 http://dx.doi.org/10.1186/1471-2105-7-487 |
_version_ | 1782130790063144960 |
---|---|
author | Malig, Rodrigo Varela, Cristian Agosin, Eduardo Melo, Francisco |
author_facet | Malig, Rodrigo Varela, Cristian Agosin, Eduardo Melo, Francisco |
author_sort | Malig, Rodrigo |
collection | PubMed |
description | BACKGROUND: In this study, we present a robust and reliable computational method for tag-to-gene assignment in serial analysis of gene expression (SAGE). The method relies on current genome information and annotation, incorporation of several new features, and key improvements over alternative methods, all of which are important to determine gene expression levels more accurately. The method provides a complete annotation of potential virtual SAGE tags within a genome, along with an estimation of their confidence for experimental observation that ranks tags that present multiple matches in the genome. RESULTS: We applied this method to the Saccharomyces cerevisiae genome, producing the most thorough and accurate annotation of potential virtual SAGE tags that is available today for this organism. The usefulness of this method is exemplified by the significant reduction of ambiguous cases in existing experimental SAGE data. In addition, we report new insights from the analysis of existing SAGE data. First, we found that experimental SAGE tags mapping onto introns, intron-exon boundaries, and non-coding RNA elements are observed in all available SAGE data. Second, a significant fraction of experimental SAGE tags was found to map onto genomic regions currently annotated as intergenic. Third, a significant number of existing experimental SAGE tags for yeast has been derived from truncated cDNAs, which are synthesized through oligo-d(T) priming to internal poly-(A) regions during reverse transcription. CONCLUSION: We conclude that an accurate and unambiguous tag mapping process is essential to increase the quality and the amount of information that can be extracted from SAGE experiments. This is supported by the results obtained here and also by the large impact that the erroneous interpretation of these data could have on downstream applications. |
format | Text |
id | pubmed-1637119 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2006 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-16371192006-11-17 Accurate and unambiguous tag-to-gene mapping in serial analysis of gene expression Malig, Rodrigo Varela, Cristian Agosin, Eduardo Melo, Francisco BMC Bioinformatics Methodology Article BACKGROUND: In this study, we present a robust and reliable computational method for tag-to-gene assignment in serial analysis of gene expression (SAGE). The method relies on current genome information and annotation, incorporation of several new features, and key improvements over alternative methods, all of which are important to determine gene expression levels more accurately. The method provides a complete annotation of potential virtual SAGE tags within a genome, along with an estimation of their confidence for experimental observation that ranks tags that present multiple matches in the genome. RESULTS: We applied this method to the Saccharomyces cerevisiae genome, producing the most thorough and accurate annotation of potential virtual SAGE tags that is available today for this organism. The usefulness of this method is exemplified by the significant reduction of ambiguous cases in existing experimental SAGE data. In addition, we report new insights from the analysis of existing SAGE data. First, we found that experimental SAGE tags mapping onto introns, intron-exon boundaries, and non-coding RNA elements are observed in all available SAGE data. Second, a significant fraction of experimental SAGE tags was found to map onto genomic regions currently annotated as intergenic. Third, a significant number of existing experimental SAGE tags for yeast has been derived from truncated cDNAs, which are synthesized through oligo-d(T) priming to internal poly-(A) regions during reverse transcription. CONCLUSION: We conclude that an accurate and unambiguous tag mapping process is essential to increase the quality and the amount of information that can be extracted from SAGE experiments. This is supported by the results obtained here and also by the large impact that the erroneous interpretation of these data could have on downstream applications. BioMed Central 2006-11-04 /pmc/articles/PMC1637119/ /pubmed/17083742 http://dx.doi.org/10.1186/1471-2105-7-487 Text en Copyright © 2006 Malig et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methodology Article Malig, Rodrigo Varela, Cristian Agosin, Eduardo Melo, Francisco Accurate and unambiguous tag-to-gene mapping in serial analysis of gene expression |
title | Accurate and unambiguous tag-to-gene mapping in serial analysis of gene expression |
title_full | Accurate and unambiguous tag-to-gene mapping in serial analysis of gene expression |
title_fullStr | Accurate and unambiguous tag-to-gene mapping in serial analysis of gene expression |
title_full_unstemmed | Accurate and unambiguous tag-to-gene mapping in serial analysis of gene expression |
title_short | Accurate and unambiguous tag-to-gene mapping in serial analysis of gene expression |
title_sort | accurate and unambiguous tag-to-gene mapping in serial analysis of gene expression |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1637119/ https://www.ncbi.nlm.nih.gov/pubmed/17083742 http://dx.doi.org/10.1186/1471-2105-7-487 |
work_keys_str_mv | AT maligrodrigo accurateandunambiguoustagtogenemappinginserialanalysisofgeneexpression AT varelacristian accurateandunambiguoustagtogenemappinginserialanalysisofgeneexpression AT agosineduardo accurateandunambiguoustagtogenemappinginserialanalysisofgeneexpression AT melofrancisco accurateandunambiguoustagtogenemappinginserialanalysisofgeneexpression |