Cargando…

SAGE2Splice: Unmapped SAGE Tags Reveal Novel Splice Junctions

Serial analysis of gene expression (SAGE) not only is a method for profiling the global expression of genes, but also offers the opportunity for the discovery of novel transcripts. SAGE tags are mapped to known transcripts to determine the gene of origin. Tags that map neither to a known transcript...

Descripción completa

Detalles Bibliográficos
Autores principales: Kuo, Byron Yu-Lin, Chen, Ying, Bohacec, Slavita, Johansson, Öjvind, Wasserman, Wyeth W, Simpson, Elizabeth M
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1447652/
https://www.ncbi.nlm.nih.gov/pubmed/16683015
http://dx.doi.org/10.1371/journal.pcbi.0020034
_version_ 1782127356172828672
author Kuo, Byron Yu-Lin
Chen, Ying
Bohacec, Slavita
Johansson, Öjvind
Wasserman, Wyeth W
Simpson, Elizabeth M
author_facet Kuo, Byron Yu-Lin
Chen, Ying
Bohacec, Slavita
Johansson, Öjvind
Wasserman, Wyeth W
Simpson, Elizabeth M
author_sort Kuo, Byron Yu-Lin
collection PubMed
description Serial analysis of gene expression (SAGE) not only is a method for profiling the global expression of genes, but also offers the opportunity for the discovery of novel transcripts. SAGE tags are mapped to known transcripts to determine the gene of origin. Tags that map neither to a known transcript nor to the genome were hypothesized to span a splice junction, for which the exon combination or exon(s) are unknown. To test this hypothesis, we have developed an algorithm, SAGE2Splice, to efficiently map SAGE tags to potential splice junctions in a genome. The algorithm consists of three search levels. A scoring scheme was designed based on position weight matrices to assess the quality of candidates. Using optimized parameters for SAGE2Splice analysis and two sets of SAGE data, candidate junctions were discovered for 5%–6% of unmapped tags. Candidates were classified into three categories, reflecting the previous annotations of the putative splice junctions. Analysis of predicted tags extracted from EST sequences demonstrated that candidate junctions having the splice junction located closer to the center of the tags are more reliable. Nine of these 12 candidates were validated by RT-PCR and sequencing, and among these, four revealed previously uncharacterized exons. Thus, SAGE2Splice provides a new functionality for the identification of novel transcripts and exons. SAGE2Splice is available online at http://www.cisreg.ca.
format Text
id pubmed-1447652
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-14476522006-05-08 SAGE2Splice: Unmapped SAGE Tags Reveal Novel Splice Junctions Kuo, Byron Yu-Lin Chen, Ying Bohacec, Slavita Johansson, Öjvind Wasserman, Wyeth W Simpson, Elizabeth M PLoS Comput Biol Research Article Serial analysis of gene expression (SAGE) not only is a method for profiling the global expression of genes, but also offers the opportunity for the discovery of novel transcripts. SAGE tags are mapped to known transcripts to determine the gene of origin. Tags that map neither to a known transcript nor to the genome were hypothesized to span a splice junction, for which the exon combination or exon(s) are unknown. To test this hypothesis, we have developed an algorithm, SAGE2Splice, to efficiently map SAGE tags to potential splice junctions in a genome. The algorithm consists of three search levels. A scoring scheme was designed based on position weight matrices to assess the quality of candidates. Using optimized parameters for SAGE2Splice analysis and two sets of SAGE data, candidate junctions were discovered for 5%–6% of unmapped tags. Candidates were classified into three categories, reflecting the previous annotations of the putative splice junctions. Analysis of predicted tags extracted from EST sequences demonstrated that candidate junctions having the splice junction located closer to the center of the tags are more reliable. Nine of these 12 candidates were validated by RT-PCR and sequencing, and among these, four revealed previously uncharacterized exons. Thus, SAGE2Splice provides a new functionality for the identification of novel transcripts and exons. SAGE2Splice is available online at http://www.cisreg.ca. Public Library of Science 2006-04 2006-04-28 /pmc/articles/PMC1447652/ /pubmed/16683015 http://dx.doi.org/10.1371/journal.pcbi.0020034 Text en © 2006 Kuo et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Kuo, Byron Yu-Lin
Chen, Ying
Bohacec, Slavita
Johansson, Öjvind
Wasserman, Wyeth W
Simpson, Elizabeth M
SAGE2Splice: Unmapped SAGE Tags Reveal Novel Splice Junctions
title SAGE2Splice: Unmapped SAGE Tags Reveal Novel Splice Junctions
title_full SAGE2Splice: Unmapped SAGE Tags Reveal Novel Splice Junctions
title_fullStr SAGE2Splice: Unmapped SAGE Tags Reveal Novel Splice Junctions
title_full_unstemmed SAGE2Splice: Unmapped SAGE Tags Reveal Novel Splice Junctions
title_short SAGE2Splice: Unmapped SAGE Tags Reveal Novel Splice Junctions
title_sort sage2splice: unmapped sage tags reveal novel splice junctions
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1447652/
https://www.ncbi.nlm.nih.gov/pubmed/16683015
http://dx.doi.org/10.1371/journal.pcbi.0020034
work_keys_str_mv AT kuobyronyulin sage2spliceunmappedsagetagsrevealnovelsplicejunctions
AT chenying sage2spliceunmappedsagetagsrevealnovelsplicejunctions
AT bohacecslavita sage2spliceunmappedsagetagsrevealnovelsplicejunctions
AT johanssonojvind sage2spliceunmappedsagetagsrevealnovelsplicejunctions
AT wassermanwyethw sage2spliceunmappedsagetagsrevealnovelsplicejunctions
AT simpsonelizabethm sage2spliceunmappedsagetagsrevealnovelsplicejunctions