Deep analysis of cellular transcriptomes – LongSAGE versus classic MPSS

Bibliographic Details
Main authors: Hene, Lawrence, Sreenu, Vattipally B, Vuong, Mai T, Abidi, S Hussain I, Sutton, Julian K, Rowland-Jones, Sarah L, Davis, Simon J, Evans, Edward J
Format: Text
Language: English
Published: BioMed Central 2007
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2104538/
https://www.ncbi.nlm.nih.gov/pubmed/17892551
http://dx.doi.org/10.1186/1471-2164-8-333
author Hene, Lawrence
Sreenu, Vattipally B
Vuong, Mai T
Abidi, S Hussain I
Sutton, Julian K
Rowland-Jones, Sarah L
Davis, Simon J
Evans, Edward J
author_sort Hene, Lawrence
collection PubMed
description BACKGROUND: Deep transcriptome analysis will underpin a large fraction of post-genomic biology. 'Closed' technologies, such as microarray analysis, only detect the set of transcripts chosen for analysis, whereas 'open' (e.g. tag-based) technologies are capable of identifying all possible transcripts, including those that were previously uncharacterized. Although new technologies are now emerging, at present the major resources for open-type analysis are the many publicly available SAGE (serial analysis of gene expression) and MPSS (massively parallel signature sequencing) libraries. These technologies have never been compared for their utility in the context of deep transcriptome mining.
RESULTS: We used a single LongSAGE library of 503,431 tags and a "classic" MPSS library of 1,744,173 tags, both prepared from the same T cell-derived RNA sample, to compare the ability of each method to probe, at considerable depth, a human cellular transcriptome. We show that even though LongSAGE is more error-prone than MPSS, our LongSAGE library nevertheless generated 6.3-fold more genome-matching (and therefore likely error-free) tags than the MPSS library. An analysis of a set of 8,132 known genes detectable by both methods, and for which there is no ambiguity about tag matching, shows that MPSS detects only about half (54%) as many transcripts as SAGE (1,955 versus 3,617). Analysis of two additional MPSS libraries shows that each library samples a different subset of transcripts, and that in combination the three MPSS libraries (4,274,992 tags in total) still detect only 73% of the genes identified in our test set using SAGE. The fraction of transcripts detected by MPSS is likely to be even lower for uncharacterized transcripts, which tend to be more weakly expressed. The source of the loss of complexity in MPSS libraries compared to SAGE is unclear, but its effects become more severe with each sequencing cycle (i.e. as MPSS tag length increases).
CONCLUSION: We show that MPSS libraries are significantly less complex than much smaller SAGE libraries, revealing a serious bias in the generation of MPSS data unlikely to have been circumvented by later technological improvements. Our results emphasize the need for the rigorous testing of new expression profiling technologies.
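The percentages quoted in the abstract can be rechecked from the counts it gives. The following is a minimal sketch, not part of the original record: the constant and function names are hypothetical, and only figures stated in the abstract are used.

```python
# Counts quoted in the abstract; all names below are illustrative only.
TEST_SET_GENES = 8_132       # known genes detectable by both methods
SAGE_DETECTED = 3_617        # test-set genes detected by LongSAGE
MPSS_DETECTED = 1_955        # test-set genes detected by the first MPSS library
LONGSAGE_TAGS = 503_431      # size of the LongSAGE library
MPSS_TAGS = 1_744_173        # size of the first MPSS library
THREE_MPSS_TAGS = 4_274_992  # combined size of the three MPSS libraries


def fraction(part: int, whole: int) -> float:
    """Return part / whole, rejecting a non-positive denominator."""
    if whole <= 0:
        raise ValueError("whole must be positive")
    return part / whole


# MPSS detections relative to SAGE detections (the abstract's "54%").
mpss_relative_to_sage = fraction(MPSS_DETECTED, SAGE_DETECTED)

# Share of the 8,132-gene test set that each method detects.
sage_coverage = fraction(SAGE_DETECTED, TEST_SET_GENES)
mpss_coverage = fraction(MPSS_DETECTED, TEST_SET_GENES)

# How much larger the MPSS libraries are than the LongSAGE library.
one_library_size_ratio = fraction(MPSS_TAGS, LONGSAGE_TAGS)
three_library_size_ratio = fraction(THREE_MPSS_TAGS, LONGSAGE_TAGS)

if __name__ == "__main__":
    print(f"MPSS relative to SAGE: {mpss_relative_to_sage:.1%}")
    print(f"SAGE coverage of test set: {sage_coverage:.1%}")
    print(f"MPSS coverage of test set: {mpss_coverage:.1%}")
    print(f"MPSS/LongSAGE library size: {one_library_size_ratio:.2f}x")
    print(f"Three MPSS/LongSAGE library size: {three_library_size_ratio:.2f}x")
```

The comparison is between detected-gene counts, so the 54% figure is 1,955 divided by 3,617 and not a share of the 8,132-gene test set. The library-size ratios show that the MPSS libraries detect fewer genes despite containing several times more tags.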
format Text
id pubmed-2104538
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-2104538 2007-12-04 Deep analysis of cellular transcriptomes – LongSAGE versus classic MPSS BMC Genomics Methodology Article BioMed Central 2007-09-24 /pmc/articles/PMC2104538/ /pubmed/17892551 http://dx.doi.org/10.1186/1471-2164-8-333 Text en Copyright © 2007 Hene et al; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
title Deep analysis of cellular transcriptomes – LongSAGE versus classic MPSS
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2104538/
https://www.ncbi.nlm.nih.gov/pubmed/17892551
http://dx.doi.org/10.1186/1471-2164-8-333