Cargando…

Comparison and calibration of transcriptome data from RNA-Seq and tiling arrays

BACKGROUND: Tiling arrays have been the tool of choice for probing an organism's transcriptome without prior assumptions about the transcribed regions, but RNA-Seq is becoming a viable alternative as the costs of sequencing continue to decrease. Understanding the relative merits of these techno...

Descripción completa

Detalles Bibliográficos
Autores principales: Agarwal, Ashish, Koppstein, David, Rozowsky, Joel, Sboner, Andrea, Habegger, Lukas, Hillier, LaDeana W, Sasidharan, Rajkumar, Reinke, Valerie, Waterston, Robert H, Gerstein, Mark
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3091629/
https://www.ncbi.nlm.nih.gov/pubmed/20565764
http://dx.doi.org/10.1186/1471-2164-11-383
_version_ 1782203290146045952
author Agarwal, Ashish
Koppstein, David
Rozowsky, Joel
Sboner, Andrea
Habegger, Lukas
Hillier, LaDeana W
Sasidharan, Rajkumar
Reinke, Valerie
Waterston, Robert H
Gerstein, Mark
author_facet Agarwal, Ashish
Koppstein, David
Rozowsky, Joel
Sboner, Andrea
Habegger, Lukas
Hillier, LaDeana W
Sasidharan, Rajkumar
Reinke, Valerie
Waterston, Robert H
Gerstein, Mark
author_sort Agarwal, Ashish
collection PubMed
description BACKGROUND: Tiling arrays have been the tool of choice for probing an organism's transcriptome without prior assumptions about the transcribed regions, but RNA-Seq is becoming a viable alternative as the costs of sequencing continue to decrease. Understanding the relative merits of these technologies will help researchers select the appropriate technology for their needs. RESULTS: Here, we compare these two platforms using a matched sample of poly(A)-enriched RNA isolated from the second larval stage of C. elegans. We find that the raw signals from these two technologies are reasonably well correlated but that RNA-Seq outperforms tiling arrays in several respects, notably in exon boundary detection and dynamic range of expression. By exploring the accuracy of sequencing as a function of depth of coverage, we found that about 4 million reads are required to match the sensitivity of two tiling array replicates. The effects of cross-hybridization were analyzed using a "nearest neighbor" classifier applied to array probes; we describe a method for determining potential "black list" regions whose signals are unreliable. Finally, we propose a strategy for using RNA-Seq data as a gold standard set to calibrate tiling array data. All tiling array and RNA-Seq data sets have been submitted to the modENCODE Data Coordinating Center. CONCLUSIONS: Tiling arrays effectively detect transcript expression levels at a low cost for many species while RNA-Seq provides greater accuracy in several regards. Researchers will need to carefully select the technology appropriate to the biological investigations they are undertaking. It will also be important to reconsider a comparison such as ours as sequencing technologies continue to evolve.
format Text
id pubmed-3091629
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-30916292011-05-11 Comparison and calibration of transcriptome data from RNA-Seq and tiling arrays Agarwal, Ashish Koppstein, David Rozowsky, Joel Sboner, Andrea Habegger, Lukas Hillier, LaDeana W Sasidharan, Rajkumar Reinke, Valerie Waterston, Robert H Gerstein, Mark BMC Genomics Research Article BACKGROUND: Tiling arrays have been the tool of choice for probing an organism's transcriptome without prior assumptions about the transcribed regions, but RNA-Seq is becoming a viable alternative as the costs of sequencing continue to decrease. Understanding the relative merits of these technologies will help researchers select the appropriate technology for their needs. RESULTS: Here, we compare these two platforms using a matched sample of poly(A)-enriched RNA isolated from the second larval stage of C. elegans. We find that the raw signals from these two technologies are reasonably well correlated but that RNA-Seq outperforms tiling arrays in several respects, notably in exon boundary detection and dynamic range of expression. By exploring the accuracy of sequencing as a function of depth of coverage, we found that about 4 million reads are required to match the sensitivity of two tiling array replicates. The effects of cross-hybridization were analyzed using a "nearest neighbor" classifier applied to array probes; we describe a method for determining potential "black list" regions whose signals are unreliable. Finally, we propose a strategy for using RNA-Seq data as a gold standard set to calibrate tiling array data. All tiling array and RNA-Seq data sets have been submitted to the modENCODE Data Coordinating Center. CONCLUSIONS: Tiling arrays effectively detect transcript expression levels at a low cost for many species while RNA-Seq provides greater accuracy in several regards. Researchers will need to carefully select the technology appropriate to the biological investigations they are undertaking. It will also be important to reconsider a comparison such as ours as sequencing technologies continue to evolve. BioMed Central 2010-06-17 /pmc/articles/PMC3091629/ /pubmed/20565764 http://dx.doi.org/10.1186/1471-2164-11-383 Text en Copyright ©2010 Agarwal et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Agarwal, Ashish
Koppstein, David
Rozowsky, Joel
Sboner, Andrea
Habegger, Lukas
Hillier, LaDeana W
Sasidharan, Rajkumar
Reinke, Valerie
Waterston, Robert H
Gerstein, Mark
Comparison and calibration of transcriptome data from RNA-Seq and tiling arrays
title Comparison and calibration of transcriptome data from RNA-Seq and tiling arrays
title_full Comparison and calibration of transcriptome data from RNA-Seq and tiling arrays
title_fullStr Comparison and calibration of transcriptome data from RNA-Seq and tiling arrays
title_full_unstemmed Comparison and calibration of transcriptome data from RNA-Seq and tiling arrays
title_short Comparison and calibration of transcriptome data from RNA-Seq and tiling arrays
title_sort comparison and calibration of transcriptome data from rna-seq and tiling arrays
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3091629/
https://www.ncbi.nlm.nih.gov/pubmed/20565764
http://dx.doi.org/10.1186/1471-2164-11-383
work_keys_str_mv AT agarwalashish comparisonandcalibrationoftranscriptomedatafromrnaseqandtilingarrays
AT koppsteindavid comparisonandcalibrationoftranscriptomedatafromrnaseqandtilingarrays
AT rozowskyjoel comparisonandcalibrationoftranscriptomedatafromrnaseqandtilingarrays
AT sbonerandrea comparisonandcalibrationoftranscriptomedatafromrnaseqandtilingarrays
AT habeggerlukas comparisonandcalibrationoftranscriptomedatafromrnaseqandtilingarrays
AT hillierladeanaw comparisonandcalibrationoftranscriptomedatafromrnaseqandtilingarrays
AT sasidharanrajkumar comparisonandcalibrationoftranscriptomedatafromrnaseqandtilingarrays
AT reinkevalerie comparisonandcalibrationoftranscriptomedatafromrnaseqandtilingarrays
AT waterstonroberth comparisonandcalibrationoftranscriptomedatafromrnaseqandtilingarrays
AT gersteinmark comparisonandcalibrationoftranscriptomedatafromrnaseqandtilingarrays