Cargando…
Comparison and calibration of transcriptome data from RNA-Seq and tiling arrays
BACKGROUND: Tiling arrays have been the tool of choice for probing an organism's transcriptome without prior assumptions about the transcribed regions, but RNA-Seq is becoming a viable alternative as the costs of sequencing continue to decrease. Understanding the relative merits of these techno...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2010
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3091629/ https://www.ncbi.nlm.nih.gov/pubmed/20565764 http://dx.doi.org/10.1186/1471-2164-11-383 |
_version_ | 1782203290146045952 |
---|---|
author | Agarwal, Ashish Koppstein, David Rozowsky, Joel Sboner, Andrea Habegger, Lukas Hillier, LaDeana W Sasidharan, Rajkumar Reinke, Valerie Waterston, Robert H Gerstein, Mark |
author_facet | Agarwal, Ashish Koppstein, David Rozowsky, Joel Sboner, Andrea Habegger, Lukas Hillier, LaDeana W Sasidharan, Rajkumar Reinke, Valerie Waterston, Robert H Gerstein, Mark |
author_sort | Agarwal, Ashish |
collection | PubMed |
description | BACKGROUND: Tiling arrays have been the tool of choice for probing an organism's transcriptome without prior assumptions about the transcribed regions, but RNA-Seq is becoming a viable alternative as the costs of sequencing continue to decrease. Understanding the relative merits of these technologies will help researchers select the appropriate technology for their needs. RESULTS: Here, we compare these two platforms using a matched sample of poly(A)-enriched RNA isolated from the second larval stage of C. elegans. We find that the raw signals from these two technologies are reasonably well correlated but that RNA-Seq outperforms tiling arrays in several respects, notably in exon boundary detection and dynamic range of expression. By exploring the accuracy of sequencing as a function of depth of coverage, we found that about 4 million reads are required to match the sensitivity of two tiling array replicates. The effects of cross-hybridization were analyzed using a "nearest neighbor" classifier applied to array probes; we describe a method for determining potential "black list" regions whose signals are unreliable. Finally, we propose a strategy for using RNA-Seq data as a gold standard set to calibrate tiling array data. All tiling array and RNA-Seq data sets have been submitted to the modENCODE Data Coordinating Center. CONCLUSIONS: Tiling arrays effectively detect transcript expression levels at a low cost for many species while RNA-Seq provides greater accuracy in several regards. Researchers will need to carefully select the technology appropriate to the biological investigations they are undertaking. It will also be important to reconsider a comparison such as ours as sequencing technologies continue to evolve. |
format | Text |
id | pubmed-3091629 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-30916292011-05-11 Comparison and calibration of transcriptome data from RNA-Seq and tiling arrays Agarwal, Ashish Koppstein, David Rozowsky, Joel Sboner, Andrea Habegger, Lukas Hillier, LaDeana W Sasidharan, Rajkumar Reinke, Valerie Waterston, Robert H Gerstein, Mark BMC Genomics Research Article BACKGROUND: Tiling arrays have been the tool of choice for probing an organism's transcriptome without prior assumptions about the transcribed regions, but RNA-Seq is becoming a viable alternative as the costs of sequencing continue to decrease. Understanding the relative merits of these technologies will help researchers select the appropriate technology for their needs. RESULTS: Here, we compare these two platforms using a matched sample of poly(A)-enriched RNA isolated from the second larval stage of C. elegans. We find that the raw signals from these two technologies are reasonably well correlated but that RNA-Seq outperforms tiling arrays in several respects, notably in exon boundary detection and dynamic range of expression. By exploring the accuracy of sequencing as a function of depth of coverage, we found that about 4 million reads are required to match the sensitivity of two tiling array replicates. The effects of cross-hybridization were analyzed using a "nearest neighbor" classifier applied to array probes; we describe a method for determining potential "black list" regions whose signals are unreliable. Finally, we propose a strategy for using RNA-Seq data as a gold standard set to calibrate tiling array data. All tiling array and RNA-Seq data sets have been submitted to the modENCODE Data Coordinating Center. CONCLUSIONS: Tiling arrays effectively detect transcript expression levels at a low cost for many species while RNA-Seq provides greater accuracy in several regards. Researchers will need to carefully select the technology appropriate to the biological investigations they are undertaking. It will also be important to reconsider a comparison such as ours as sequencing technologies continue to evolve. BioMed Central 2010-06-17 /pmc/articles/PMC3091629/ /pubmed/20565764 http://dx.doi.org/10.1186/1471-2164-11-383 Text en Copyright ©2010 Agarwal et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Agarwal, Ashish Koppstein, David Rozowsky, Joel Sboner, Andrea Habegger, Lukas Hillier, LaDeana W Sasidharan, Rajkumar Reinke, Valerie Waterston, Robert H Gerstein, Mark Comparison and calibration of transcriptome data from RNA-Seq and tiling arrays |
title | Comparison and calibration of transcriptome data from RNA-Seq and tiling arrays |
title_full | Comparison and calibration of transcriptome data from RNA-Seq and tiling arrays |
title_fullStr | Comparison and calibration of transcriptome data from RNA-Seq and tiling arrays |
title_full_unstemmed | Comparison and calibration of transcriptome data from RNA-Seq and tiling arrays |
title_short | Comparison and calibration of transcriptome data from RNA-Seq and tiling arrays |
title_sort | comparison and calibration of transcriptome data from rna-seq and tiling arrays |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3091629/ https://www.ncbi.nlm.nih.gov/pubmed/20565764 http://dx.doi.org/10.1186/1471-2164-11-383 |
work_keys_str_mv | AT agarwalashish comparisonandcalibrationoftranscriptomedatafromrnaseqandtilingarrays AT koppsteindavid comparisonandcalibrationoftranscriptomedatafromrnaseqandtilingarrays AT rozowskyjoel comparisonandcalibrationoftranscriptomedatafromrnaseqandtilingarrays AT sbonerandrea comparisonandcalibrationoftranscriptomedatafromrnaseqandtilingarrays AT habeggerlukas comparisonandcalibrationoftranscriptomedatafromrnaseqandtilingarrays AT hillierladeanaw comparisonandcalibrationoftranscriptomedatafromrnaseqandtilingarrays AT sasidharanrajkumar comparisonandcalibrationoftranscriptomedatafromrnaseqandtilingarrays AT reinkevalerie comparisonandcalibrationoftranscriptomedatafromrnaseqandtilingarrays AT waterstonroberth comparisonandcalibrationoftranscriptomedatafromrnaseqandtilingarrays AT gersteinmark comparisonandcalibrationoftranscriptomedatafromrnaseqandtilingarrays |