A comprehensive assessment of RNA-seq accuracy, reproducibility and information content by the Sequencing Quality Control consortium

We present primary results from the Sequencing Quality Control (SEQC) project, coordinated by the United States Food and Drug Administration. Examining Illumina HiSeq, Life Technologies SOLiD and Roche 454 platforms at multiple laboratory sites using reference RNA samples with built-in controls, we...

Bibliographic Details
Format: Online Article Text
Language: English
Published: 2014
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4321899/
https://www.ncbi.nlm.nih.gov/pubmed/25150838
http://dx.doi.org/10.1038/nbt.2957
collection PubMed
description We present primary results from the Sequencing Quality Control (SEQC) project, coordinated by the United States Food and Drug Administration. Examining Illumina HiSeq, Life Technologies SOLiD and Roche 454 platforms at multiple laboratory sites using reference RNA samples with built-in controls, we assess RNA sequencing (RNA-seq) performance for junction discovery and differential expression profiling and compare it to microarray and quantitative PCR (qPCR) data using complementary metrics. At all sequencing depths, we discover unannotated exon-exon junctions, with >80% validated by qPCR. We find that measurements of relative expression are accurate and reproducible across sites and platforms if specific filters are used. In contrast, RNA-seq and microarrays do not provide accurate absolute measurements, and gene-specific biases are observed, for these and qPCR. Measurement performance depends on the platform and data analysis pipeline, and variation is large for transcript-level profiling. The complete SEQC data sets, comprising >100 billion reads (10Tb), provide unique resources for evaluating RNA-seq analyses for clinical and regulatory settings.
format Online
Article
Text
id pubmed-4321899
institution National Center for Biotechnology Information
language English
publishDate 2014
record_format MEDLINE/PubMed
spelling pubmed-4321899 2015-03-01 Nat Biotechnol Article 2014-08-24 2014-09 /pmc/articles/PMC4321899/ /pubmed/25150838 http://dx.doi.org/10.1038/nbt.2957 Text en http://www.nature.com/authors/editorial_policies/license.html#terms Users may view, print, copy, and download text and data-mine the content in such documents, for the purposes of academic research, subject always to the full Conditions of use: http://www.nature.com/authors/editorial_policies/license.html#terms
title A comprehensive assessment of RNA-seq accuracy, reproducibility and information content by the Sequencing Quality Control consortium
topic Article