Cargando…

Detecting and correcting systematic variation in large-scale RNA sequencing data

High-throughput RNA sequencing (RNA-seq) enables comprehensive scans of entire transcriptomes, but best practices for analyzing RNA-seq data have not been fully defined, particularly for data collected with multiple sequencing platforms or at multiple sites. Here we used standardized RNA samples wit...

Descripción completa

Detalles Bibliográficos
Autores principales:	Li, Sheng, Łabaj, Paweł P., Zumbo, Paul, Sykacek, Peter, Shi, Wei, Shi, Leming, Phan, John, Wu, Leo, Wang, May, Wang, Charles, Thierry-Mieg, Danielle, Thierry-Mieg, Jean, Kreil, David P., Mason, Christopher E.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	2014
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4160374/ https://www.ncbi.nlm.nih.gov/pubmed/25150837 http://dx.doi.org/10.1038/nbt.3000

Descripción
Sumario:	High-throughput RNA sequencing (RNA-seq) enables comprehensive scans of entire transcriptomes, but best practices for analyzing RNA-seq data have not been fully defined, particularly for data collected with multiple sequencing platforms or at multiple sites. Here we used standardized RNA samples with built-in controls to examine sources of error in large-scale RNA-seq studies and their impact on the detection of differentially expressed genes (DEGs). Analysis of variations in guanine-cytosine content, gene coverage, sequencing error rate and insert size allowed identification of methods that produce more false positives or are less reproducible across sites. Moreover, commonly used methods fornormalization (cqn, EDASeq, RUV2, sva, PEER) varied in their ability to remove these systematic biases, depending on sample complexity and initial data quality. Normalization methods that combine data from genes across sites are strongly recommended to identify and remove site-specific effects, and can substantially improve RNA-seq studies.

Detecting and correcting systematic variation in large-scale RNA sequencing data

Ejemplares similares