
A pipeline for RNA-seq data processing and quality assessment

Summary: We present an R based pipeline, ArrayExpressHTS, for pre-processing, expression estimation and data quality assessment of high-throughput sequencing transcriptional profiling (RNA-seq) datasets. The pipeline starts from raw sequence files and produces standard Bioconductor R objects contain...


Bibliographic Details
Main Authors: Goncalves, Angela; Tikhonov, Andrew; Brazma, Alvis; Kapushesky, Misha
Format: Text
Language: English
Published: Oxford University Press 2011
Subjects: Applications Note
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3051320/
https://www.ncbi.nlm.nih.gov/pubmed/21233166
http://dx.doi.org/10.1093/bioinformatics/btr012
author Goncalves, Angela
Tikhonov, Andrew
Brazma, Alvis
Kapushesky, Misha
collection PubMed
description Summary: We present an R based pipeline, ArrayExpressHTS, for pre-processing, expression estimation and data quality assessment of high-throughput sequencing transcriptional profiling (RNA-seq) datasets. The pipeline starts from raw sequence files and produces standard Bioconductor R objects containing gene or transcript measurements for downstream analysis along with web reports for data quality assessment. It may be run locally on a user's own computer or remotely on a distributed R-cloud farm at the European Bioinformatics Institute. It can be used to analyse user's own datasets or public RNA-seq datasets from the ArrayExpress Archive. Availability: The R package is available at www.ebi.ac.uk/tools/rcloud with online documentation at www.ebi.ac.uk/Tools/rwiki/, also available as supplementary material. Contact: angela.goncalves@ebi.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online.
format Text
id pubmed-3051320
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-3051320 2011-03-10 A pipeline for RNA-seq data processing and quality assessment Goncalves, Angela Tikhonov, Andrew Brazma, Alvis Kapushesky, Misha Bioinformatics Applications Note Summary: We present an R based pipeline, ArrayExpressHTS, for pre-processing, expression estimation and data quality assessment of high-throughput sequencing transcriptional profiling (RNA-seq) datasets. The pipeline starts from raw sequence files and produces standard Bioconductor R objects containing gene or transcript measurements for downstream analysis along with web reports for data quality assessment. It may be run locally on a user's own computer or remotely on a distributed R-cloud farm at the European Bioinformatics Institute. It can be used to analyse user's own datasets or public RNA-seq datasets from the ArrayExpress Archive. Availability: The R package is available at www.ebi.ac.uk/tools/rcloud with online documentation at www.ebi.ac.uk/Tools/rwiki/, also available as supplementary material. Contact: angela.goncalves@ebi.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2011-03-15 2011-01-13 /pmc/articles/PMC3051320/ /pubmed/21233166 http://dx.doi.org/10.1093/bioinformatics/btr012 Text en © The Author(s) 2011. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
title A pipeline for RNA-seq data processing and quality assessment
topic Applications Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3051320/
https://www.ncbi.nlm.nih.gov/pubmed/21233166
http://dx.doi.org/10.1093/bioinformatics/btr012