Cargando…

GENE-Counter: A Computational Pipeline for the Analysis of RNA-Seq Data for Gene Expression Differences

GENE-counter is a complete Perl-based computational pipeline for analyzing RNA-Sequencing (RNA-Seq) data for differential gene expression. In addition to its use in studying transcriptomes of eukaryotic model organisms, GENE-counter is applicable for prokaryotes and non-model organisms without an av...

Descripción completa

Detalles Bibliográficos
Autores principales: Cumbie, Jason S., Kimbrel, Jeffrey A., Di, Yanming, Schafer, Daniel W., Wilhelm, Larry J., Fox, Samuel E., Sullivan, Christopher M., Curzon, Aron D., Carrington, James C., Mockler, Todd C., Chang, Jeff H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3188579/
https://www.ncbi.nlm.nih.gov/pubmed/21998647
http://dx.doi.org/10.1371/journal.pone.0025279
_version_ 1782213398556049408
author Cumbie, Jason S.
Kimbrel, Jeffrey A.
Di, Yanming
Schafer, Daniel W.
Wilhelm, Larry J.
Fox, Samuel E.
Sullivan, Christopher M.
Curzon, Aron D.
Carrington, James C.
Mockler, Todd C.
Chang, Jeff H.
author_facet Cumbie, Jason S.
Kimbrel, Jeffrey A.
Di, Yanming
Schafer, Daniel W.
Wilhelm, Larry J.
Fox, Samuel E.
Sullivan, Christopher M.
Curzon, Aron D.
Carrington, James C.
Mockler, Todd C.
Chang, Jeff H.
author_sort Cumbie, Jason S.
collection PubMed
description GENE-counter is a complete Perl-based computational pipeline for analyzing RNA-Sequencing (RNA-Seq) data for differential gene expression. In addition to its use in studying transcriptomes of eukaryotic model organisms, GENE-counter is applicable for prokaryotes and non-model organisms without an available genome reference sequence. For alignments, GENE-counter is configured for CASHX, Bowtie, and BWA, but an end user can use any Sequence Alignment/Map (SAM)-compliant program of preference. To analyze data for differential gene expression, GENE-counter can be run with any one of three statistics packages that are based on variations of the negative binomial distribution. The default method is a new and simple statistical test we developed based on an over-parameterized version of the negative binomial distribution. GENE-counter also includes three different methods for assessing differentially expressed features for enriched gene ontology (GO) terms. Results are transparent and data are systematically stored in a MySQL relational database to facilitate additional analyses as well as quality assessment. We used next generation sequencing to generate a small-scale RNA-Seq dataset derived from the heavily studied defense response of Arabidopsis thaliana and used GENE-counter to process the data. Collectively, the support from analysis of microarrays as well as the observed and substantial overlap in results from each of the three statistics packages demonstrates that GENE-counter is well suited for handling the unique characteristics of small sample sizes and high variability in gene counts.
format Online
Article
Text
id pubmed-3188579
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-31885792011-10-13 GENE-Counter: A Computational Pipeline for the Analysis of RNA-Seq Data for Gene Expression Differences Cumbie, Jason S. Kimbrel, Jeffrey A. Di, Yanming Schafer, Daniel W. Wilhelm, Larry J. Fox, Samuel E. Sullivan, Christopher M. Curzon, Aron D. Carrington, James C. Mockler, Todd C. Chang, Jeff H. PLoS One Research Article GENE-counter is a complete Perl-based computational pipeline for analyzing RNA-Sequencing (RNA-Seq) data for differential gene expression. In addition to its use in studying transcriptomes of eukaryotic model organisms, GENE-counter is applicable for prokaryotes and non-model organisms without an available genome reference sequence. For alignments, GENE-counter is configured for CASHX, Bowtie, and BWA, but an end user can use any Sequence Alignment/Map (SAM)-compliant program of preference. To analyze data for differential gene expression, GENE-counter can be run with any one of three statistics packages that are based on variations of the negative binomial distribution. The default method is a new and simple statistical test we developed based on an over-parameterized version of the negative binomial distribution. GENE-counter also includes three different methods for assessing differentially expressed features for enriched gene ontology (GO) terms. Results are transparent and data are systematically stored in a MySQL relational database to facilitate additional analyses as well as quality assessment. We used next generation sequencing to generate a small-scale RNA-Seq dataset derived from the heavily studied defense response of Arabidopsis thaliana and used GENE-counter to process the data. Collectively, the support from analysis of microarrays as well as the observed and substantial overlap in results from each of the three statistics packages demonstrates that GENE-counter is well suited for handling the unique characteristics of small sample sizes and high variability in gene counts. Public Library of Science 2011-10-06 /pmc/articles/PMC3188579/ /pubmed/21998647 http://dx.doi.org/10.1371/journal.pone.0025279 Text en Cumbie et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Cumbie, Jason S.
Kimbrel, Jeffrey A.
Di, Yanming
Schafer, Daniel W.
Wilhelm, Larry J.
Fox, Samuel E.
Sullivan, Christopher M.
Curzon, Aron D.
Carrington, James C.
Mockler, Todd C.
Chang, Jeff H.
GENE-Counter: A Computational Pipeline for the Analysis of RNA-Seq Data for Gene Expression Differences
title GENE-Counter: A Computational Pipeline for the Analysis of RNA-Seq Data for Gene Expression Differences
title_full GENE-Counter: A Computational Pipeline for the Analysis of RNA-Seq Data for Gene Expression Differences
title_fullStr GENE-Counter: A Computational Pipeline for the Analysis of RNA-Seq Data for Gene Expression Differences
title_full_unstemmed GENE-Counter: A Computational Pipeline for the Analysis of RNA-Seq Data for Gene Expression Differences
title_short GENE-Counter: A Computational Pipeline for the Analysis of RNA-Seq Data for Gene Expression Differences
title_sort gene-counter: a computational pipeline for the analysis of rna-seq data for gene expression differences
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3188579/
https://www.ncbi.nlm.nih.gov/pubmed/21998647
http://dx.doi.org/10.1371/journal.pone.0025279
work_keys_str_mv AT cumbiejasons genecounteracomputationalpipelinefortheanalysisofrnaseqdataforgeneexpressiondifferences
AT kimbreljeffreya genecounteracomputationalpipelinefortheanalysisofrnaseqdataforgeneexpressiondifferences
AT diyanming genecounteracomputationalpipelinefortheanalysisofrnaseqdataforgeneexpressiondifferences
AT schaferdanielw genecounteracomputationalpipelinefortheanalysisofrnaseqdataforgeneexpressiondifferences
AT wilhelmlarryj genecounteracomputationalpipelinefortheanalysisofrnaseqdataforgeneexpressiondifferences
AT foxsamuele genecounteracomputationalpipelinefortheanalysisofrnaseqdataforgeneexpressiondifferences
AT sullivanchristopherm genecounteracomputationalpipelinefortheanalysisofrnaseqdataforgeneexpressiondifferences
AT curzonarond genecounteracomputationalpipelinefortheanalysisofrnaseqdataforgeneexpressiondifferences
AT carringtonjamesc genecounteracomputationalpipelinefortheanalysisofrnaseqdataforgeneexpressiondifferences
AT mocklertoddc genecounteracomputationalpipelinefortheanalysisofrnaseqdataforgeneexpressiondifferences
AT changjeffh genecounteracomputationalpipelinefortheanalysisofrnaseqdataforgeneexpressiondifferences