GENE-Counter: A Computational Pipeline for the Analysis of RNA-Seq Data for Gene Expression Differences
GENE-counter is a complete Perl-based computational pipeline for analyzing RNA-Sequencing (RNA-Seq) data for differential gene expression. In addition to its use in studying transcriptomes of eukaryotic model organisms, GENE-counter is applicable for prokaryotes and non-model organisms without an av...
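Main authors: | Cumbie, Jason S.; Kimbrel, Jeffrey A.; Di, Yanming; Schafer, Daniel W.; Wilhelm, Larry J.; Fox, Samuel E.; Sullivan, Christopher M.; Curzon, Aron D.; Carrington, James C.; Mockler, Todd C.; Chang, Jeff H. |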
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Public Library of Science 2011 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3188579/ https://www.ncbi.nlm.nih.gov/pubmed/21998647 http://dx.doi.org/10.1371/journal.pone.0025279 |
_version_ | 1782213398556049408 |
---|---|
author | Cumbie, Jason S. Kimbrel, Jeffrey A. Di, Yanming Schafer, Daniel W. Wilhelm, Larry J. Fox, Samuel E. Sullivan, Christopher M. Curzon, Aron D. Carrington, James C. Mockler, Todd C. Chang, Jeff H. |
author_sort | Cumbie, Jason S. |
collection | PubMed |
description | GENE-counter is a complete Perl-based computational pipeline for analyzing RNA-Sequencing (RNA-Seq) data for differential gene expression. In addition to its use in studying transcriptomes of eukaryotic model organisms, GENE-counter is applicable for prokaryotes and non-model organisms without an available genome reference sequence. For alignments, GENE-counter is configured for CASHX, Bowtie, and BWA, but an end user can use any Sequence Alignment/Map (SAM)-compliant program of preference. To analyze data for differential gene expression, GENE-counter can be run with any one of three statistics packages that are based on variations of the negative binomial distribution. The default method is a new and simple statistical test we developed based on an over-parameterized version of the negative binomial distribution. GENE-counter also includes three different methods for assessing differentially expressed features for enriched gene ontology (GO) terms. Results are transparent and data are systematically stored in a MySQL relational database to facilitate additional analyses as well as quality assessment. We used next generation sequencing to generate a small-scale RNA-Seq dataset derived from the heavily studied defense response of Arabidopsis thaliana and used GENE-counter to process the data. Collectively, the support from analysis of microarrays as well as the observed and substantial overlap in results from each of the three statistics packages demonstrates that GENE-counter is well suited for handling the unique characteristics of small sample sizes and high variability in gene counts. |
format | Online Article Text |
id | pubmed-3188579 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-3188579 2011-10-13 GENE-Counter: A Computational Pipeline for the Analysis of RNA-Seq Data for Gene Expression Differences Cumbie, Jason S. Kimbrel, Jeffrey A. Di, Yanming Schafer, Daniel W. Wilhelm, Larry J. Fox, Samuel E. Sullivan, Christopher M. Curzon, Aron D. Carrington, James C. Mockler, Todd C. Chang, Jeff H. PLoS One Research Article Public Library of Science 2011-10-06 /pmc/articles/PMC3188579/ /pubmed/21998647 http://dx.doi.org/10.1371/journal.pone.0025279 Text en Cumbie et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
title | GENE-Counter: A Computational Pipeline for the Analysis of RNA-Seq Data for Gene Expression Differences |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3188579/ https://www.ncbi.nlm.nih.gov/pubmed/21998647 http://dx.doi.org/10.1371/journal.pone.0025279 |