Cargando…
Web-based bioinformatics workflows for end-to-end RNA-seq data computation and analysis in agricultural animal species
BACKGROUND: Remarkable advances in Next Generation Sequencing (NGS) technologies, bioinformatics algorithms and computational technologies have significantly accelerated genomic research. However, complicated NGS data analysis still remains as a major bottleneck. RNA-seq, as one of the major area in...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5039875/ https://www.ncbi.nlm.nih.gov/pubmed/27678198 http://dx.doi.org/10.1186/s12864-016-3118-z |
_version_ | 1782456139051433984 |
---|---|
author | Li, Weizhong Richter, R. Alexander Jung, Yunsup Zhu, Qiyun Li, Robert W. |
author_facet | Li, Weizhong Richter, R. Alexander Jung, Yunsup Zhu, Qiyun Li, Robert W. |
author_sort | Li, Weizhong |
collection | PubMed |
description | BACKGROUND: Remarkable advances in Next Generation Sequencing (NGS) technologies, bioinformatics algorithms and computational technologies have significantly accelerated genomic research. However, complicated NGS data analysis still remains as a major bottleneck. RNA-seq, as one of the major area in the NGS field, also confronts great challenges in data analysis. RESULTS: To address the challenges in RNA-seq data analysis, we developed a web portal that offers three integrated workflows that can perform end-to-end compute and analysis, including sequence quality control, read-mapping, transcriptome assembly, reconstruction and quantification, and differential analysis. The first workflow utilizes Tuxedo (Tophat, Cufflink, Cuffmerge and Cuffdiff suite of tools). The second workflow deploys Trinity for de novo assembly and uses RSEM for transcript quantification and EdgeR for differential analysis. The third combines STAR, RSEM, and EdgeR for data analysis. All these workflows support multiple samples and multiple groups of samples and perform differential analysis between groups in a single workflow job submission. The calculated results are available for download and post-analysis. The supported animal species include chicken, cow, duck, goat, pig, horse, rabbit, sheep, turkey, as well as several other model organisms including yeast, C. elegans, Drosophila, and human, with genomic sequences and annotations obtained from ENSEMBL. The RNA-seq portal is freely available from http://weizhongli-lab.org/RNA-seq. CONCLUSIONS: The web portal offers not only bioinformatics software, workflows, computation and reference data, but also an integrated environment for complex RNA-seq data analysis for agricultural animal species. In this project, our aim is not to develop new RNA-seq tools, but to build web workflows for using popular existing RNA-seq methods and make these tools more accessible to the communities. |
format | Online Article Text |
id | pubmed-5039875 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-50398752016-10-05 Web-based bioinformatics workflows for end-to-end RNA-seq data computation and analysis in agricultural animal species Li, Weizhong Richter, R. Alexander Jung, Yunsup Zhu, Qiyun Li, Robert W. BMC Genomics Software BACKGROUND: Remarkable advances in Next Generation Sequencing (NGS) technologies, bioinformatics algorithms and computational technologies have significantly accelerated genomic research. However, complicated NGS data analysis still remains as a major bottleneck. RNA-seq, as one of the major area in the NGS field, also confronts great challenges in data analysis. RESULTS: To address the challenges in RNA-seq data analysis, we developed a web portal that offers three integrated workflows that can perform end-to-end compute and analysis, including sequence quality control, read-mapping, transcriptome assembly, reconstruction and quantification, and differential analysis. The first workflow utilizes Tuxedo (Tophat, Cufflink, Cuffmerge and Cuffdiff suite of tools). The second workflow deploys Trinity for de novo assembly and uses RSEM for transcript quantification and EdgeR for differential analysis. The third combines STAR, RSEM, and EdgeR for data analysis. All these workflows support multiple samples and multiple groups of samples and perform differential analysis between groups in a single workflow job submission. The calculated results are available for download and post-analysis. The supported animal species include chicken, cow, duck, goat, pig, horse, rabbit, sheep, turkey, as well as several other model organisms including yeast, C. elegans, Drosophila, and human, with genomic sequences and annotations obtained from ENSEMBL. The RNA-seq portal is freely available from http://weizhongli-lab.org/RNA-seq. CONCLUSIONS: The web portal offers not only bioinformatics software, workflows, computation and reference data, but also an integrated environment for complex RNA-seq data analysis for agricultural animal species. In this project, our aim is not to develop new RNA-seq tools, but to build web workflows for using popular existing RNA-seq methods and make these tools more accessible to the communities. BioMed Central 2016-09-27 /pmc/articles/PMC5039875/ /pubmed/27678198 http://dx.doi.org/10.1186/s12864-016-3118-z Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Software Li, Weizhong Richter, R. Alexander Jung, Yunsup Zhu, Qiyun Li, Robert W. Web-based bioinformatics workflows for end-to-end RNA-seq data computation and analysis in agricultural animal species |
title | Web-based bioinformatics workflows for end-to-end RNA-seq data computation and analysis in agricultural animal species |
title_full | Web-based bioinformatics workflows for end-to-end RNA-seq data computation and analysis in agricultural animal species |
title_fullStr | Web-based bioinformatics workflows for end-to-end RNA-seq data computation and analysis in agricultural animal species |
title_full_unstemmed | Web-based bioinformatics workflows for end-to-end RNA-seq data computation and analysis in agricultural animal species |
title_short | Web-based bioinformatics workflows for end-to-end RNA-seq data computation and analysis in agricultural animal species |
title_sort | web-based bioinformatics workflows for end-to-end rna-seq data computation and analysis in agricultural animal species |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5039875/ https://www.ncbi.nlm.nih.gov/pubmed/27678198 http://dx.doi.org/10.1186/s12864-016-3118-z |
work_keys_str_mv | AT liweizhong webbasedbioinformaticsworkflowsforendtoendrnaseqdatacomputationandanalysisinagriculturalanimalspecies AT richterralexander webbasedbioinformaticsworkflowsforendtoendrnaseqdatacomputationandanalysisinagriculturalanimalspecies AT jungyunsup webbasedbioinformaticsworkflowsforendtoendrnaseqdatacomputationandanalysisinagriculturalanimalspecies AT zhuqiyun webbasedbioinformaticsworkflowsforendtoendrnaseqdatacomputationandanalysisinagriculturalanimalspecies AT lirobertw webbasedbioinformaticsworkflowsforendtoendrnaseqdatacomputationandanalysisinagriculturalanimalspecies |