Cargando…

Web-based bioinformatics workflows for end-to-end RNA-seq data computation and analysis in agricultural animal species

BACKGROUND: Remarkable advances in Next Generation Sequencing (NGS) technologies, bioinformatics algorithms and computational technologies have significantly accelerated genomic research. However, complicated NGS data analysis still remains as a major bottleneck. RNA-seq, as one of the major area in...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Weizhong, Richter, R. Alexander, Jung, Yunsup, Zhu, Qiyun, Li, Robert W.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5039875/
https://www.ncbi.nlm.nih.gov/pubmed/27678198
http://dx.doi.org/10.1186/s12864-016-3118-z
_version_ 1782456139051433984
author Li, Weizhong
Richter, R. Alexander
Jung, Yunsup
Zhu, Qiyun
Li, Robert W.
author_facet Li, Weizhong
Richter, R. Alexander
Jung, Yunsup
Zhu, Qiyun
Li, Robert W.
author_sort Li, Weizhong
collection PubMed
description BACKGROUND: Remarkable advances in Next Generation Sequencing (NGS) technologies, bioinformatics algorithms and computational technologies have significantly accelerated genomic research. However, complicated NGS data analysis still remains as a major bottleneck. RNA-seq, as one of the major area in the NGS field, also confronts great challenges in data analysis. RESULTS: To address the challenges in RNA-seq data analysis, we developed a web portal that offers three integrated workflows that can perform end-to-end compute and analysis, including sequence quality control, read-mapping, transcriptome assembly, reconstruction and quantification, and differential analysis. The first workflow utilizes Tuxedo (Tophat, Cufflink, Cuffmerge and Cuffdiff suite of tools). The second workflow deploys Trinity for de novo assembly and uses RSEM for transcript quantification and EdgeR for differential analysis. The third combines STAR, RSEM, and EdgeR for data analysis. All these workflows support multiple samples and multiple groups of samples and perform differential analysis between groups in a single workflow job submission. The calculated results are available for download and post-analysis. The supported animal species include chicken, cow, duck, goat, pig, horse, rabbit, sheep, turkey, as well as several other model organisms including yeast, C. elegans, Drosophila, and human, with genomic sequences and annotations obtained from ENSEMBL. The RNA-seq portal is freely available from http://weizhongli-lab.org/RNA-seq. CONCLUSIONS: The web portal offers not only bioinformatics software, workflows, computation and reference data, but also an integrated environment for complex RNA-seq data analysis for agricultural animal species. In this project, our aim is not to develop new RNA-seq tools, but to build web workflows for using popular existing RNA-seq methods and make these tools more accessible to the communities.
format Online
Article
Text
id pubmed-5039875
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-50398752016-10-05 Web-based bioinformatics workflows for end-to-end RNA-seq data computation and analysis in agricultural animal species Li, Weizhong Richter, R. Alexander Jung, Yunsup Zhu, Qiyun Li, Robert W. BMC Genomics Software BACKGROUND: Remarkable advances in Next Generation Sequencing (NGS) technologies, bioinformatics algorithms and computational technologies have significantly accelerated genomic research. However, complicated NGS data analysis still remains as a major bottleneck. RNA-seq, as one of the major area in the NGS field, also confronts great challenges in data analysis. RESULTS: To address the challenges in RNA-seq data analysis, we developed a web portal that offers three integrated workflows that can perform end-to-end compute and analysis, including sequence quality control, read-mapping, transcriptome assembly, reconstruction and quantification, and differential analysis. The first workflow utilizes Tuxedo (Tophat, Cufflink, Cuffmerge and Cuffdiff suite of tools). The second workflow deploys Trinity for de novo assembly and uses RSEM for transcript quantification and EdgeR for differential analysis. The third combines STAR, RSEM, and EdgeR for data analysis. All these workflows support multiple samples and multiple groups of samples and perform differential analysis between groups in a single workflow job submission. The calculated results are available for download and post-analysis. The supported animal species include chicken, cow, duck, goat, pig, horse, rabbit, sheep, turkey, as well as several other model organisms including yeast, C. elegans, Drosophila, and human, with genomic sequences and annotations obtained from ENSEMBL. The RNA-seq portal is freely available from http://weizhongli-lab.org/RNA-seq. CONCLUSIONS: The web portal offers not only bioinformatics software, workflows, computation and reference data, but also an integrated environment for complex RNA-seq data analysis for agricultural animal species. In this project, our aim is not to develop new RNA-seq tools, but to build web workflows for using popular existing RNA-seq methods and make these tools more accessible to the communities. BioMed Central 2016-09-27 /pmc/articles/PMC5039875/ /pubmed/27678198 http://dx.doi.org/10.1186/s12864-016-3118-z Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
Li, Weizhong
Richter, R. Alexander
Jung, Yunsup
Zhu, Qiyun
Li, Robert W.
Web-based bioinformatics workflows for end-to-end RNA-seq data computation and analysis in agricultural animal species
title Web-based bioinformatics workflows for end-to-end RNA-seq data computation and analysis in agricultural animal species
title_full Web-based bioinformatics workflows for end-to-end RNA-seq data computation and analysis in agricultural animal species
title_fullStr Web-based bioinformatics workflows for end-to-end RNA-seq data computation and analysis in agricultural animal species
title_full_unstemmed Web-based bioinformatics workflows for end-to-end RNA-seq data computation and analysis in agricultural animal species
title_short Web-based bioinformatics workflows for end-to-end RNA-seq data computation and analysis in agricultural animal species
title_sort web-based bioinformatics workflows for end-to-end rna-seq data computation and analysis in agricultural animal species
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5039875/
https://www.ncbi.nlm.nih.gov/pubmed/27678198
http://dx.doi.org/10.1186/s12864-016-3118-z
work_keys_str_mv AT liweizhong webbasedbioinformaticsworkflowsforendtoendrnaseqdatacomputationandanalysisinagriculturalanimalspecies
AT richterralexander webbasedbioinformaticsworkflowsforendtoendrnaseqdatacomputationandanalysisinagriculturalanimalspecies
AT jungyunsup webbasedbioinformaticsworkflowsforendtoendrnaseqdatacomputationandanalysisinagriculturalanimalspecies
AT zhuqiyun webbasedbioinformaticsworkflowsforendtoendrnaseqdatacomputationandanalysisinagriculturalanimalspecies
AT lirobertw webbasedbioinformaticsworkflowsforendtoendrnaseqdatacomputationandanalysisinagriculturalanimalspecies