Cargando…

MAP-RSeq: Mayo Analysis Pipeline for RNA sequencing

BACKGROUND: Although the costs of next generation sequencing technology have decreased over the past years, there is still a lack of simple-to-use applications, for a comprehensive analysis of RNA sequencing data. There is no one-stop shop for transcriptomic genomics. We have developed MAP-RSeq, a c...

Descripción completa

Detalles Bibliográficos
Autores principales: Kalari, Krishna R, Nair, Asha A, Bhavsar, Jaysheel D, O’Brien, Daniel R, Davila, Jaime I, Bockol, Matthew A, Nie, Jinfu, Tang, Xiaojia, Baheti, Saurabh, Doughty, Jay B, Middha, Sumit, Sicotte, Hugues, Thompson, Aubrey E, Asmann, Yan W, Kocher, Jean-Pierre A
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4228501/
https://www.ncbi.nlm.nih.gov/pubmed/24972667
http://dx.doi.org/10.1186/1471-2105-15-224
_version_ 1782343996030320640
author Kalari, Krishna R
Nair, Asha A
Bhavsar, Jaysheel D
O’Brien, Daniel R
Davila, Jaime I
Bockol, Matthew A
Nie, Jinfu
Tang, Xiaojia
Baheti, Saurabh
Doughty, Jay B
Middha, Sumit
Sicotte, Hugues
Thompson, Aubrey E
Asmann, Yan W
Kocher, Jean-Pierre A
author_facet Kalari, Krishna R
Nair, Asha A
Bhavsar, Jaysheel D
O’Brien, Daniel R
Davila, Jaime I
Bockol, Matthew A
Nie, Jinfu
Tang, Xiaojia
Baheti, Saurabh
Doughty, Jay B
Middha, Sumit
Sicotte, Hugues
Thompson, Aubrey E
Asmann, Yan W
Kocher, Jean-Pierre A
author_sort Kalari, Krishna R
collection PubMed
description BACKGROUND: Although the costs of next generation sequencing technology have decreased over the past years, there is still a lack of simple-to-use applications, for a comprehensive analysis of RNA sequencing data. There is no one-stop shop for transcriptomic genomics. We have developed MAP-RSeq, a comprehensive computational workflow that can be used for obtaining genomic features from transcriptomic sequencing data, for any genome. RESULTS: For optimization of tools and parameters, MAP-RSeq was validated using both simulated and real datasets. MAP-RSeq workflow consists of six major modules such as alignment of reads, quality assessment of reads, gene expression assessment and exon read counting, identification of expressed single nucleotide variants (SNVs), detection of fusion transcripts, summarization of transcriptomics data and final report. This workflow is available for Human transcriptome analysis and can be easily adapted and used for other genomes. Several clinical and research projects at the Mayo Clinic have applied the MAP-RSeq workflow for RNA-Seq studies. The results from MAP-RSeq have thus far enabled clinicians and researchers to understand the transcriptomic landscape of diseases for better diagnosis and treatment of patients. CONCLUSIONS: Our software provides gene counts, exon counts, fusion candidates, expressed single nucleotide variants, mapping statistics, visualizations, and a detailed research data report for RNA-Seq. The workflow can be executed on a standalone virtual machine or on a parallel Sun Grid Engine cluster. The software can be downloaded from http://bioinformaticstools.mayo.edu/research/maprseq/.
format Online
Article
Text
id pubmed-4228501
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-42285012014-11-13 MAP-RSeq: Mayo Analysis Pipeline for RNA sequencing Kalari, Krishna R Nair, Asha A Bhavsar, Jaysheel D O’Brien, Daniel R Davila, Jaime I Bockol, Matthew A Nie, Jinfu Tang, Xiaojia Baheti, Saurabh Doughty, Jay B Middha, Sumit Sicotte, Hugues Thompson, Aubrey E Asmann, Yan W Kocher, Jean-Pierre A BMC Bioinformatics Software BACKGROUND: Although the costs of next generation sequencing technology have decreased over the past years, there is still a lack of simple-to-use applications, for a comprehensive analysis of RNA sequencing data. There is no one-stop shop for transcriptomic genomics. We have developed MAP-RSeq, a comprehensive computational workflow that can be used for obtaining genomic features from transcriptomic sequencing data, for any genome. RESULTS: For optimization of tools and parameters, MAP-RSeq was validated using both simulated and real datasets. MAP-RSeq workflow consists of six major modules such as alignment of reads, quality assessment of reads, gene expression assessment and exon read counting, identification of expressed single nucleotide variants (SNVs), detection of fusion transcripts, summarization of transcriptomics data and final report. This workflow is available for Human transcriptome analysis and can be easily adapted and used for other genomes. Several clinical and research projects at the Mayo Clinic have applied the MAP-RSeq workflow for RNA-Seq studies. The results from MAP-RSeq have thus far enabled clinicians and researchers to understand the transcriptomic landscape of diseases for better diagnosis and treatment of patients. CONCLUSIONS: Our software provides gene counts, exon counts, fusion candidates, expressed single nucleotide variants, mapping statistics, visualizations, and a detailed research data report for RNA-Seq. The workflow can be executed on a standalone virtual machine or on a parallel Sun Grid Engine cluster. The software can be downloaded from http://bioinformaticstools.mayo.edu/research/maprseq/. BioMed Central 2014-06-27 /pmc/articles/PMC4228501/ /pubmed/24972667 http://dx.doi.org/10.1186/1471-2105-15-224 Text en Copyright © 2014 Kalari et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver ( http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
Kalari, Krishna R
Nair, Asha A
Bhavsar, Jaysheel D
O’Brien, Daniel R
Davila, Jaime I
Bockol, Matthew A
Nie, Jinfu
Tang, Xiaojia
Baheti, Saurabh
Doughty, Jay B
Middha, Sumit
Sicotte, Hugues
Thompson, Aubrey E
Asmann, Yan W
Kocher, Jean-Pierre A
MAP-RSeq: Mayo Analysis Pipeline for RNA sequencing
title MAP-RSeq: Mayo Analysis Pipeline for RNA sequencing
title_full MAP-RSeq: Mayo Analysis Pipeline for RNA sequencing
title_fullStr MAP-RSeq: Mayo Analysis Pipeline for RNA sequencing
title_full_unstemmed MAP-RSeq: Mayo Analysis Pipeline for RNA sequencing
title_short MAP-RSeq: Mayo Analysis Pipeline for RNA sequencing
title_sort map-rseq: mayo analysis pipeline for rna sequencing
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4228501/
https://www.ncbi.nlm.nih.gov/pubmed/24972667
http://dx.doi.org/10.1186/1471-2105-15-224
work_keys_str_mv AT kalarikrishnar maprseqmayoanalysispipelineforrnasequencing
AT nairashaa maprseqmayoanalysispipelineforrnasequencing
AT bhavsarjaysheeld maprseqmayoanalysispipelineforrnasequencing
AT obriendanielr maprseqmayoanalysispipelineforrnasequencing
AT davilajaimei maprseqmayoanalysispipelineforrnasequencing
AT bockolmatthewa maprseqmayoanalysispipelineforrnasequencing
AT niejinfu maprseqmayoanalysispipelineforrnasequencing
AT tangxiaojia maprseqmayoanalysispipelineforrnasequencing
AT bahetisaurabh maprseqmayoanalysispipelineforrnasequencing
AT doughtyjayb maprseqmayoanalysispipelineforrnasequencing
AT middhasumit maprseqmayoanalysispipelineforrnasequencing
AT sicottehugues maprseqmayoanalysispipelineforrnasequencing
AT thompsonaubreye maprseqmayoanalysispipelineforrnasequencing
AT asmannyanw maprseqmayoanalysispipelineforrnasequencing
AT kocherjeanpierrea maprseqmayoanalysispipelineforrnasequencing