
RNA CoMPASS: A Dual Approach for Pathogen and Host Transcriptome Analysis of RNA-Seq Datasets



Bibliographic Details
Main authors: Xu, Guorong; Strong, Michael J.; Lacey, Michelle R.; Baribault, Carl; Flemington, Erik K.; Taylor, Christopher M.
Format: Online Article Text
Language: English
Published: Public Library of Science 2014
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3934900/
https://www.ncbi.nlm.nih.gov/pubmed/24586784
http://dx.doi.org/10.1371/journal.pone.0089445
collection PubMed
description High-throughput RNA sequencing (RNA-seq) has become an instrumental assay for the analysis of multiple aspects of an organism's transcriptome. Further, the analysis of a biological specimen's associated microbiome can also be performed using RNA-seq data, and this application is gaining interest in the scientific community. There are many existing bioinformatics tools designed for analysis and visualization of transcriptome data. Despite the availability of an array of next-generation sequencing (NGS) analysis tools, the analysis of RNA-seq datasets poses a challenge for many biomedical researchers who are not familiar with command-line tools. Here we present RNA CoMPASS, a comprehensive RNA-seq analysis pipeline for the simultaneous analysis of transcriptomes and metatranscriptomes from diverse biological specimens. RNA CoMPASS leverages existing tools and parallel computing technology to facilitate the analysis of even very large datasets. RNA CoMPASS has a web-based graphical user interface with intrinsic queuing to control a distributed computational pipeline. RNA CoMPASS was evaluated by analyzing RNA-seq datasets from 45 B-cell samples. Twenty-two of these samples were derived from lymphoblastoid cell lines (LCLs) generated by the infection of naïve B-cells with the Epstein-Barr virus (EBV), while another 23 samples were derived from Burkitt's lymphomas (BLs), some of which arose in part through infection with EBV. Appropriately, RNA CoMPASS identified EBV in all LCLs and in a fraction of the BLs. Cluster analysis of the human transcriptome component of the RNA CoMPASS output clearly separated the BLs (which have a germinal center-like phenotype) from the LCLs (which have a blast-like phenotype), with evidence of activated MYC signaling and lower interferon and NF-κB signaling in the BLs. Together, this analysis illustrates the utility of RNA CoMPASS in the simultaneous analysis of transcriptome and metatranscriptome data. RNA CoMPASS is freely available at http://rnacompass.sourceforge.net/.
format Online Article Text
id pubmed-3934900
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-3934900; 2014-03-04; PLoS One; Research Article; Public Library of Science; 2014-02-25; /pmc/articles/PMC3934900/; /pubmed/24586784; http://dx.doi.org/10.1371/journal.pone.0089445; Text; en; © 2014 Xu et al.; http://creativecommons.org/licenses/by/4.0/; This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
title RNA CoMPASS: A Dual Approach for Pathogen and Host Transcriptome Analysis of RNA-Seq Datasets
topic Research Article