Cargando…

Mining RNA–Seq Data for Infections and Contaminations

RNA sequencing (RNA–seq) provides novel opportunities for transcriptomic studies at nucleotide resolution, including transcriptomics of viruses or microbes infecting a cell. However, standard approaches for mapping the resulting sequencing reads generally ignore alternative sources of expression oth...

Descripción completa

Detalles Bibliográficos
Autores principales: Bonfert, Thomas, Csaba, Gergely, Zimmer, Ralf, Friedel, Caroline C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3760913/
https://www.ncbi.nlm.nih.gov/pubmed/24019895
http://dx.doi.org/10.1371/journal.pone.0073071
_version_ 1782282820643717120
author Bonfert, Thomas
Csaba, Gergely
Zimmer, Ralf
Friedel, Caroline C.
author_facet Bonfert, Thomas
Csaba, Gergely
Zimmer, Ralf
Friedel, Caroline C.
author_sort Bonfert, Thomas
collection PubMed
description RNA sequencing (RNA–seq) provides novel opportunities for transcriptomic studies at nucleotide resolution, including transcriptomics of viruses or microbes infecting a cell. However, standard approaches for mapping the resulting sequencing reads generally ignore alternative sources of expression other than the host cell and are little equipped to address the problems arising from redundancies and gaps among sequenced microbe and virus genomes. We show that screening of sequencing reads for contaminations and infections can be performed easily using ContextMap, our recently developed mapping software. Based on mapping–derived statistics, mapping confidence, similarities and misidentifications (e.g. due to missing genome sequences) of species/strains can be assessed. Performance of our approach is evaluated on three real–life sequencing data sets and compared to state–of–the–art metagenomics tools. In particular, ContextMap vastly outperformed GASiC and GRAMMy in terms of runtime. In contrast to MEGAN4, it was capable of providing individual read mappings to species and resolving non–unique mappings, thus allowing the identification of misalignments caused by sequence similarities between genomes and missing genome sequences. Our study illustrates the importance and potentials of routinely mining RNA–seq experiments for infections or contaminations by microbes and viruses. By using ContextMap, gene expression of infecting agents can be analyzed and novel insights in infection processes and tumorigenesis can be obtained.
format Online
Article
Text
id pubmed-3760913
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-37609132013-09-09 Mining RNA–Seq Data for Infections and Contaminations Bonfert, Thomas Csaba, Gergely Zimmer, Ralf Friedel, Caroline C. PLoS One Research Article RNA sequencing (RNA–seq) provides novel opportunities for transcriptomic studies at nucleotide resolution, including transcriptomics of viruses or microbes infecting a cell. However, standard approaches for mapping the resulting sequencing reads generally ignore alternative sources of expression other than the host cell and are little equipped to address the problems arising from redundancies and gaps among sequenced microbe and virus genomes. We show that screening of sequencing reads for contaminations and infections can be performed easily using ContextMap, our recently developed mapping software. Based on mapping–derived statistics, mapping confidence, similarities and misidentifications (e.g. due to missing genome sequences) of species/strains can be assessed. Performance of our approach is evaluated on three real–life sequencing data sets and compared to state–of–the–art metagenomics tools. In particular, ContextMap vastly outperformed GASiC and GRAMMy in terms of runtime. In contrast to MEGAN4, it was capable of providing individual read mappings to species and resolving non–unique mappings, thus allowing the identification of misalignments caused by sequence similarities between genomes and missing genome sequences. Our study illustrates the importance and potentials of routinely mining RNA–seq experiments for infections or contaminations by microbes and viruses. By using ContextMap, gene expression of infecting agents can be analyzed and novel insights in infection processes and tumorigenesis can be obtained. Public Library of Science 2013-09-03 /pmc/articles/PMC3760913/ /pubmed/24019895 http://dx.doi.org/10.1371/journal.pone.0073071 Text en © 2013 Bonfert et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Bonfert, Thomas
Csaba, Gergely
Zimmer, Ralf
Friedel, Caroline C.
Mining RNA–Seq Data for Infections and Contaminations
title Mining RNA–Seq Data for Infections and Contaminations
title_full Mining RNA–Seq Data for Infections and Contaminations
title_fullStr Mining RNA–Seq Data for Infections and Contaminations
title_full_unstemmed Mining RNA–Seq Data for Infections and Contaminations
title_short Mining RNA–Seq Data for Infections and Contaminations
title_sort mining rna–seq data for infections and contaminations
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3760913/
https://www.ncbi.nlm.nih.gov/pubmed/24019895
http://dx.doi.org/10.1371/journal.pone.0073071
work_keys_str_mv AT bonfertthomas miningrnaseqdataforinfectionsandcontaminations
AT csabagergely miningrnaseqdataforinfectionsandcontaminations
AT zimmerralf miningrnaseqdataforinfectionsandcontaminations
AT friedelcarolinec miningrnaseqdataforinfectionsandcontaminations