Cargando…

Utilizing de Bruijn graph of metagenome assembly for metatranscriptome analysis

Motivation: Metagenomics research has accelerated the studies of microbial organisms, providing insights into the composition and potential functionality of various microbial communities. Metatranscriptomics (studies of the transcripts from a mixture of microbial species) and other meta-omics approa...

Descripción completa

Detalles Bibliográficos
Autores principales: Ye, Yuzhen, Tang, Haixu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4896364/
https://www.ncbi.nlm.nih.gov/pubmed/26319390
http://dx.doi.org/10.1093/bioinformatics/btv510
_version_ 1782436008262893568
author Ye, Yuzhen
Tang, Haixu
author_facet Ye, Yuzhen
Tang, Haixu
author_sort Ye, Yuzhen
collection PubMed
description Motivation: Metagenomics research has accelerated the studies of microbial organisms, providing insights into the composition and potential functionality of various microbial communities. Metatranscriptomics (studies of the transcripts from a mixture of microbial species) and other meta-omics approaches hold even greater promise for providing additional insights into functional and regulatory characteristics of the microbial communities. Current metatranscriptomics projects are often carried out without matched metagenomic datasets (of the same microbial communities). For the projects that produce both metatranscriptomic and metagenomic datasets, their analyses are often not integrated. Metagenome assemblies are far from perfect, partially explaining why metagenome assemblies are not used for the analysis of metatranscriptomic datasets. Results: Here, we report a reads mapping algorithm for mapping of short reads onto a de Bruijn graph of assemblies. A hash table of junction k-mers (k-mers spanning branching structures in the de Bruijn graph) is used to facilitate fast mapping of reads to the graph. We developed an application of this mapping algorithm: a reference-based approach to metatranscriptome assembly using graphs of metagenome assembly as the reference. Our results show that this new approach (called TAG) helps to assemble substantially more transcripts that otherwise would have been missed or truncated because of the fragmented nature of the reference metagenome. Availability and implementation: TAG was implemented in C++ and has been tested extensively on the Linux platform. It is available for download as open source at http://omics.informatics.indiana.edu/TAG. Contact: yye@indiana.edu
format Online
Article
Text
id pubmed-4896364
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-48963642016-06-09 Utilizing de Bruijn graph of metagenome assembly for metatranscriptome analysis Ye, Yuzhen Tang, Haixu Bioinformatics Recomb-Seq/Recomb-Cbb Motivation: Metagenomics research has accelerated the studies of microbial organisms, providing insights into the composition and potential functionality of various microbial communities. Metatranscriptomics (studies of the transcripts from a mixture of microbial species) and other meta-omics approaches hold even greater promise for providing additional insights into functional and regulatory characteristics of the microbial communities. Current metatranscriptomics projects are often carried out without matched metagenomic datasets (of the same microbial communities). For the projects that produce both metatranscriptomic and metagenomic datasets, their analyses are often not integrated. Metagenome assemblies are far from perfect, partially explaining why metagenome assemblies are not used for the analysis of metatranscriptomic datasets. Results: Here, we report a reads mapping algorithm for mapping of short reads onto a de Bruijn graph of assemblies. A hash table of junction k-mers (k-mers spanning branching structures in the de Bruijn graph) is used to facilitate fast mapping of reads to the graph. We developed an application of this mapping algorithm: a reference-based approach to metatranscriptome assembly using graphs of metagenome assembly as the reference. Our results show that this new approach (called TAG) helps to assemble substantially more transcripts that otherwise would have been missed or truncated because of the fragmented nature of the reference metagenome. Availability and implementation: TAG was implemented in C++ and has been tested extensively on the Linux platform. It is available for download as open source at http://omics.informatics.indiana.edu/TAG. Contact: yye@indiana.edu Oxford University Press 2016-04-01 2015-08-29 /pmc/articles/PMC4896364/ /pubmed/26319390 http://dx.doi.org/10.1093/bioinformatics/btv510 Text en © The Author 2015. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Recomb-Seq/Recomb-Cbb
Ye, Yuzhen
Tang, Haixu
Utilizing de Bruijn graph of metagenome assembly for metatranscriptome analysis
title Utilizing de Bruijn graph of metagenome assembly for metatranscriptome analysis
title_full Utilizing de Bruijn graph of metagenome assembly for metatranscriptome analysis
title_fullStr Utilizing de Bruijn graph of metagenome assembly for metatranscriptome analysis
title_full_unstemmed Utilizing de Bruijn graph of metagenome assembly for metatranscriptome analysis
title_short Utilizing de Bruijn graph of metagenome assembly for metatranscriptome analysis
title_sort utilizing de bruijn graph of metagenome assembly for metatranscriptome analysis
topic Recomb-Seq/Recomb-Cbb
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4896364/
https://www.ncbi.nlm.nih.gov/pubmed/26319390
http://dx.doi.org/10.1093/bioinformatics/btv510
work_keys_str_mv AT yeyuzhen utilizingdebruijngraphofmetagenomeassemblyformetatranscriptomeanalysis
AT tanghaixu utilizingdebruijngraphofmetagenomeassemblyformetatranscriptomeanalysis