Cargando…

MetaGT: A pipeline for de novo assembly of metatranscriptomes with the aid of metagenomic data

While metagenome sequencing may provide insights on the genome sequences and composition of microbial communities, metatranscriptome analysis can be useful for studying the functional activity of a microbiome. RNA-Seq data provides the possibility to determine active genes in the community and how t...

Descripción completa

Detalles Bibliográficos
Autores principales: Shafranskaya, Daria, Kale, Varsha, Finn, Rob, Lapidus, Alla L., Korobeynikov, Anton, Prjibelski, Andrey D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9651917/
https://www.ncbi.nlm.nih.gov/pubmed/36386613
http://dx.doi.org/10.3389/fmicb.2022.981458
_version_ 1784828339301646336
author Shafranskaya, Daria
Kale, Varsha
Finn, Rob
Lapidus, Alla L.
Korobeynikov, Anton
Prjibelski, Andrey D.
author_facet Shafranskaya, Daria
Kale, Varsha
Finn, Rob
Lapidus, Alla L.
Korobeynikov, Anton
Prjibelski, Andrey D.
author_sort Shafranskaya, Daria
collection PubMed
description While metagenome sequencing may provide insights on the genome sequences and composition of microbial communities, metatranscriptome analysis can be useful for studying the functional activity of a microbiome. RNA-Seq data provides the possibility to determine active genes in the community and how their expression levels depend on external conditions. Although the field of metatranscriptomics is relatively young, the number of projects related to metatranscriptome analysis increases every year and the scope of its applications expands. However, there are several problems that complicate metatranscriptome analysis: complexity of microbial communities, wide dynamic range of transcriptome expression and importantly, the lack of high-quality computational methods for assembling meta-RNA sequencing data. These factors deteriorate the contiguity and completeness of metatranscriptome assemblies, therefore affecting further downstream analysis. Here we present MetaGT, a pipeline for de novo assembly of metatranscriptomes, which is based on the idea of combining both metatranscriptomic and metagenomic data sequenced from the same sample. MetaGT assembles metatranscriptomic contigs and fills in missing regions based on their alignments to metagenome assembly. This approach allows to overcome described complexities and obtain complete RNA sequences, and additionally estimate their abundances. Using various publicly available real and simulated datasets, we demonstrate that MetaGT yields significant improvement in coverage and completeness of metatranscriptome assemblies compared to existing methods that do not exploit metagenomic data. The pipeline is implemented in NextFlow and is freely available from https://github.com/ablab/metaGT.
format Online
Article
Text
id pubmed-9651917
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-96519172022-11-15 MetaGT: A pipeline for de novo assembly of metatranscriptomes with the aid of metagenomic data Shafranskaya, Daria Kale, Varsha Finn, Rob Lapidus, Alla L. Korobeynikov, Anton Prjibelski, Andrey D. Front Microbiol Microbiology While metagenome sequencing may provide insights on the genome sequences and composition of microbial communities, metatranscriptome analysis can be useful for studying the functional activity of a microbiome. RNA-Seq data provides the possibility to determine active genes in the community and how their expression levels depend on external conditions. Although the field of metatranscriptomics is relatively young, the number of projects related to metatranscriptome analysis increases every year and the scope of its applications expands. However, there are several problems that complicate metatranscriptome analysis: complexity of microbial communities, wide dynamic range of transcriptome expression and importantly, the lack of high-quality computational methods for assembling meta-RNA sequencing data. These factors deteriorate the contiguity and completeness of metatranscriptome assemblies, therefore affecting further downstream analysis. Here we present MetaGT, a pipeline for de novo assembly of metatranscriptomes, which is based on the idea of combining both metatranscriptomic and metagenomic data sequenced from the same sample. MetaGT assembles metatranscriptomic contigs and fills in missing regions based on their alignments to metagenome assembly. This approach allows to overcome described complexities and obtain complete RNA sequences, and additionally estimate their abundances. Using various publicly available real and simulated datasets, we demonstrate that MetaGT yields significant improvement in coverage and completeness of metatranscriptome assemblies compared to existing methods that do not exploit metagenomic data. The pipeline is implemented in NextFlow and is freely available from https://github.com/ablab/metaGT. Frontiers Media S.A. 2022-10-28 /pmc/articles/PMC9651917/ /pubmed/36386613 http://dx.doi.org/10.3389/fmicb.2022.981458 Text en Copyright © 2022 Shafranskaya, Kale, Finn, Lapidus, Korobeynikov and Prjibelski. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Microbiology
Shafranskaya, Daria
Kale, Varsha
Finn, Rob
Lapidus, Alla L.
Korobeynikov, Anton
Prjibelski, Andrey D.
MetaGT: A pipeline for de novo assembly of metatranscriptomes with the aid of metagenomic data
title MetaGT: A pipeline for de novo assembly of metatranscriptomes with the aid of metagenomic data
title_full MetaGT: A pipeline for de novo assembly of metatranscriptomes with the aid of metagenomic data
title_fullStr MetaGT: A pipeline for de novo assembly of metatranscriptomes with the aid of metagenomic data
title_full_unstemmed MetaGT: A pipeline for de novo assembly of metatranscriptomes with the aid of metagenomic data
title_short MetaGT: A pipeline for de novo assembly of metatranscriptomes with the aid of metagenomic data
title_sort metagt: a pipeline for de novo assembly of metatranscriptomes with the aid of metagenomic data
topic Microbiology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9651917/
https://www.ncbi.nlm.nih.gov/pubmed/36386613
http://dx.doi.org/10.3389/fmicb.2022.981458
work_keys_str_mv AT shafranskayadaria metagtapipelinefordenovoassemblyofmetatranscriptomeswiththeaidofmetagenomicdata
AT kalevarsha metagtapipelinefordenovoassemblyofmetatranscriptomeswiththeaidofmetagenomicdata
AT finnrob metagtapipelinefordenovoassemblyofmetatranscriptomeswiththeaidofmetagenomicdata
AT lapidusallal metagtapipelinefordenovoassemblyofmetatranscriptomeswiththeaidofmetagenomicdata
AT korobeynikovanton metagtapipelinefordenovoassemblyofmetatranscriptomeswiththeaidofmetagenomicdata
AT prjibelskiandreyd metagtapipelinefordenovoassemblyofmetatranscriptomeswiththeaidofmetagenomicdata