Cargando…

Reconstructing ribosomal genes from large scale total RNA meta-transcriptomic data

MOTIVATION: Technological advances in meta-transcriptomics have enabled a deeper understanding of the structure and function of microbial communities. ‘Total RNA’ meta-transcriptomics, sequencing of total reverse transcribed RNA, provides a unique opportunity to investigate both the structure and fu...

Descripción completa

Detalles Bibliográficos
Autores principales: Xue, Yaxin, Lanzén, Anders, Jonassen, Inge
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7267836/
https://www.ncbi.nlm.nih.gov/pubmed/32167532
http://dx.doi.org/10.1093/bioinformatics/btaa177
_version_ 1783541486483144704
author Xue, Yaxin
Lanzén, Anders
Jonassen, Inge
author_facet Xue, Yaxin
Lanzén, Anders
Jonassen, Inge
author_sort Xue, Yaxin
collection PubMed
description MOTIVATION: Technological advances in meta-transcriptomics have enabled a deeper understanding of the structure and function of microbial communities. ‘Total RNA’ meta-transcriptomics, sequencing of total reverse transcribed RNA, provides a unique opportunity to investigate both the structure and function of active microbial communities from all three domains of life simultaneously. A major step of this approach is the reconstruction of full-length taxonomic marker genes such as the small subunit ribosomal RNA. However, current tools for this purpose are mainly targeted towards analysis of amplicon and metagenomic data and thus lack the ability to handle the massive and complex datasets typically resulting from total RNA experiments. RESULTS: In this work, we introduce MetaRib, a new tool for reconstructing ribosomal gene sequences from total RNA meta-transcriptomic data. MetaRib is based on the popular rRNA assembly program EMIRGE, together with several improvements. We address the challenge posed by large complex datasets by integrating sub-assembly, dereplication and mapping in an iterative approach, with additional post-processing steps. We applied the method to both simulated and real-world datasets. Our results show that MetaRib can deal with larger datasets and recover more rRNA genes, which achieve around 60 times speedup and higher F1 score compared to EMIRGE in simulated datasets. In the real-world dataset, it shows similar trends but recovers more contigs compared with a previous analysis based on random sub-sampling, while enabling the comparison of individual contig abundances across samples for the first time. AVAILABILITY AND IMPLEMENTATION: The source code of MetaRib is freely available at https://github.com/yxxue/MetaRib. CONTACT: yaxin.xue@uib.no or Inge.Jonassen@uib.no SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-7267836
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-72678362020-06-09 Reconstructing ribosomal genes from large scale total RNA meta-transcriptomic data Xue, Yaxin Lanzén, Anders Jonassen, Inge Bioinformatics Original Papers MOTIVATION: Technological advances in meta-transcriptomics have enabled a deeper understanding of the structure and function of microbial communities. ‘Total RNA’ meta-transcriptomics, sequencing of total reverse transcribed RNA, provides a unique opportunity to investigate both the structure and function of active microbial communities from all three domains of life simultaneously. A major step of this approach is the reconstruction of full-length taxonomic marker genes such as the small subunit ribosomal RNA. However, current tools for this purpose are mainly targeted towards analysis of amplicon and metagenomic data and thus lack the ability to handle the massive and complex datasets typically resulting from total RNA experiments. RESULTS: In this work, we introduce MetaRib, a new tool for reconstructing ribosomal gene sequences from total RNA meta-transcriptomic data. MetaRib is based on the popular rRNA assembly program EMIRGE, together with several improvements. We address the challenge posed by large complex datasets by integrating sub-assembly, dereplication and mapping in an iterative approach, with additional post-processing steps. We applied the method to both simulated and real-world datasets. Our results show that MetaRib can deal with larger datasets and recover more rRNA genes, which achieve around 60 times speedup and higher F1 score compared to EMIRGE in simulated datasets. In the real-world dataset, it shows similar trends but recovers more contigs compared with a previous analysis based on random sub-sampling, while enabling the comparison of individual contig abundances across samples for the first time. AVAILABILITY AND IMPLEMENTATION: The source code of MetaRib is freely available at https://github.com/yxxue/MetaRib. CONTACT: yaxin.xue@uib.no or Inge.Jonassen@uib.no SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2020-06 2020-03-13 /pmc/articles/PMC7267836/ /pubmed/32167532 http://dx.doi.org/10.1093/bioinformatics/btaa177 Text en © The Author(s) 2020. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Xue, Yaxin
Lanzén, Anders
Jonassen, Inge
Reconstructing ribosomal genes from large scale total RNA meta-transcriptomic data
title Reconstructing ribosomal genes from large scale total RNA meta-transcriptomic data
title_full Reconstructing ribosomal genes from large scale total RNA meta-transcriptomic data
title_fullStr Reconstructing ribosomal genes from large scale total RNA meta-transcriptomic data
title_full_unstemmed Reconstructing ribosomal genes from large scale total RNA meta-transcriptomic data
title_short Reconstructing ribosomal genes from large scale total RNA meta-transcriptomic data
title_sort reconstructing ribosomal genes from large scale total rna meta-transcriptomic data
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7267836/
https://www.ncbi.nlm.nih.gov/pubmed/32167532
http://dx.doi.org/10.1093/bioinformatics/btaa177
work_keys_str_mv AT xueyaxin reconstructingribosomalgenesfromlargescaletotalrnametatranscriptomicdata
AT lanzenanders reconstructingribosomalgenesfromlargescaletotalrnametatranscriptomicdata
AT jonasseninge reconstructingribosomalgenesfromlargescaletotalrnametatranscriptomicdata