Cargando…

DRAGoM: Classification and Quantification of Noncoding RNA in Metagenomic Data

Noncoding RNAs (ncRNAs) play important regulatory and functional roles in microorganisms, such as regulation of gene expression, signaling, protein synthesis, and RNA processing. Hence, their classification and quantification are central tasks toward the understanding of the function of the microbia...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Ben, Thippabhotla, Sirisha, Zhang, Jun, Zhong, Cuncong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8131839/
https://www.ncbi.nlm.nih.gov/pubmed/34025724
http://dx.doi.org/10.3389/fgene.2021.669495
_version_ 1783694790713409536
author Liu, Ben
Thippabhotla, Sirisha
Zhang, Jun
Zhong, Cuncong
author_facet Liu, Ben
Thippabhotla, Sirisha
Zhang, Jun
Zhong, Cuncong
author_sort Liu, Ben
collection PubMed
description Noncoding RNAs (ncRNAs) play important regulatory and functional roles in microorganisms, such as regulation of gene expression, signaling, protein synthesis, and RNA processing. Hence, their classification and quantification are central tasks toward the understanding of the function of the microbial community. However, the majority of the current metagenomic sequencing technologies generate short reads, which may contain only a partial secondary structure that complicates ncRNA homology detection. Meanwhile, de novo assembly of the metagenomic sequencing data remains challenging for complex communities. To tackle these challenges, we developed a novel algorithm called DRAGoM (Detection of RNA using Assembly Graph from Metagenomic data). DRAGoM first constructs a hybrid graph by merging an assembly string graph and an assembly de Bruijn graph. Then, it classifies paths in the hybrid graph and their constituent readsinto differentncRNA families based on both sequence and structural homology. Our benchmark experiments show that DRAGoMcan improve the performance and robustness over traditional approaches on the classification and quantification of a wide class of ncRNA families.
format Online
Article
Text
id pubmed-8131839
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-81318392021-05-20 DRAGoM: Classification and Quantification of Noncoding RNA in Metagenomic Data Liu, Ben Thippabhotla, Sirisha Zhang, Jun Zhong, Cuncong Front Genet Genetics Noncoding RNAs (ncRNAs) play important regulatory and functional roles in microorganisms, such as regulation of gene expression, signaling, protein synthesis, and RNA processing. Hence, their classification and quantification are central tasks toward the understanding of the function of the microbial community. However, the majority of the current metagenomic sequencing technologies generate short reads, which may contain only a partial secondary structure that complicates ncRNA homology detection. Meanwhile, de novo assembly of the metagenomic sequencing data remains challenging for complex communities. To tackle these challenges, we developed a novel algorithm called DRAGoM (Detection of RNA using Assembly Graph from Metagenomic data). DRAGoM first constructs a hybrid graph by merging an assembly string graph and an assembly de Bruijn graph. Then, it classifies paths in the hybrid graph and their constituent readsinto differentncRNA families based on both sequence and structural homology. Our benchmark experiments show that DRAGoMcan improve the performance and robustness over traditional approaches on the classification and quantification of a wide class of ncRNA families. Frontiers Media S.A. 2021-05-05 /pmc/articles/PMC8131839/ /pubmed/34025724 http://dx.doi.org/10.3389/fgene.2021.669495 Text en Copyright © 2021 Liu, Thippabhotla, Zhang and Zhong. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Liu, Ben
Thippabhotla, Sirisha
Zhang, Jun
Zhong, Cuncong
DRAGoM: Classification and Quantification of Noncoding RNA in Metagenomic Data
title DRAGoM: Classification and Quantification of Noncoding RNA in Metagenomic Data
title_full DRAGoM: Classification and Quantification of Noncoding RNA in Metagenomic Data
title_fullStr DRAGoM: Classification and Quantification of Noncoding RNA in Metagenomic Data
title_full_unstemmed DRAGoM: Classification and Quantification of Noncoding RNA in Metagenomic Data
title_short DRAGoM: Classification and Quantification of Noncoding RNA in Metagenomic Data
title_sort dragom: classification and quantification of noncoding rna in metagenomic data
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8131839/
https://www.ncbi.nlm.nih.gov/pubmed/34025724
http://dx.doi.org/10.3389/fgene.2021.669495
work_keys_str_mv AT liuben dragomclassificationandquantificationofnoncodingrnainmetagenomicdata
AT thippabhotlasirisha dragomclassificationandquantificationofnoncodingrnainmetagenomicdata
AT zhangjun dragomclassificationandquantificationofnoncodingrnainmetagenomicdata
AT zhongcuncong dragomclassificationandquantificationofnoncodingrnainmetagenomicdata