Cargando…

Microbial Identification Using rRNA Operon Region: Database and Tool for Metataxonomics with Long-Read Sequence

Recent development of long-read sequencing platforms has enabled researchers to explore bacterial community structure through analysis of full-length 16S rRNA gene (∼1,500 bp) or 16S-ITS-23S rRNA operon region (∼4,300 bp), resulting in higher taxonomic resolution than short-read sequencing platforms...

Descripción completa

Detalles Bibliográficos
Autores principales: Seol, Donghyeok, Lim, Jin Soo, Sung, Samsun, Lee, Young Ho, Jeong, Misun, Cho, Seoae, Kwak, Woori, Kim, Heebal
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Society for Microbiology 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9045266/
https://www.ncbi.nlm.nih.gov/pubmed/35352997
http://dx.doi.org/10.1128/spectrum.02017-21
_version_ 1784695277467205632
author Seol, Donghyeok
Lim, Jin Soo
Sung, Samsun
Lee, Young Ho
Jeong, Misun
Cho, Seoae
Kwak, Woori
Kim, Heebal
author_facet Seol, Donghyeok
Lim, Jin Soo
Sung, Samsun
Lee, Young Ho
Jeong, Misun
Cho, Seoae
Kwak, Woori
Kim, Heebal
author_sort Seol, Donghyeok
collection PubMed
description Recent development of long-read sequencing platforms has enabled researchers to explore bacterial community structure through analysis of full-length 16S rRNA gene (∼1,500 bp) or 16S-ITS-23S rRNA operon region (∼4,300 bp), resulting in higher taxonomic resolution than short-read sequencing platforms. Despite the potential of long-read sequencing in metagenomics, resources and protocols for this technology are scarce. Here, we describe MIrROR, the database and analysis tool for metataxonomics using the bacterial 16S-ITS-23S rRNA operon region. We collected 16S-ITS-23S rRNA operon sequences extracted from bacterial genomes from NCBI GenBank and performed curation. A total of 97,781 16S-ITS-23S rRNA operon sequences covering 9,485 species from 43,653 genomes were obtained. For user convenience, we provide an analysis tool based on a mapping strategy that can be used for taxonomic profiling with MIrROR database. To benchmark MIrROR, we compared performance against publicly available databases and tool with mock communities and simulated data sets. Our platform showed promising results in terms of the number of species covered and the accuracy of classification. To encourage active 16S-ITS-23S rRNA operon analysis in the field, BLAST function and taxonomic profiling results with 16S-ITS-23S rRNA operon studies, which have been reported as BioProject on NCBI are provided. MIrROR (http://mirror.egnome.co.kr/) will be a useful platform for researchers who want to perform high-resolution metagenome analysis with a cost-effective sequencer such as MinION from Oxford Nanopore Technologies. IMPORTANCE Metabarcoding is a powerful tool to investigate community diversity in an economic and efficient way by amplifying a specific gene marker region. With the advancement of long-read sequencing technologies, the field of metabarcoding has entered a new phase. The technologies have brought a need for development in several areas, including new markers that long-read can cover, database for the markers, tools that reflect long-read characteristics, and compatibility with downstream analysis tools. By constructing MIrROR, we met the need for a database and tools for the 16S-ITS-23S rRNA operon region, which has recently been shown to have sufficient resolution at the species level. Bacterial community analysis using the 16S-ITS-23S rRNA operon region with MIrROR will provide new insights from various research fields.
format Online
Article
Text
id pubmed-9045266
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher American Society for Microbiology
record_format MEDLINE/PubMed
spelling pubmed-90452662022-04-28 Microbial Identification Using rRNA Operon Region: Database and Tool for Metataxonomics with Long-Read Sequence Seol, Donghyeok Lim, Jin Soo Sung, Samsun Lee, Young Ho Jeong, Misun Cho, Seoae Kwak, Woori Kim, Heebal Microbiol Spectr Resource Report Recent development of long-read sequencing platforms has enabled researchers to explore bacterial community structure through analysis of full-length 16S rRNA gene (∼1,500 bp) or 16S-ITS-23S rRNA operon region (∼4,300 bp), resulting in higher taxonomic resolution than short-read sequencing platforms. Despite the potential of long-read sequencing in metagenomics, resources and protocols for this technology are scarce. Here, we describe MIrROR, the database and analysis tool for metataxonomics using the bacterial 16S-ITS-23S rRNA operon region. We collected 16S-ITS-23S rRNA operon sequences extracted from bacterial genomes from NCBI GenBank and performed curation. A total of 97,781 16S-ITS-23S rRNA operon sequences covering 9,485 species from 43,653 genomes were obtained. For user convenience, we provide an analysis tool based on a mapping strategy that can be used for taxonomic profiling with MIrROR database. To benchmark MIrROR, we compared performance against publicly available databases and tool with mock communities and simulated data sets. Our platform showed promising results in terms of the number of species covered and the accuracy of classification. To encourage active 16S-ITS-23S rRNA operon analysis in the field, BLAST function and taxonomic profiling results with 16S-ITS-23S rRNA operon studies, which have been reported as BioProject on NCBI are provided. MIrROR (http://mirror.egnome.co.kr/) will be a useful platform for researchers who want to perform high-resolution metagenome analysis with a cost-effective sequencer such as MinION from Oxford Nanopore Technologies. IMPORTANCE Metabarcoding is a powerful tool to investigate community diversity in an economic and efficient way by amplifying a specific gene marker region. With the advancement of long-read sequencing technologies, the field of metabarcoding has entered a new phase. The technologies have brought a need for development in several areas, including new markers that long-read can cover, database for the markers, tools that reflect long-read characteristics, and compatibility with downstream analysis tools. By constructing MIrROR, we met the need for a database and tools for the 16S-ITS-23S rRNA operon region, which has recently been shown to have sufficient resolution at the species level. Bacterial community analysis using the 16S-ITS-23S rRNA operon region with MIrROR will provide new insights from various research fields. American Society for Microbiology 2022-03-30 /pmc/articles/PMC9045266/ /pubmed/35352997 http://dx.doi.org/10.1128/spectrum.02017-21 Text en Copyright © 2022 Seol et al. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Resource Report
Seol, Donghyeok
Lim, Jin Soo
Sung, Samsun
Lee, Young Ho
Jeong, Misun
Cho, Seoae
Kwak, Woori
Kim, Heebal
Microbial Identification Using rRNA Operon Region: Database and Tool for Metataxonomics with Long-Read Sequence
title Microbial Identification Using rRNA Operon Region: Database and Tool for Metataxonomics with Long-Read Sequence
title_full Microbial Identification Using rRNA Operon Region: Database and Tool for Metataxonomics with Long-Read Sequence
title_fullStr Microbial Identification Using rRNA Operon Region: Database and Tool for Metataxonomics with Long-Read Sequence
title_full_unstemmed Microbial Identification Using rRNA Operon Region: Database and Tool for Metataxonomics with Long-Read Sequence
title_short Microbial Identification Using rRNA Operon Region: Database and Tool for Metataxonomics with Long-Read Sequence
title_sort microbial identification using rrna operon region: database and tool for metataxonomics with long-read sequence
topic Resource Report
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9045266/
https://www.ncbi.nlm.nih.gov/pubmed/35352997
http://dx.doi.org/10.1128/spectrum.02017-21
work_keys_str_mv AT seoldonghyeok microbialidentificationusingrrnaoperonregiondatabaseandtoolformetataxonomicswithlongreadsequence
AT limjinsoo microbialidentificationusingrrnaoperonregiondatabaseandtoolformetataxonomicswithlongreadsequence
AT sungsamsun microbialidentificationusingrrnaoperonregiondatabaseandtoolformetataxonomicswithlongreadsequence
AT leeyoungho microbialidentificationusingrrnaoperonregiondatabaseandtoolformetataxonomicswithlongreadsequence
AT jeongmisun microbialidentificationusingrrnaoperonregiondatabaseandtoolformetataxonomicswithlongreadsequence
AT choseoae microbialidentificationusingrrnaoperonregiondatabaseandtoolformetataxonomicswithlongreadsequence
AT kwakwoori microbialidentificationusingrrnaoperonregiondatabaseandtoolformetataxonomicswithlongreadsequence
AT kimheebal microbialidentificationusingrrnaoperonregiondatabaseandtoolformetataxonomicswithlongreadsequence