Cargando…
Microbial Identification Using rRNA Operon Region: Database and Tool for Metataxonomics with Long-Read Sequence
Recent development of long-read sequencing platforms has enabled researchers to explore bacterial community structure through analysis of full-length 16S rRNA gene (∼1,500 bp) or 16S-ITS-23S rRNA operon region (∼4,300 bp), resulting in higher taxonomic resolution than short-read sequencing platforms...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Society for Microbiology
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9045266/ https://www.ncbi.nlm.nih.gov/pubmed/35352997 http://dx.doi.org/10.1128/spectrum.02017-21 |
_version_ | 1784695277467205632 |
---|---|
author | Seol, Donghyeok Lim, Jin Soo Sung, Samsun Lee, Young Ho Jeong, Misun Cho, Seoae Kwak, Woori Kim, Heebal |
author_facet | Seol, Donghyeok Lim, Jin Soo Sung, Samsun Lee, Young Ho Jeong, Misun Cho, Seoae Kwak, Woori Kim, Heebal |
author_sort | Seol, Donghyeok |
collection | PubMed |
description | Recent development of long-read sequencing platforms has enabled researchers to explore bacterial community structure through analysis of full-length 16S rRNA gene (∼1,500 bp) or 16S-ITS-23S rRNA operon region (∼4,300 bp), resulting in higher taxonomic resolution than short-read sequencing platforms. Despite the potential of long-read sequencing in metagenomics, resources and protocols for this technology are scarce. Here, we describe MIrROR, the database and analysis tool for metataxonomics using the bacterial 16S-ITS-23S rRNA operon region. We collected 16S-ITS-23S rRNA operon sequences extracted from bacterial genomes from NCBI GenBank and performed curation. A total of 97,781 16S-ITS-23S rRNA operon sequences covering 9,485 species from 43,653 genomes were obtained. For user convenience, we provide an analysis tool based on a mapping strategy that can be used for taxonomic profiling with MIrROR database. To benchmark MIrROR, we compared performance against publicly available databases and tool with mock communities and simulated data sets. Our platform showed promising results in terms of the number of species covered and the accuracy of classification. To encourage active 16S-ITS-23S rRNA operon analysis in the field, BLAST function and taxonomic profiling results with 16S-ITS-23S rRNA operon studies, which have been reported as BioProject on NCBI are provided. MIrROR (http://mirror.egnome.co.kr/) will be a useful platform for researchers who want to perform high-resolution metagenome analysis with a cost-effective sequencer such as MinION from Oxford Nanopore Technologies. IMPORTANCE Metabarcoding is a powerful tool to investigate community diversity in an economic and efficient way by amplifying a specific gene marker region. With the advancement of long-read sequencing technologies, the field of metabarcoding has entered a new phase. The technologies have brought a need for development in several areas, including new markers that long-read can cover, database for the markers, tools that reflect long-read characteristics, and compatibility with downstream analysis tools. By constructing MIrROR, we met the need for a database and tools for the 16S-ITS-23S rRNA operon region, which has recently been shown to have sufficient resolution at the species level. Bacterial community analysis using the 16S-ITS-23S rRNA operon region with MIrROR will provide new insights from various research fields. |
format | Online Article Text |
id | pubmed-9045266 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | American Society for Microbiology |
record_format | MEDLINE/PubMed |
spelling | pubmed-90452662022-04-28 Microbial Identification Using rRNA Operon Region: Database and Tool for Metataxonomics with Long-Read Sequence Seol, Donghyeok Lim, Jin Soo Sung, Samsun Lee, Young Ho Jeong, Misun Cho, Seoae Kwak, Woori Kim, Heebal Microbiol Spectr Resource Report Recent development of long-read sequencing platforms has enabled researchers to explore bacterial community structure through analysis of full-length 16S rRNA gene (∼1,500 bp) or 16S-ITS-23S rRNA operon region (∼4,300 bp), resulting in higher taxonomic resolution than short-read sequencing platforms. Despite the potential of long-read sequencing in metagenomics, resources and protocols for this technology are scarce. Here, we describe MIrROR, the database and analysis tool for metataxonomics using the bacterial 16S-ITS-23S rRNA operon region. We collected 16S-ITS-23S rRNA operon sequences extracted from bacterial genomes from NCBI GenBank and performed curation. A total of 97,781 16S-ITS-23S rRNA operon sequences covering 9,485 species from 43,653 genomes were obtained. For user convenience, we provide an analysis tool based on a mapping strategy that can be used for taxonomic profiling with MIrROR database. To benchmark MIrROR, we compared performance against publicly available databases and tool with mock communities and simulated data sets. Our platform showed promising results in terms of the number of species covered and the accuracy of classification. To encourage active 16S-ITS-23S rRNA operon analysis in the field, BLAST function and taxonomic profiling results with 16S-ITS-23S rRNA operon studies, which have been reported as BioProject on NCBI are provided. MIrROR (http://mirror.egnome.co.kr/) will be a useful platform for researchers who want to perform high-resolution metagenome analysis with a cost-effective sequencer such as MinION from Oxford Nanopore Technologies. IMPORTANCE Metabarcoding is a powerful tool to investigate community diversity in an economic and efficient way by amplifying a specific gene marker region. With the advancement of long-read sequencing technologies, the field of metabarcoding has entered a new phase. The technologies have brought a need for development in several areas, including new markers that long-read can cover, database for the markers, tools that reflect long-read characteristics, and compatibility with downstream analysis tools. By constructing MIrROR, we met the need for a database and tools for the 16S-ITS-23S rRNA operon region, which has recently been shown to have sufficient resolution at the species level. Bacterial community analysis using the 16S-ITS-23S rRNA operon region with MIrROR will provide new insights from various research fields. American Society for Microbiology 2022-03-30 /pmc/articles/PMC9045266/ /pubmed/35352997 http://dx.doi.org/10.1128/spectrum.02017-21 Text en Copyright © 2022 Seol et al. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Resource Report Seol, Donghyeok Lim, Jin Soo Sung, Samsun Lee, Young Ho Jeong, Misun Cho, Seoae Kwak, Woori Kim, Heebal Microbial Identification Using rRNA Operon Region: Database and Tool for Metataxonomics with Long-Read Sequence |
title | Microbial Identification Using rRNA Operon Region: Database and Tool for Metataxonomics with Long-Read Sequence |
title_full | Microbial Identification Using rRNA Operon Region: Database and Tool for Metataxonomics with Long-Read Sequence |
title_fullStr | Microbial Identification Using rRNA Operon Region: Database and Tool for Metataxonomics with Long-Read Sequence |
title_full_unstemmed | Microbial Identification Using rRNA Operon Region: Database and Tool for Metataxonomics with Long-Read Sequence |
title_short | Microbial Identification Using rRNA Operon Region: Database and Tool for Metataxonomics with Long-Read Sequence |
title_sort | microbial identification using rrna operon region: database and tool for metataxonomics with long-read sequence |
topic | Resource Report |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9045266/ https://www.ncbi.nlm.nih.gov/pubmed/35352997 http://dx.doi.org/10.1128/spectrum.02017-21 |
work_keys_str_mv | AT seoldonghyeok microbialidentificationusingrrnaoperonregiondatabaseandtoolformetataxonomicswithlongreadsequence AT limjinsoo microbialidentificationusingrrnaoperonregiondatabaseandtoolformetataxonomicswithlongreadsequence AT sungsamsun microbialidentificationusingrrnaoperonregiondatabaseandtoolformetataxonomicswithlongreadsequence AT leeyoungho microbialidentificationusingrrnaoperonregiondatabaseandtoolformetataxonomicswithlongreadsequence AT jeongmisun microbialidentificationusingrrnaoperonregiondatabaseandtoolformetataxonomicswithlongreadsequence AT choseoae microbialidentificationusingrrnaoperonregiondatabaseandtoolformetataxonomicswithlongreadsequence AT kwakwoori microbialidentificationusingrrnaoperonregiondatabaseandtoolformetataxonomicswithlongreadsequence AT kimheebal microbialidentificationusingrrnaoperonregiondatabaseandtoolformetataxonomicswithlongreadsequence |