Cargando…
Selection of marker genes for genetic barcoding of microorganisms and binning of metagenomic reads by Barcoder software tools
BACKGROUND: Metagenomic approaches have revealed the complexity of environmental microbiomes with the advancement in whole genome sequencing displaying a significant level of genetic heterogeneity on the species level. It has become apparent that patterns of superior bioactivity of bacteria applicab...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6117900/ https://www.ncbi.nlm.nih.gov/pubmed/30165813 http://dx.doi.org/10.1186/s12859-018-2320-1 |
_version_ | 1783351833933119488 |
---|---|
author | Rotimi, Adeola M. Pierneef, Rian Reva, Oleg N. |
author_facet | Rotimi, Adeola M. Pierneef, Rian Reva, Oleg N. |
author_sort | Rotimi, Adeola M. |
collection | PubMed |
description | BACKGROUND: Metagenomic approaches have revealed the complexity of environmental microbiomes with the advancement in whole genome sequencing displaying a significant level of genetic heterogeneity on the species level. It has become apparent that patterns of superior bioactivity of bacteria applicable in biotechnology as well as the enhanced virulence of pathogens often requires distinguishing between closely related species or sub-species. Current methods for binning of metagenomic reads usually do not allow for identification below the genus level and generally stops at the family level. RESULTS: In this work, an attempt was made to improve metagenomic binning resolution by creating genome specific barcodes based on the core and accessory genomes. This protocol was implemented in novel software tools available for use and download from http://bargene.bi.up.ac.za/. The most abundant barcode genes from the core genomes were found to encode for ribosomal proteins, certain central metabolic genes and ABC transporters. Performance of metabarcode sequences created by this package was evaluated using artificially generated and publically available metagenomic datasets. Furthermore, a program (Barcoding 2.0) was developed to align reads against barcode sequences and thereafter calculate various parameters to score the alignments and the individual barcodes. Taxonomic units were identified in metagenomic samples by comparison of the calculated barcode scores to set cut-off values. In this study, it was found that varying sample sizes, i.e. number of reads in a metagenome and metabarcode lengths, had no significant effect on the sensitivity and specificity of the algorithm. Receiver operating characteristics (ROC) curves were calculated for different taxonomic groups based on the results of identification of the corresponding genomes in artificial metagenomic datasets. The reliability of distinguishing between species of the same genus or family by the program was nearly perfect. CONCLUSIONS: The results showed that the novel online tool BarcodeGenerator (http://bargene.bi.up.ac.za/) is an efficient approach for generating barcode sequences from a set of complete genomes provided by users. Another program, Barcoder 2.0 is available from the same resource to enable an efficient and practical use of metabarcodes for visualization of the distribution of organisms of interest in environmental and clinical samples. |
format | Online Article Text |
id | pubmed-6117900 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-61179002018-09-05 Selection of marker genes for genetic barcoding of microorganisms and binning of metagenomic reads by Barcoder software tools Rotimi, Adeola M. Pierneef, Rian Reva, Oleg N. BMC Bioinformatics Software BACKGROUND: Metagenomic approaches have revealed the complexity of environmental microbiomes with the advancement in whole genome sequencing displaying a significant level of genetic heterogeneity on the species level. It has become apparent that patterns of superior bioactivity of bacteria applicable in biotechnology as well as the enhanced virulence of pathogens often requires distinguishing between closely related species or sub-species. Current methods for binning of metagenomic reads usually do not allow for identification below the genus level and generally stops at the family level. RESULTS: In this work, an attempt was made to improve metagenomic binning resolution by creating genome specific barcodes based on the core and accessory genomes. This protocol was implemented in novel software tools available for use and download from http://bargene.bi.up.ac.za/. The most abundant barcode genes from the core genomes were found to encode for ribosomal proteins, certain central metabolic genes and ABC transporters. Performance of metabarcode sequences created by this package was evaluated using artificially generated and publically available metagenomic datasets. Furthermore, a program (Barcoding 2.0) was developed to align reads against barcode sequences and thereafter calculate various parameters to score the alignments and the individual barcodes. Taxonomic units were identified in metagenomic samples by comparison of the calculated barcode scores to set cut-off values. In this study, it was found that varying sample sizes, i.e. number of reads in a metagenome and metabarcode lengths, had no significant effect on the sensitivity and specificity of the algorithm. Receiver operating characteristics (ROC) curves were calculated for different taxonomic groups based on the results of identification of the corresponding genomes in artificial metagenomic datasets. The reliability of distinguishing between species of the same genus or family by the program was nearly perfect. CONCLUSIONS: The results showed that the novel online tool BarcodeGenerator (http://bargene.bi.up.ac.za/) is an efficient approach for generating barcode sequences from a set of complete genomes provided by users. Another program, Barcoder 2.0 is available from the same resource to enable an efficient and practical use of metabarcodes for visualization of the distribution of organisms of interest in environmental and clinical samples. BioMed Central 2018-08-30 /pmc/articles/PMC6117900/ /pubmed/30165813 http://dx.doi.org/10.1186/s12859-018-2320-1 Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Software Rotimi, Adeola M. Pierneef, Rian Reva, Oleg N. Selection of marker genes for genetic barcoding of microorganisms and binning of metagenomic reads by Barcoder software tools |
title | Selection of marker genes for genetic barcoding of microorganisms and binning of metagenomic reads by Barcoder software tools |
title_full | Selection of marker genes for genetic barcoding of microorganisms and binning of metagenomic reads by Barcoder software tools |
title_fullStr | Selection of marker genes for genetic barcoding of microorganisms and binning of metagenomic reads by Barcoder software tools |
title_full_unstemmed | Selection of marker genes for genetic barcoding of microorganisms and binning of metagenomic reads by Barcoder software tools |
title_short | Selection of marker genes for genetic barcoding of microorganisms and binning of metagenomic reads by Barcoder software tools |
title_sort | selection of marker genes for genetic barcoding of microorganisms and binning of metagenomic reads by barcoder software tools |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6117900/ https://www.ncbi.nlm.nih.gov/pubmed/30165813 http://dx.doi.org/10.1186/s12859-018-2320-1 |
work_keys_str_mv | AT rotimiadeolam selectionofmarkergenesforgeneticbarcodingofmicroorganismsandbinningofmetagenomicreadsbybarcodersoftwaretools AT pierneefrian selectionofmarkergenesforgeneticbarcodingofmicroorganismsandbinningofmetagenomicreadsbybarcodersoftwaretools AT revaolegn selectionofmarkergenesforgeneticbarcodingofmicroorganismsandbinningofmetagenomicreadsbybarcodersoftwaretools |