Cargando…
How clear is our current view on microbial dark matter? (Re-)assessing public MAG & SAG datasets with MDMcleaner
As of today, the majority of environmental microorganisms remain uncultured and is therefore referred to as ‘microbial dark matter’ (MDM). Hence, genomic insights into these organisms are limited to cultivation-independent approaches such as single-cell- and metagenomics. However, without access to...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9303271/ https://www.ncbi.nlm.nih.gov/pubmed/35536293 http://dx.doi.org/10.1093/nar/gkac294 |
_version_ | 1784751821505429504 |
---|---|
author | Vollmers, John Wiegand, Sandra Lenk, Florian Kaster, Anne-Kristin |
author_facet | Vollmers, John Wiegand, Sandra Lenk, Florian Kaster, Anne-Kristin |
author_sort | Vollmers, John |
collection | PubMed |
description | As of today, the majority of environmental microorganisms remain uncultured and is therefore referred to as ‘microbial dark matter’ (MDM). Hence, genomic insights into these organisms are limited to cultivation-independent approaches such as single-cell- and metagenomics. However, without access to cultured representatives for verifying correct taxon-assignments, MDM genomes may cause potentially misleading conclusions based on misclassified or contaminant contigs, thereby obfuscating our view on the uncultured microbial majority. Moreover, gradual database contaminations by past genome submissions can cause error propagations which affect present as well as future comparative genome analyses. Consequently, strict contamination detection and filtering need to be applied, especially in the case of uncultured MDM genomes. Current genome reporting standards, however, emphasize completeness over purity and the de facto gold standard genome assessment tool, checkM, discriminates against uncultured taxa and fragmented genomes. To tackle these issues, we present a novel contig classification, screening, and filtering workflow and corresponding open-source python implementation called MDMcleaner, which was tested and compared to other tools on mock and real datasets. MDMcleaner revealed substantial contaminations overlooked by current screening approaches and sensitively detects misattributed contigs in both novel genomes and the underlying reference databases, thereby greatly improving our view on ‘microbial dark matter’. |
format | Online Article Text |
id | pubmed-9303271 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-93032712022-07-22 How clear is our current view on microbial dark matter? (Re-)assessing public MAG & SAG datasets with MDMcleaner Vollmers, John Wiegand, Sandra Lenk, Florian Kaster, Anne-Kristin Nucleic Acids Res Methods Online As of today, the majority of environmental microorganisms remain uncultured and is therefore referred to as ‘microbial dark matter’ (MDM). Hence, genomic insights into these organisms are limited to cultivation-independent approaches such as single-cell- and metagenomics. However, without access to cultured representatives for verifying correct taxon-assignments, MDM genomes may cause potentially misleading conclusions based on misclassified or contaminant contigs, thereby obfuscating our view on the uncultured microbial majority. Moreover, gradual database contaminations by past genome submissions can cause error propagations which affect present as well as future comparative genome analyses. Consequently, strict contamination detection and filtering need to be applied, especially in the case of uncultured MDM genomes. Current genome reporting standards, however, emphasize completeness over purity and the de facto gold standard genome assessment tool, checkM, discriminates against uncultured taxa and fragmented genomes. To tackle these issues, we present a novel contig classification, screening, and filtering workflow and corresponding open-source python implementation called MDMcleaner, which was tested and compared to other tools on mock and real datasets. MDMcleaner revealed substantial contaminations overlooked by current screening approaches and sensitively detects misattributed contigs in both novel genomes and the underlying reference databases, thereby greatly improving our view on ‘microbial dark matter’. Oxford University Press 2022-05-10 /pmc/articles/PMC9303271/ /pubmed/35536293 http://dx.doi.org/10.1093/nar/gkac294 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of Nucleic Acids Research. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Methods Online Vollmers, John Wiegand, Sandra Lenk, Florian Kaster, Anne-Kristin How clear is our current view on microbial dark matter? (Re-)assessing public MAG & SAG datasets with MDMcleaner |
title | How clear is our current view on microbial dark matter? (Re-)assessing public MAG & SAG datasets with MDMcleaner |
title_full | How clear is our current view on microbial dark matter? (Re-)assessing public MAG & SAG datasets with MDMcleaner |
title_fullStr | How clear is our current view on microbial dark matter? (Re-)assessing public MAG & SAG datasets with MDMcleaner |
title_full_unstemmed | How clear is our current view on microbial dark matter? (Re-)assessing public MAG & SAG datasets with MDMcleaner |
title_short | How clear is our current view on microbial dark matter? (Re-)assessing public MAG & SAG datasets with MDMcleaner |
title_sort | how clear is our current view on microbial dark matter? (re-)assessing public mag & sag datasets with mdmcleaner |
topic | Methods Online |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9303271/ https://www.ncbi.nlm.nih.gov/pubmed/35536293 http://dx.doi.org/10.1093/nar/gkac294 |
work_keys_str_mv | AT vollmersjohn howclearisourcurrentviewonmicrobialdarkmatterreassessingpublicmagsagdatasetswithmdmcleaner AT wiegandsandra howclearisourcurrentviewonmicrobialdarkmatterreassessingpublicmagsagdatasetswithmdmcleaner AT lenkflorian howclearisourcurrentviewonmicrobialdarkmatterreassessingpublicmagsagdatasetswithmdmcleaner AT kasterannekristin howclearisourcurrentviewonmicrobialdarkmatterreassessingpublicmagsagdatasetswithmdmcleaner |