Cargando…

How clear is our current view on microbial dark matter? (Re-)assessing public MAG & SAG datasets with MDMcleaner

As of today, the majority of environmental microorganisms remain uncultured and is therefore referred to as ‘microbial dark matter’ (MDM). Hence, genomic insights into these organisms are limited to cultivation-independent approaches such as single-cell- and metagenomics. However, without access to...

Descripción completa

Detalles Bibliográficos
Autores principales: Vollmers, John, Wiegand, Sandra, Lenk, Florian, Kaster, Anne-Kristin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9303271/
https://www.ncbi.nlm.nih.gov/pubmed/35536293
http://dx.doi.org/10.1093/nar/gkac294
_version_ 1784751821505429504
author Vollmers, John
Wiegand, Sandra
Lenk, Florian
Kaster, Anne-Kristin
author_facet Vollmers, John
Wiegand, Sandra
Lenk, Florian
Kaster, Anne-Kristin
author_sort Vollmers, John
collection PubMed
description As of today, the majority of environmental microorganisms remain uncultured and is therefore referred to as ‘microbial dark matter’ (MDM). Hence, genomic insights into these organisms are limited to cultivation-independent approaches such as single-cell- and metagenomics. However, without access to cultured representatives for verifying correct taxon-assignments, MDM genomes may cause potentially misleading conclusions based on misclassified or contaminant contigs, thereby obfuscating our view on the uncultured microbial majority. Moreover, gradual database contaminations by past genome submissions can cause error propagations which affect present as well as future comparative genome analyses. Consequently, strict contamination detection and filtering need to be applied, especially in the case of uncultured MDM genomes. Current genome reporting standards, however, emphasize completeness over purity and the de facto gold standard genome assessment tool, checkM, discriminates against uncultured taxa and fragmented genomes. To tackle these issues, we present a novel contig classification, screening, and filtering workflow and corresponding open-source python implementation called MDMcleaner, which was tested and compared to other tools on mock and real datasets. MDMcleaner revealed substantial contaminations overlooked by current screening approaches and sensitively detects misattributed contigs in both novel genomes and the underlying reference databases, thereby greatly improving our view on ‘microbial dark matter’.
format Online
Article
Text
id pubmed-9303271
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-93032712022-07-22 How clear is our current view on microbial dark matter? (Re-)assessing public MAG & SAG datasets with MDMcleaner Vollmers, John Wiegand, Sandra Lenk, Florian Kaster, Anne-Kristin Nucleic Acids Res Methods Online As of today, the majority of environmental microorganisms remain uncultured and is therefore referred to as ‘microbial dark matter’ (MDM). Hence, genomic insights into these organisms are limited to cultivation-independent approaches such as single-cell- and metagenomics. However, without access to cultured representatives for verifying correct taxon-assignments, MDM genomes may cause potentially misleading conclusions based on misclassified or contaminant contigs, thereby obfuscating our view on the uncultured microbial majority. Moreover, gradual database contaminations by past genome submissions can cause error propagations which affect present as well as future comparative genome analyses. Consequently, strict contamination detection and filtering need to be applied, especially in the case of uncultured MDM genomes. Current genome reporting standards, however, emphasize completeness over purity and the de facto gold standard genome assessment tool, checkM, discriminates against uncultured taxa and fragmented genomes. To tackle these issues, we present a novel contig classification, screening, and filtering workflow and corresponding open-source python implementation called MDMcleaner, which was tested and compared to other tools on mock and real datasets. MDMcleaner revealed substantial contaminations overlooked by current screening approaches and sensitively detects misattributed contigs in both novel genomes and the underlying reference databases, thereby greatly improving our view on ‘microbial dark matter’. Oxford University Press 2022-05-10 /pmc/articles/PMC9303271/ /pubmed/35536293 http://dx.doi.org/10.1093/nar/gkac294 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of Nucleic Acids Research. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Methods Online
Vollmers, John
Wiegand, Sandra
Lenk, Florian
Kaster, Anne-Kristin
How clear is our current view on microbial dark matter? (Re-)assessing public MAG & SAG datasets with MDMcleaner
title How clear is our current view on microbial dark matter? (Re-)assessing public MAG & SAG datasets with MDMcleaner
title_full How clear is our current view on microbial dark matter? (Re-)assessing public MAG & SAG datasets with MDMcleaner
title_fullStr How clear is our current view on microbial dark matter? (Re-)assessing public MAG & SAG datasets with MDMcleaner
title_full_unstemmed How clear is our current view on microbial dark matter? (Re-)assessing public MAG & SAG datasets with MDMcleaner
title_short How clear is our current view on microbial dark matter? (Re-)assessing public MAG & SAG datasets with MDMcleaner
title_sort how clear is our current view on microbial dark matter? (re-)assessing public mag & sag datasets with mdmcleaner
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9303271/
https://www.ncbi.nlm.nih.gov/pubmed/35536293
http://dx.doi.org/10.1093/nar/gkac294
work_keys_str_mv AT vollmersjohn howclearisourcurrentviewonmicrobialdarkmatterreassessingpublicmagsagdatasetswithmdmcleaner
AT wiegandsandra howclearisourcurrentviewonmicrobialdarkmatterreassessingpublicmagsagdatasetswithmdmcleaner
AT lenkflorian howclearisourcurrentviewonmicrobialdarkmatterreassessingpublicmagsagdatasetswithmdmcleaner
AT kasterannekristin howclearisourcurrentviewonmicrobialdarkmatterreassessingpublicmagsagdatasetswithmdmcleaner