Cargando…

Machine Learning and Deep Learning Applications in Metagenomic Taxonomy and Functional Annotation

Shotgun sequencing of environmental DNA (i.e., metagenomics) has revolutionized the field of environmental microbiology, allowing the characterization of all microorganisms in a sequencing experiment. To identify the microbes in terms of taxonomy and biological activity, the sequenced reads must nec...

Descripción completa

Detalles Bibliográficos
Autores principales: Mathieu, Alban, Leclercq, Mickael, Sanabria, Melissa, Perin, Olivier, Droit, Arnaud
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8964132/
https://www.ncbi.nlm.nih.gov/pubmed/35359727
http://dx.doi.org/10.3389/fmicb.2022.811495
_version_ 1784678143391432704
author Mathieu, Alban
Leclercq, Mickael
Sanabria, Melissa
Perin, Olivier
Droit, Arnaud
author_facet Mathieu, Alban
Leclercq, Mickael
Sanabria, Melissa
Perin, Olivier
Droit, Arnaud
author_sort Mathieu, Alban
collection PubMed
description Shotgun sequencing of environmental DNA (i.e., metagenomics) has revolutionized the field of environmental microbiology, allowing the characterization of all microorganisms in a sequencing experiment. To identify the microbes in terms of taxonomy and biological activity, the sequenced reads must necessarily be aligned on known microbial genomes/genes. However, current alignment methods are limited in terms of speed and can produce a significant number of false positives when detecting bacterial species or false negatives in specific cases (virus, plasmids, and gene detection). Moreover, recent advances in metagenomics have enabled the reconstruction of new genomes using de novo binning strategies, but these genomes, not yet fully characterized, are not used in classic approaches, whereas machine and deep learning methods can use them as models. In this article, we attempted to review the different methods and their efficiency to improve the annotation of metagenomic sequences. Deep learning models have reached the performance of the widely used k-mer alignment-based tools, with better accuracy in certain cases; however, they still must demonstrate their robustness across the variety of environmental samples and across the rapid expansion of accessible genomes in databases.
format Online
Article
Text
id pubmed-8964132
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-89641322022-03-30 Machine Learning and Deep Learning Applications in Metagenomic Taxonomy and Functional Annotation Mathieu, Alban Leclercq, Mickael Sanabria, Melissa Perin, Olivier Droit, Arnaud Front Microbiol Microbiology Shotgun sequencing of environmental DNA (i.e., metagenomics) has revolutionized the field of environmental microbiology, allowing the characterization of all microorganisms in a sequencing experiment. To identify the microbes in terms of taxonomy and biological activity, the sequenced reads must necessarily be aligned on known microbial genomes/genes. However, current alignment methods are limited in terms of speed and can produce a significant number of false positives when detecting bacterial species or false negatives in specific cases (virus, plasmids, and gene detection). Moreover, recent advances in metagenomics have enabled the reconstruction of new genomes using de novo binning strategies, but these genomes, not yet fully characterized, are not used in classic approaches, whereas machine and deep learning methods can use them as models. In this article, we attempted to review the different methods and their efficiency to improve the annotation of metagenomic sequences. Deep learning models have reached the performance of the widely used k-mer alignment-based tools, with better accuracy in certain cases; however, they still must demonstrate their robustness across the variety of environmental samples and across the rapid expansion of accessible genomes in databases. Frontiers Media S.A. 2022-03-14 /pmc/articles/PMC8964132/ /pubmed/35359727 http://dx.doi.org/10.3389/fmicb.2022.811495 Text en Copyright © 2022 Mathieu, Leclercq, Sanabria, Perin and Droit. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Microbiology
Mathieu, Alban
Leclercq, Mickael
Sanabria, Melissa
Perin, Olivier
Droit, Arnaud
Machine Learning and Deep Learning Applications in Metagenomic Taxonomy and Functional Annotation
title Machine Learning and Deep Learning Applications in Metagenomic Taxonomy and Functional Annotation
title_full Machine Learning and Deep Learning Applications in Metagenomic Taxonomy and Functional Annotation
title_fullStr Machine Learning and Deep Learning Applications in Metagenomic Taxonomy and Functional Annotation
title_full_unstemmed Machine Learning and Deep Learning Applications in Metagenomic Taxonomy and Functional Annotation
title_short Machine Learning and Deep Learning Applications in Metagenomic Taxonomy and Functional Annotation
title_sort machine learning and deep learning applications in metagenomic taxonomy and functional annotation
topic Microbiology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8964132/
https://www.ncbi.nlm.nih.gov/pubmed/35359727
http://dx.doi.org/10.3389/fmicb.2022.811495
work_keys_str_mv AT mathieualban machinelearninganddeeplearningapplicationsinmetagenomictaxonomyandfunctionalannotation
AT leclercqmickael machinelearninganddeeplearningapplicationsinmetagenomictaxonomyandfunctionalannotation
AT sanabriamelissa machinelearninganddeeplearningapplicationsinmetagenomictaxonomyandfunctionalannotation
AT perinolivier machinelearninganddeeplearningapplicationsinmetagenomictaxonomyandfunctionalannotation
AT droitarnaud machinelearninganddeeplearningapplicationsinmetagenomictaxonomyandfunctionalannotation