Cargando…

Large-scale genomic analyses with machine learning uncover predictive patterns associated with fungal phytopathogenic lifestyles and traits

Invasive plant pathogenic fungi have a global impact, with devastating economic and environmental effects on crops and forests. Biosurveillance, a critical component of threat mitigation, requires risk prediction based on fungal lifestyles and traits. Recent studies have revealed distinct genomic pa...

Descripción completa

Detalles Bibliográficos
Autores principales: Dort, E. N., Layne, E., Feau, N., Butyaev, A., Henrissat, B., Martin, F. M., Haridas, S., Salamov, A., Grigoriev, I. V., Blanchette, M., Hamelin, R. C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10567782/
https://www.ncbi.nlm.nih.gov/pubmed/37821494
http://dx.doi.org/10.1038/s41598-023-44005-w
_version_ 1785119208845082624
author Dort, E. N.
Layne, E.
Feau, N.
Butyaev, A.
Henrissat, B.
Martin, F. M.
Haridas, S.
Salamov, A.
Grigoriev, I. V.
Blanchette, M.
Hamelin, R. C.
author_facet Dort, E. N.
Layne, E.
Feau, N.
Butyaev, A.
Henrissat, B.
Martin, F. M.
Haridas, S.
Salamov, A.
Grigoriev, I. V.
Blanchette, M.
Hamelin, R. C.
author_sort Dort, E. N.
collection PubMed
description Invasive plant pathogenic fungi have a global impact, with devastating economic and environmental effects on crops and forests. Biosurveillance, a critical component of threat mitigation, requires risk prediction based on fungal lifestyles and traits. Recent studies have revealed distinct genomic patterns associated with specific groups of plant pathogenic fungi. We sought to establish whether these phytopathogenic genomic patterns hold across diverse taxonomic and ecological groups from the Ascomycota and Basidiomycota, and furthermore, if those patterns can be used in a predictive capacity for biosurveillance. Using a supervised machine learning approach that integrates phylogenetic and genomic data, we analyzed 387 fungal genomes to test a proof-of-concept for the use of genomic signatures in predicting fungal phytopathogenic lifestyles and traits during biosurveillance activities. Our machine learning feature sets were derived from genome annotation data of carbohydrate-active enzymes (CAZymes), peptidases, secondary metabolite clusters (SMCs), transporters, and transcription factors. We found that machine learning could successfully predict fungal lifestyles and traits across taxonomic groups, with the best predictive performance coming from feature sets comprising CAZyme, peptidase, and SMC data. While phylogeny was an important component in most predictions, the inclusion of genomic data improved prediction performance for every lifestyle and trait tested. Plant pathogenicity was one of the best-predicted traits, showing the promise of predictive genomics for biosurveillance applications. Furthermore, our machine learning approach revealed expansions in the number of genes from specific CAZyme and peptidase families in the genomes of plant pathogens compared to non-phytopathogenic genomes (saprotrophs, endo- and ectomycorrhizal fungi). Such genomic feature profiles give insight into the evolution of fungal phytopathogenicity and could be useful to predict the risks of unknown fungi in future biosurveillance activities.
format Online
Article
Text
id pubmed-10567782
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-105677822023-10-13 Large-scale genomic analyses with machine learning uncover predictive patterns associated with fungal phytopathogenic lifestyles and traits Dort, E. N. Layne, E. Feau, N. Butyaev, A. Henrissat, B. Martin, F. M. Haridas, S. Salamov, A. Grigoriev, I. V. Blanchette, M. Hamelin, R. C. Sci Rep Article Invasive plant pathogenic fungi have a global impact, with devastating economic and environmental effects on crops and forests. Biosurveillance, a critical component of threat mitigation, requires risk prediction based on fungal lifestyles and traits. Recent studies have revealed distinct genomic patterns associated with specific groups of plant pathogenic fungi. We sought to establish whether these phytopathogenic genomic patterns hold across diverse taxonomic and ecological groups from the Ascomycota and Basidiomycota, and furthermore, if those patterns can be used in a predictive capacity for biosurveillance. Using a supervised machine learning approach that integrates phylogenetic and genomic data, we analyzed 387 fungal genomes to test a proof-of-concept for the use of genomic signatures in predicting fungal phytopathogenic lifestyles and traits during biosurveillance activities. Our machine learning feature sets were derived from genome annotation data of carbohydrate-active enzymes (CAZymes), peptidases, secondary metabolite clusters (SMCs), transporters, and transcription factors. We found that machine learning could successfully predict fungal lifestyles and traits across taxonomic groups, with the best predictive performance coming from feature sets comprising CAZyme, peptidase, and SMC data. While phylogeny was an important component in most predictions, the inclusion of genomic data improved prediction performance for every lifestyle and trait tested. Plant pathogenicity was one of the best-predicted traits, showing the promise of predictive genomics for biosurveillance applications. Furthermore, our machine learning approach revealed expansions in the number of genes from specific CAZyme and peptidase families in the genomes of plant pathogens compared to non-phytopathogenic genomes (saprotrophs, endo- and ectomycorrhizal fungi). Such genomic feature profiles give insight into the evolution of fungal phytopathogenicity and could be useful to predict the risks of unknown fungi in future biosurveillance activities. Nature Publishing Group UK 2023-10-11 /pmc/articles/PMC10567782/ /pubmed/37821494 http://dx.doi.org/10.1038/s41598-023-44005-w Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Dort, E. N.
Layne, E.
Feau, N.
Butyaev, A.
Henrissat, B.
Martin, F. M.
Haridas, S.
Salamov, A.
Grigoriev, I. V.
Blanchette, M.
Hamelin, R. C.
Large-scale genomic analyses with machine learning uncover predictive patterns associated with fungal phytopathogenic lifestyles and traits
title Large-scale genomic analyses with machine learning uncover predictive patterns associated with fungal phytopathogenic lifestyles and traits
title_full Large-scale genomic analyses with machine learning uncover predictive patterns associated with fungal phytopathogenic lifestyles and traits
title_fullStr Large-scale genomic analyses with machine learning uncover predictive patterns associated with fungal phytopathogenic lifestyles and traits
title_full_unstemmed Large-scale genomic analyses with machine learning uncover predictive patterns associated with fungal phytopathogenic lifestyles and traits
title_short Large-scale genomic analyses with machine learning uncover predictive patterns associated with fungal phytopathogenic lifestyles and traits
title_sort large-scale genomic analyses with machine learning uncover predictive patterns associated with fungal phytopathogenic lifestyles and traits
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10567782/
https://www.ncbi.nlm.nih.gov/pubmed/37821494
http://dx.doi.org/10.1038/s41598-023-44005-w
work_keys_str_mv AT dorten largescalegenomicanalyseswithmachinelearninguncoverpredictivepatternsassociatedwithfungalphytopathogeniclifestylesandtraits
AT laynee largescalegenomicanalyseswithmachinelearninguncoverpredictivepatternsassociatedwithfungalphytopathogeniclifestylesandtraits
AT feaun largescalegenomicanalyseswithmachinelearninguncoverpredictivepatternsassociatedwithfungalphytopathogeniclifestylesandtraits
AT butyaeva largescalegenomicanalyseswithmachinelearninguncoverpredictivepatternsassociatedwithfungalphytopathogeniclifestylesandtraits
AT henrissatb largescalegenomicanalyseswithmachinelearninguncoverpredictivepatternsassociatedwithfungalphytopathogeniclifestylesandtraits
AT martinfm largescalegenomicanalyseswithmachinelearninguncoverpredictivepatternsassociatedwithfungalphytopathogeniclifestylesandtraits
AT haridass largescalegenomicanalyseswithmachinelearninguncoverpredictivepatternsassociatedwithfungalphytopathogeniclifestylesandtraits
AT salamova largescalegenomicanalyseswithmachinelearninguncoverpredictivepatternsassociatedwithfungalphytopathogeniclifestylesandtraits
AT grigorieviv largescalegenomicanalyseswithmachinelearninguncoverpredictivepatternsassociatedwithfungalphytopathogeniclifestylesandtraits
AT blanchettem largescalegenomicanalyseswithmachinelearninguncoverpredictivepatternsassociatedwithfungalphytopathogeniclifestylesandtraits
AT hamelinrc largescalegenomicanalyseswithmachinelearninguncoverpredictivepatternsassociatedwithfungalphytopathogeniclifestylesandtraits