Cargando…

NLRexpress—A bundle of machine learning motif predictors—Reveals motif stability underlying plant Nod-like receptors diversity

Examination of a collection of over 80,000 Plant Nod-like receptors (NLRs) revealed an overwhelming sequence diversity underlying functional specificity of pathogen detection, signaling and cooperativity. The NLR canonical building blocks—CC/TIR/RPW8, NBS and LRR—contain, however, a number of conser...

Descripción completa

Detalles Bibliográficos
Autores principales: Martin, Eliza C., Spiridon, Laurentiu, Goverse, Aska, Petrescu, Andrei-José
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9519389/
https://www.ncbi.nlm.nih.gov/pubmed/36186050
http://dx.doi.org/10.3389/fpls.2022.975888
_version_ 1784799386992115712
author Martin, Eliza C.
Spiridon, Laurentiu
Goverse, Aska
Petrescu, Andrei-José
author_facet Martin, Eliza C.
Spiridon, Laurentiu
Goverse, Aska
Petrescu, Andrei-José
author_sort Martin, Eliza C.
collection PubMed
description Examination of a collection of over 80,000 Plant Nod-like receptors (NLRs) revealed an overwhelming sequence diversity underlying functional specificity of pathogen detection, signaling and cooperativity. The NLR canonical building blocks—CC/TIR/RPW8, NBS and LRR—contain, however, a number of conserved sequence motifs showing a significant degree of invariance amongst different NLR groups. To identify these motifs we developed NLRexpress—a bundle of 17 machine learning (ML)-based predictors, able to swiftly and precisely detect CC, TIR, NBS, and LRR motifs while minimizing computing time without accuracy losses—aimed as an instrument scalable for screening overall proteomes, transcriptomes or genomes for identifying integral NLRs and discriminating them against incomplete sequences lacking key motifs. These predictors were further used to screen a subset of ∼34,000 regular plant NLR sequences. Motifs were analyzed using unsupervised ML techniques to assess the structural correlations hidden underneath pattern variabilities. Both the NB-ARC switch domain which admittedly is the most conserved region of NLRs and the highly diverse LRR domain with its vastly variable lengths and repeat irregularities—show well-defined relations between motif subclasses, highlighting the importance of structural invariance in shaping NLR sequence diversity. The online NLRexpress webserver can be accessed at https://nlrexpress.biochim.ro.
format Online
Article
Text
id pubmed-9519389
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-95193892022-09-30 NLRexpress—A bundle of machine learning motif predictors—Reveals motif stability underlying plant Nod-like receptors diversity Martin, Eliza C. Spiridon, Laurentiu Goverse, Aska Petrescu, Andrei-José Front Plant Sci Plant Science Examination of a collection of over 80,000 Plant Nod-like receptors (NLRs) revealed an overwhelming sequence diversity underlying functional specificity of pathogen detection, signaling and cooperativity. The NLR canonical building blocks—CC/TIR/RPW8, NBS and LRR—contain, however, a number of conserved sequence motifs showing a significant degree of invariance amongst different NLR groups. To identify these motifs we developed NLRexpress—a bundle of 17 machine learning (ML)-based predictors, able to swiftly and precisely detect CC, TIR, NBS, and LRR motifs while minimizing computing time without accuracy losses—aimed as an instrument scalable for screening overall proteomes, transcriptomes or genomes for identifying integral NLRs and discriminating them against incomplete sequences lacking key motifs. These predictors were further used to screen a subset of ∼34,000 regular plant NLR sequences. Motifs were analyzed using unsupervised ML techniques to assess the structural correlations hidden underneath pattern variabilities. Both the NB-ARC switch domain which admittedly is the most conserved region of NLRs and the highly diverse LRR domain with its vastly variable lengths and repeat irregularities—show well-defined relations between motif subclasses, highlighting the importance of structural invariance in shaping NLR sequence diversity. The online NLRexpress webserver can be accessed at https://nlrexpress.biochim.ro. Frontiers Media S.A. 2022-09-15 /pmc/articles/PMC9519389/ /pubmed/36186050 http://dx.doi.org/10.3389/fpls.2022.975888 Text en Copyright © 2022 Martin, Spiridon, Goverse and Petrescu. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Plant Science
Martin, Eliza C.
Spiridon, Laurentiu
Goverse, Aska
Petrescu, Andrei-José
NLRexpress—A bundle of machine learning motif predictors—Reveals motif stability underlying plant Nod-like receptors diversity
title NLRexpress—A bundle of machine learning motif predictors—Reveals motif stability underlying plant Nod-like receptors diversity
title_full NLRexpress—A bundle of machine learning motif predictors—Reveals motif stability underlying plant Nod-like receptors diversity
title_fullStr NLRexpress—A bundle of machine learning motif predictors—Reveals motif stability underlying plant Nod-like receptors diversity
title_full_unstemmed NLRexpress—A bundle of machine learning motif predictors—Reveals motif stability underlying plant Nod-like receptors diversity
title_short NLRexpress—A bundle of machine learning motif predictors—Reveals motif stability underlying plant Nod-like receptors diversity
title_sort nlrexpress—a bundle of machine learning motif predictors—reveals motif stability underlying plant nod-like receptors diversity
topic Plant Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9519389/
https://www.ncbi.nlm.nih.gov/pubmed/36186050
http://dx.doi.org/10.3389/fpls.2022.975888
work_keys_str_mv AT martinelizac nlrexpressabundleofmachinelearningmotifpredictorsrevealsmotifstabilityunderlyingplantnodlikereceptorsdiversity
AT spiridonlaurentiu nlrexpressabundleofmachinelearningmotifpredictorsrevealsmotifstabilityunderlyingplantnodlikereceptorsdiversity
AT goverseaska nlrexpressabundleofmachinelearningmotifpredictorsrevealsmotifstabilityunderlyingplantnodlikereceptorsdiversity
AT petrescuandreijose nlrexpressabundleofmachinelearningmotifpredictorsrevealsmotifstabilityunderlyingplantnodlikereceptorsdiversity