Cargando…

RLPredictiOme, a Machine Learning-Derived Method for High-Throughput Prediction of Plant Receptor-like Proteins, Reveals Novel Classes of Transmembrane Receptors

Cell surface receptors play essential roles in perceiving and processing external and internal signals at the cell surface of plants and animals. The receptor-like protein kinases (RLK) and receptor-like proteins (RLPs), two major classes of proteins with membrane receptor configuration, play a cruc...

Descripción completa

Detalles Bibliográficos
Autores principales: Silva, Jose Cleydson F., Ferreira, Marco Aurélio, Carvalho, Thales F. M., Silva, Fabyano F., de A. Silveira, Sabrina, Brommonschenkel, Sergio H., Fontes, Elizabeth P. B.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9603095/
https://www.ncbi.nlm.nih.gov/pubmed/36293031
http://dx.doi.org/10.3390/ijms232012176
_version_ 1784817462883123200
author Silva, Jose Cleydson F.
Ferreira, Marco Aurélio
Carvalho, Thales F. M.
Silva, Fabyano F.
de A. Silveira, Sabrina
Brommonschenkel, Sergio H.
Fontes, Elizabeth P. B.
author_facet Silva, Jose Cleydson F.
Ferreira, Marco Aurélio
Carvalho, Thales F. M.
Silva, Fabyano F.
de A. Silveira, Sabrina
Brommonschenkel, Sergio H.
Fontes, Elizabeth P. B.
author_sort Silva, Jose Cleydson F.
collection PubMed
description Cell surface receptors play essential roles in perceiving and processing external and internal signals at the cell surface of plants and animals. The receptor-like protein kinases (RLK) and receptor-like proteins (RLPs), two major classes of proteins with membrane receptor configuration, play a crucial role in plant development and disease defense. Although RLPs and RLKs share a similar single-pass transmembrane configuration, RLPs harbor short divergent C-terminal regions instead of the conserved kinase domain of RLKs. This RLP receptor structural design precludes sequence comparison algorithms from being used for high-throughput predictions of the RLP family in plant genomes, as has been extensively performed for RLK superfamily predictions. Here, we developed the RLPredictiOme, implemented with machine learning models in combination with Bayesian inference, capable of predicting RLP subfamilies in plant genomes. The ML models were simultaneously trained using six types of features, along with three stages to distinguish RLPs from non-RLPs (NRLPs), RLPs from RLKs, and classify new subfamilies of RLPs in plants. The ML models achieved high accuracy, precision, sensitivity, and specificity for predicting RLPs with relatively high probability ranging from 0.79 to 0.99. The prediction of the method was assessed with three datasets, two of which contained leucine-rich repeats (LRR)-RLPs from Arabidopsis and rice, and the last one consisted of the complete set of previously described Arabidopsis RLPs. In these validation tests, more than 90% of known RLPs were correctly predicted via RLPredictiOme. In addition to predicting previously characterized RLPs, RLPredictiOme uncovered new RLP subfamilies in the Arabidopsis genome. These include probable lipid transfer (PLT)-RLP, plastocyanin-like-RLP, ring finger-RLP, glycosyl-hydrolase-RLP, and glycerophosphoryldiester phosphodiesterase (GDPD, GDPDL)-RLP subfamilies, yet to be characterized. Compared to the only Arabidopsis GDPDL-RLK, molecular evolution studies confirmed that the ectodomain of GDPDL-RLPs might have undergone a purifying selection with a predominance of synonymous substitutions. Expression analyses revealed that predicted GDPGL-RLPs display a basal expression level and respond to developmental and biotic signals. The results of these biological assays indicate that these subfamily members have maintained functional domains during evolution and may play relevant roles in development and plant defense. Therefore, RLPredictiOme provides a framework for genome-wide surveys of the RLP superfamily as a foundation to rationalize functional studies of surface receptors and their relationships with different biological processes.
format Online
Article
Text
id pubmed-9603095
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-96030952022-10-27 RLPredictiOme, a Machine Learning-Derived Method for High-Throughput Prediction of Plant Receptor-like Proteins, Reveals Novel Classes of Transmembrane Receptors Silva, Jose Cleydson F. Ferreira, Marco Aurélio Carvalho, Thales F. M. Silva, Fabyano F. de A. Silveira, Sabrina Brommonschenkel, Sergio H. Fontes, Elizabeth P. B. Int J Mol Sci Article Cell surface receptors play essential roles in perceiving and processing external and internal signals at the cell surface of plants and animals. The receptor-like protein kinases (RLK) and receptor-like proteins (RLPs), two major classes of proteins with membrane receptor configuration, play a crucial role in plant development and disease defense. Although RLPs and RLKs share a similar single-pass transmembrane configuration, RLPs harbor short divergent C-terminal regions instead of the conserved kinase domain of RLKs. This RLP receptor structural design precludes sequence comparison algorithms from being used for high-throughput predictions of the RLP family in plant genomes, as has been extensively performed for RLK superfamily predictions. Here, we developed the RLPredictiOme, implemented with machine learning models in combination with Bayesian inference, capable of predicting RLP subfamilies in plant genomes. The ML models were simultaneously trained using six types of features, along with three stages to distinguish RLPs from non-RLPs (NRLPs), RLPs from RLKs, and classify new subfamilies of RLPs in plants. The ML models achieved high accuracy, precision, sensitivity, and specificity for predicting RLPs with relatively high probability ranging from 0.79 to 0.99. The prediction of the method was assessed with three datasets, two of which contained leucine-rich repeats (LRR)-RLPs from Arabidopsis and rice, and the last one consisted of the complete set of previously described Arabidopsis RLPs. In these validation tests, more than 90% of known RLPs were correctly predicted via RLPredictiOme. In addition to predicting previously characterized RLPs, RLPredictiOme uncovered new RLP subfamilies in the Arabidopsis genome. These include probable lipid transfer (PLT)-RLP, plastocyanin-like-RLP, ring finger-RLP, glycosyl-hydrolase-RLP, and glycerophosphoryldiester phosphodiesterase (GDPD, GDPDL)-RLP subfamilies, yet to be characterized. Compared to the only Arabidopsis GDPDL-RLK, molecular evolution studies confirmed that the ectodomain of GDPDL-RLPs might have undergone a purifying selection with a predominance of synonymous substitutions. Expression analyses revealed that predicted GDPGL-RLPs display a basal expression level and respond to developmental and biotic signals. The results of these biological assays indicate that these subfamily members have maintained functional domains during evolution and may play relevant roles in development and plant defense. Therefore, RLPredictiOme provides a framework for genome-wide surveys of the RLP superfamily as a foundation to rationalize functional studies of surface receptors and their relationships with different biological processes. MDPI 2022-10-12 /pmc/articles/PMC9603095/ /pubmed/36293031 http://dx.doi.org/10.3390/ijms232012176 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Silva, Jose Cleydson F.
Ferreira, Marco Aurélio
Carvalho, Thales F. M.
Silva, Fabyano F.
de A. Silveira, Sabrina
Brommonschenkel, Sergio H.
Fontes, Elizabeth P. B.
RLPredictiOme, a Machine Learning-Derived Method for High-Throughput Prediction of Plant Receptor-like Proteins, Reveals Novel Classes of Transmembrane Receptors
title RLPredictiOme, a Machine Learning-Derived Method for High-Throughput Prediction of Plant Receptor-like Proteins, Reveals Novel Classes of Transmembrane Receptors
title_full RLPredictiOme, a Machine Learning-Derived Method for High-Throughput Prediction of Plant Receptor-like Proteins, Reveals Novel Classes of Transmembrane Receptors
title_fullStr RLPredictiOme, a Machine Learning-Derived Method for High-Throughput Prediction of Plant Receptor-like Proteins, Reveals Novel Classes of Transmembrane Receptors
title_full_unstemmed RLPredictiOme, a Machine Learning-Derived Method for High-Throughput Prediction of Plant Receptor-like Proteins, Reveals Novel Classes of Transmembrane Receptors
title_short RLPredictiOme, a Machine Learning-Derived Method for High-Throughput Prediction of Plant Receptor-like Proteins, Reveals Novel Classes of Transmembrane Receptors
title_sort rlpredictiome, a machine learning-derived method for high-throughput prediction of plant receptor-like proteins, reveals novel classes of transmembrane receptors
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9603095/
https://www.ncbi.nlm.nih.gov/pubmed/36293031
http://dx.doi.org/10.3390/ijms232012176
work_keys_str_mv AT silvajosecleydsonf rlpredictiomeamachinelearningderivedmethodforhighthroughputpredictionofplantreceptorlikeproteinsrevealsnovelclassesoftransmembranereceptors
AT ferreiramarcoaurelio rlpredictiomeamachinelearningderivedmethodforhighthroughputpredictionofplantreceptorlikeproteinsrevealsnovelclassesoftransmembranereceptors
AT carvalhothalesfm rlpredictiomeamachinelearningderivedmethodforhighthroughputpredictionofplantreceptorlikeproteinsrevealsnovelclassesoftransmembranereceptors
AT silvafabyanof rlpredictiomeamachinelearningderivedmethodforhighthroughputpredictionofplantreceptorlikeproteinsrevealsnovelclassesoftransmembranereceptors
AT deasilveirasabrina rlpredictiomeamachinelearningderivedmethodforhighthroughputpredictionofplantreceptorlikeproteinsrevealsnovelclassesoftransmembranereceptors
AT brommonschenkelsergioh rlpredictiomeamachinelearningderivedmethodforhighthroughputpredictionofplantreceptorlikeproteinsrevealsnovelclassesoftransmembranereceptors
AT fonteselizabethpb rlpredictiomeamachinelearningderivedmethodforhighthroughputpredictionofplantreceptorlikeproteinsrevealsnovelclassesoftransmembranereceptors