Cargando…
RLPredictiOme, a Machine Learning-Derived Method for High-Throughput Prediction of Plant Receptor-like Proteins, Reveals Novel Classes of Transmembrane Receptors
Cell surface receptors play essential roles in perceiving and processing external and internal signals at the cell surface of plants and animals. The receptor-like protein kinases (RLK) and receptor-like proteins (RLPs), two major classes of proteins with membrane receptor configuration, play a cruc...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9603095/ https://www.ncbi.nlm.nih.gov/pubmed/36293031 http://dx.doi.org/10.3390/ijms232012176 |
_version_ | 1784817462883123200 |
---|---|
author | Silva, Jose Cleydson F. Ferreira, Marco Aurélio Carvalho, Thales F. M. Silva, Fabyano F. de A. Silveira, Sabrina Brommonschenkel, Sergio H. Fontes, Elizabeth P. B. |
author_facet | Silva, Jose Cleydson F. Ferreira, Marco Aurélio Carvalho, Thales F. M. Silva, Fabyano F. de A. Silveira, Sabrina Brommonschenkel, Sergio H. Fontes, Elizabeth P. B. |
author_sort | Silva, Jose Cleydson F. |
collection | PubMed |
description | Cell surface receptors play essential roles in perceiving and processing external and internal signals at the cell surface of plants and animals. The receptor-like protein kinases (RLK) and receptor-like proteins (RLPs), two major classes of proteins with membrane receptor configuration, play a crucial role in plant development and disease defense. Although RLPs and RLKs share a similar single-pass transmembrane configuration, RLPs harbor short divergent C-terminal regions instead of the conserved kinase domain of RLKs. This RLP receptor structural design precludes sequence comparison algorithms from being used for high-throughput predictions of the RLP family in plant genomes, as has been extensively performed for RLK superfamily predictions. Here, we developed the RLPredictiOme, implemented with machine learning models in combination with Bayesian inference, capable of predicting RLP subfamilies in plant genomes. The ML models were simultaneously trained using six types of features, along with three stages to distinguish RLPs from non-RLPs (NRLPs), RLPs from RLKs, and classify new subfamilies of RLPs in plants. The ML models achieved high accuracy, precision, sensitivity, and specificity for predicting RLPs with relatively high probability ranging from 0.79 to 0.99. The prediction of the method was assessed with three datasets, two of which contained leucine-rich repeats (LRR)-RLPs from Arabidopsis and rice, and the last one consisted of the complete set of previously described Arabidopsis RLPs. In these validation tests, more than 90% of known RLPs were correctly predicted via RLPredictiOme. In addition to predicting previously characterized RLPs, RLPredictiOme uncovered new RLP subfamilies in the Arabidopsis genome. These include probable lipid transfer (PLT)-RLP, plastocyanin-like-RLP, ring finger-RLP, glycosyl-hydrolase-RLP, and glycerophosphoryldiester phosphodiesterase (GDPD, GDPDL)-RLP subfamilies, yet to be characterized. Compared to the only Arabidopsis GDPDL-RLK, molecular evolution studies confirmed that the ectodomain of GDPDL-RLPs might have undergone a purifying selection with a predominance of synonymous substitutions. Expression analyses revealed that predicted GDPGL-RLPs display a basal expression level and respond to developmental and biotic signals. The results of these biological assays indicate that these subfamily members have maintained functional domains during evolution and may play relevant roles in development and plant defense. Therefore, RLPredictiOme provides a framework for genome-wide surveys of the RLP superfamily as a foundation to rationalize functional studies of surface receptors and their relationships with different biological processes. |
format | Online Article Text |
id | pubmed-9603095 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-96030952022-10-27 RLPredictiOme, a Machine Learning-Derived Method for High-Throughput Prediction of Plant Receptor-like Proteins, Reveals Novel Classes of Transmembrane Receptors Silva, Jose Cleydson F. Ferreira, Marco Aurélio Carvalho, Thales F. M. Silva, Fabyano F. de A. Silveira, Sabrina Brommonschenkel, Sergio H. Fontes, Elizabeth P. B. Int J Mol Sci Article Cell surface receptors play essential roles in perceiving and processing external and internal signals at the cell surface of plants and animals. The receptor-like protein kinases (RLK) and receptor-like proteins (RLPs), two major classes of proteins with membrane receptor configuration, play a crucial role in plant development and disease defense. Although RLPs and RLKs share a similar single-pass transmembrane configuration, RLPs harbor short divergent C-terminal regions instead of the conserved kinase domain of RLKs. This RLP receptor structural design precludes sequence comparison algorithms from being used for high-throughput predictions of the RLP family in plant genomes, as has been extensively performed for RLK superfamily predictions. Here, we developed the RLPredictiOme, implemented with machine learning models in combination with Bayesian inference, capable of predicting RLP subfamilies in plant genomes. The ML models were simultaneously trained using six types of features, along with three stages to distinguish RLPs from non-RLPs (NRLPs), RLPs from RLKs, and classify new subfamilies of RLPs in plants. The ML models achieved high accuracy, precision, sensitivity, and specificity for predicting RLPs with relatively high probability ranging from 0.79 to 0.99. The prediction of the method was assessed with three datasets, two of which contained leucine-rich repeats (LRR)-RLPs from Arabidopsis and rice, and the last one consisted of the complete set of previously described Arabidopsis RLPs. In these validation tests, more than 90% of known RLPs were correctly predicted via RLPredictiOme. In addition to predicting previously characterized RLPs, RLPredictiOme uncovered new RLP subfamilies in the Arabidopsis genome. These include probable lipid transfer (PLT)-RLP, plastocyanin-like-RLP, ring finger-RLP, glycosyl-hydrolase-RLP, and glycerophosphoryldiester phosphodiesterase (GDPD, GDPDL)-RLP subfamilies, yet to be characterized. Compared to the only Arabidopsis GDPDL-RLK, molecular evolution studies confirmed that the ectodomain of GDPDL-RLPs might have undergone a purifying selection with a predominance of synonymous substitutions. Expression analyses revealed that predicted GDPGL-RLPs display a basal expression level and respond to developmental and biotic signals. The results of these biological assays indicate that these subfamily members have maintained functional domains during evolution and may play relevant roles in development and plant defense. Therefore, RLPredictiOme provides a framework for genome-wide surveys of the RLP superfamily as a foundation to rationalize functional studies of surface receptors and their relationships with different biological processes. MDPI 2022-10-12 /pmc/articles/PMC9603095/ /pubmed/36293031 http://dx.doi.org/10.3390/ijms232012176 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Silva, Jose Cleydson F. Ferreira, Marco Aurélio Carvalho, Thales F. M. Silva, Fabyano F. de A. Silveira, Sabrina Brommonschenkel, Sergio H. Fontes, Elizabeth P. B. RLPredictiOme, a Machine Learning-Derived Method for High-Throughput Prediction of Plant Receptor-like Proteins, Reveals Novel Classes of Transmembrane Receptors |
title | RLPredictiOme, a Machine Learning-Derived Method for High-Throughput Prediction of Plant Receptor-like Proteins, Reveals Novel Classes of Transmembrane Receptors |
title_full | RLPredictiOme, a Machine Learning-Derived Method for High-Throughput Prediction of Plant Receptor-like Proteins, Reveals Novel Classes of Transmembrane Receptors |
title_fullStr | RLPredictiOme, a Machine Learning-Derived Method for High-Throughput Prediction of Plant Receptor-like Proteins, Reveals Novel Classes of Transmembrane Receptors |
title_full_unstemmed | RLPredictiOme, a Machine Learning-Derived Method for High-Throughput Prediction of Plant Receptor-like Proteins, Reveals Novel Classes of Transmembrane Receptors |
title_short | RLPredictiOme, a Machine Learning-Derived Method for High-Throughput Prediction of Plant Receptor-like Proteins, Reveals Novel Classes of Transmembrane Receptors |
title_sort | rlpredictiome, a machine learning-derived method for high-throughput prediction of plant receptor-like proteins, reveals novel classes of transmembrane receptors |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9603095/ https://www.ncbi.nlm.nih.gov/pubmed/36293031 http://dx.doi.org/10.3390/ijms232012176 |
work_keys_str_mv | AT silvajosecleydsonf rlpredictiomeamachinelearningderivedmethodforhighthroughputpredictionofplantreceptorlikeproteinsrevealsnovelclassesoftransmembranereceptors AT ferreiramarcoaurelio rlpredictiomeamachinelearningderivedmethodforhighthroughputpredictionofplantreceptorlikeproteinsrevealsnovelclassesoftransmembranereceptors AT carvalhothalesfm rlpredictiomeamachinelearningderivedmethodforhighthroughputpredictionofplantreceptorlikeproteinsrevealsnovelclassesoftransmembranereceptors AT silvafabyanof rlpredictiomeamachinelearningderivedmethodforhighthroughputpredictionofplantreceptorlikeproteinsrevealsnovelclassesoftransmembranereceptors AT deasilveirasabrina rlpredictiomeamachinelearningderivedmethodforhighthroughputpredictionofplantreceptorlikeproteinsrevealsnovelclassesoftransmembranereceptors AT brommonschenkelsergioh rlpredictiomeamachinelearningderivedmethodforhighthroughputpredictionofplantreceptorlikeproteinsrevealsnovelclassesoftransmembranereceptors AT fonteselizabethpb rlpredictiomeamachinelearningderivedmethodforhighthroughputpredictionofplantreceptorlikeproteinsrevealsnovelclassesoftransmembranereceptors |