Cargando…

PPVED: A machine learning tool for predicting the effect of single amino acid substitution on protein function in plants

Single amino acid substitution (SAAS) produces the most common variant of protein function change under physiological conditions. As the number of SAAS events in plants has increased exponentially, an effective prediction tool is required to help identify and distinguish functional SAASs from the wh...

Descripción completa

Detalles Bibliográficos
Autores principales: Gou, Xiangjian, Feng, Xuanjun, Shi, Haoran, Guo, Tingting, Xie, Rongqian, Liu, Yaxi, Wang, Qi, Li, Hongxiang, Yang, Banglie, Chen, Lixue, Lu, Yanli
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9241370/
https://www.ncbi.nlm.nih.gov/pubmed/35398963
http://dx.doi.org/10.1111/pbi.13823
Descripción
Sumario:Single amino acid substitution (SAAS) produces the most common variant of protein function change under physiological conditions. As the number of SAAS events in plants has increased exponentially, an effective prediction tool is required to help identify and distinguish functional SAASs from the whole genome as either potentially causal traits or as variants. Here, we constructed a plant SAAS database that stores 12 865 SAASs in 6172 proteins and developed a tool called Plant Protein Variation Effect Detector (PPVED) that predicts the effect of SAASs on protein function in plants. PPVED achieved an 87% predictive accuracy when applied to plant SAASs, an accuracy that was much higher than those from six human database software: SIFT, PROVEAN, PANTHER‐PSEP, PhD‐SNP, PolyPhen‐2, and MutPred2. The predictive effect of six SAASs from three proteins in Arabidopsis and maize was validated with wet lab experiments, of which five substitution sites were accurately predicted. PPVED could facilitate the identification and characterization of genetic variants that explain observed phenotype variations in plants, contributing to solutions for challenges in functional genomics and systems biology. PPVED can be accessed under a CC‐BY (4.0) license via http://www.ppved.org.cn.