Cargando…

PPVED: A machine learning tool for predicting the effect of single amino acid substitution on protein function in plants

Single amino acid substitution (SAAS) produces the most common variant of protein function change under physiological conditions. As the number of SAAS events in plants has increased exponentially, an effective prediction tool is required to help identify and distinguish functional SAASs from the wh...

Descripción completa

Detalles Bibliográficos
Autores principales: Gou, Xiangjian, Feng, Xuanjun, Shi, Haoran, Guo, Tingting, Xie, Rongqian, Liu, Yaxi, Wang, Qi, Li, Hongxiang, Yang, Banglie, Chen, Lixue, Lu, Yanli
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9241370/
https://www.ncbi.nlm.nih.gov/pubmed/35398963
http://dx.doi.org/10.1111/pbi.13823
_version_ 1784737791207276544
author Gou, Xiangjian
Feng, Xuanjun
Shi, Haoran
Guo, Tingting
Xie, Rongqian
Liu, Yaxi
Wang, Qi
Li, Hongxiang
Yang, Banglie
Chen, Lixue
Lu, Yanli
author_facet Gou, Xiangjian
Feng, Xuanjun
Shi, Haoran
Guo, Tingting
Xie, Rongqian
Liu, Yaxi
Wang, Qi
Li, Hongxiang
Yang, Banglie
Chen, Lixue
Lu, Yanli
author_sort Gou, Xiangjian
collection PubMed
description Single amino acid substitution (SAAS) produces the most common variant of protein function change under physiological conditions. As the number of SAAS events in plants has increased exponentially, an effective prediction tool is required to help identify and distinguish functional SAASs from the whole genome as either potentially causal traits or as variants. Here, we constructed a plant SAAS database that stores 12 865 SAASs in 6172 proteins and developed a tool called Plant Protein Variation Effect Detector (PPVED) that predicts the effect of SAASs on protein function in plants. PPVED achieved an 87% predictive accuracy when applied to plant SAASs, an accuracy that was much higher than those from six human database software: SIFT, PROVEAN, PANTHER‐PSEP, PhD‐SNP, PolyPhen‐2, and MutPred2. The predictive effect of six SAASs from three proteins in Arabidopsis and maize was validated with wet lab experiments, of which five substitution sites were accurately predicted. PPVED could facilitate the identification and characterization of genetic variants that explain observed phenotype variations in plants, contributing to solutions for challenges in functional genomics and systems biology. PPVED can be accessed under a CC‐BY (4.0) license via http://www.ppved.org.cn.
format Online
Article
Text
id pubmed-9241370
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-92413702022-07-01 PPVED: A machine learning tool for predicting the effect of single amino acid substitution on protein function in plants Gou, Xiangjian Feng, Xuanjun Shi, Haoran Guo, Tingting Xie, Rongqian Liu, Yaxi Wang, Qi Li, Hongxiang Yang, Banglie Chen, Lixue Lu, Yanli Plant Biotechnol J Research Articles Single amino acid substitution (SAAS) produces the most common variant of protein function change under physiological conditions. As the number of SAAS events in plants has increased exponentially, an effective prediction tool is required to help identify and distinguish functional SAASs from the whole genome as either potentially causal traits or as variants. Here, we constructed a plant SAAS database that stores 12 865 SAASs in 6172 proteins and developed a tool called Plant Protein Variation Effect Detector (PPVED) that predicts the effect of SAASs on protein function in plants. PPVED achieved an 87% predictive accuracy when applied to plant SAASs, an accuracy that was much higher than those from six human database software: SIFT, PROVEAN, PANTHER‐PSEP, PhD‐SNP, PolyPhen‐2, and MutPred2. The predictive effect of six SAASs from three proteins in Arabidopsis and maize was validated with wet lab experiments, of which five substitution sites were accurately predicted. PPVED could facilitate the identification and characterization of genetic variants that explain observed phenotype variations in plants, contributing to solutions for challenges in functional genomics and systems biology. PPVED can be accessed under a CC‐BY (4.0) license via http://www.ppved.org.cn. John Wiley and Sons Inc. 2022-04-27 2022-07 /pmc/articles/PMC9241370/ /pubmed/35398963 http://dx.doi.org/10.1111/pbi.13823 Text en © 2022 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd. https://creativecommons.org/licenses/by/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Articles
Gou, Xiangjian
Feng, Xuanjun
Shi, Haoran
Guo, Tingting
Xie, Rongqian
Liu, Yaxi
Wang, Qi
Li, Hongxiang
Yang, Banglie
Chen, Lixue
Lu, Yanli
PPVED: A machine learning tool for predicting the effect of single amino acid substitution on protein function in plants
title PPVED: A machine learning tool for predicting the effect of single amino acid substitution on protein function in plants
title_full PPVED: A machine learning tool for predicting the effect of single amino acid substitution on protein function in plants
title_fullStr PPVED: A machine learning tool for predicting the effect of single amino acid substitution on protein function in plants
title_full_unstemmed PPVED: A machine learning tool for predicting the effect of single amino acid substitution on protein function in plants
title_short PPVED: A machine learning tool for predicting the effect of single amino acid substitution on protein function in plants
title_sort ppved: a machine learning tool for predicting the effect of single amino acid substitution on protein function in plants
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9241370/
https://www.ncbi.nlm.nih.gov/pubmed/35398963
http://dx.doi.org/10.1111/pbi.13823
work_keys_str_mv AT gouxiangjian ppvedamachinelearningtoolforpredictingtheeffectofsingleaminoacidsubstitutiononproteinfunctioninplants
AT fengxuanjun ppvedamachinelearningtoolforpredictingtheeffectofsingleaminoacidsubstitutiononproteinfunctioninplants
AT shihaoran ppvedamachinelearningtoolforpredictingtheeffectofsingleaminoacidsubstitutiononproteinfunctioninplants
AT guotingting ppvedamachinelearningtoolforpredictingtheeffectofsingleaminoacidsubstitutiononproteinfunctioninplants
AT xierongqian ppvedamachinelearningtoolforpredictingtheeffectofsingleaminoacidsubstitutiononproteinfunctioninplants
AT liuyaxi ppvedamachinelearningtoolforpredictingtheeffectofsingleaminoacidsubstitutiononproteinfunctioninplants
AT wangqi ppvedamachinelearningtoolforpredictingtheeffectofsingleaminoacidsubstitutiononproteinfunctioninplants
AT lihongxiang ppvedamachinelearningtoolforpredictingtheeffectofsingleaminoacidsubstitutiononproteinfunctioninplants
AT yangbanglie ppvedamachinelearningtoolforpredictingtheeffectofsingleaminoacidsubstitutiononproteinfunctioninplants
AT chenlixue ppvedamachinelearningtoolforpredictingtheeffectofsingleaminoacidsubstitutiononproteinfunctioninplants
AT luyanli ppvedamachinelearningtoolforpredictingtheeffectofsingleaminoacidsubstitutiononproteinfunctioninplants