Cargando…

Prediction of Deleterious Non-Synonymous SNPs Based on Protein Interaction Network and Hybrid Properties

Non-synonymous SNPs (nsSNPs), also known as Single Amino acid Polymorphisms (SAPs) account for the majority of human inherited diseases. It is important to distinguish the deleterious SAPs from neutral ones. Most traditional computational methods to classify SAPs are based on sequential or structura...

Descripción completa

Detalles Bibliográficos
Autores principales: Huang, Tao, Wang, Ping, Ye, Zhi-Qiang, Xu, Heng, He, Zhisong, Feng, Kai-Yan, Hu, LeLe, Cui, WeiRen, Wang, Kai, Dong, Xiao, Xie, Lu, Kong, Xiangyin, Cai, Yu-Dong, Li, Yixue
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2912763/
https://www.ncbi.nlm.nih.gov/pubmed/20689580
http://dx.doi.org/10.1371/journal.pone.0011900
_version_ 1782184614645727232
author Huang, Tao
Wang, Ping
Ye, Zhi-Qiang
Xu, Heng
He, Zhisong
Feng, Kai-Yan
Hu, LeLe
Cui, WeiRen
Wang, Kai
Dong, Xiao
Xie, Lu
Kong, Xiangyin
Cai, Yu-Dong
Li, Yixue
author_facet Huang, Tao
Wang, Ping
Ye, Zhi-Qiang
Xu, Heng
He, Zhisong
Feng, Kai-Yan
Hu, LeLe
Cui, WeiRen
Wang, Kai
Dong, Xiao
Xie, Lu
Kong, Xiangyin
Cai, Yu-Dong
Li, Yixue
author_sort Huang, Tao
collection PubMed
description Non-synonymous SNPs (nsSNPs), also known as Single Amino acid Polymorphisms (SAPs) account for the majority of human inherited diseases. It is important to distinguish the deleterious SAPs from neutral ones. Most traditional computational methods to classify SAPs are based on sequential or structural features. However, these features cannot fully explain the association between a SAP and the observed pathophysiological phenotype. We believe the better rationale for deleterious SAP prediction should be: If a SAP lies in the protein with important functions and it can change the protein sequence and structure severely, it is more likely related to disease. So we established a method to predict deleterious SAPs based on both protein interaction network and traditional hybrid properties. Each SAP is represented by 472 features that include sequential features, structural features and network features. Maximum Relevance Minimum Redundancy (mRMR) method and Incremental Feature Selection (IFS) were applied to obtain the optimal feature set and the prediction model was Nearest Neighbor Algorithm (NNA). In jackknife cross-validation, 83.27% of SAPs were correctly predicted when the optimized 263 features were used. The optimized predictor with 263 features was also tested in an independent dataset and the accuracy was still 80.00%. In contrast, SIFT, a widely used predictor of deleterious SAPs based on sequential features, has a prediction accuracy of 71.05% on the same dataset. In our study, network features were found to be most important for accurate prediction and can significantly improve the prediction performance. Our results suggest that the protein interaction context could provide important clues to help better illustrate SAP's functional association. This research will facilitate the post genome-wide association studies.
format Text
id pubmed-2912763
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-29127632010-08-04 Prediction of Deleterious Non-Synonymous SNPs Based on Protein Interaction Network and Hybrid Properties Huang, Tao Wang, Ping Ye, Zhi-Qiang Xu, Heng He, Zhisong Feng, Kai-Yan Hu, LeLe Cui, WeiRen Wang, Kai Dong, Xiao Xie, Lu Kong, Xiangyin Cai, Yu-Dong Li, Yixue PLoS One Research Article Non-synonymous SNPs (nsSNPs), also known as Single Amino acid Polymorphisms (SAPs) account for the majority of human inherited diseases. It is important to distinguish the deleterious SAPs from neutral ones. Most traditional computational methods to classify SAPs are based on sequential or structural features. However, these features cannot fully explain the association between a SAP and the observed pathophysiological phenotype. We believe the better rationale for deleterious SAP prediction should be: If a SAP lies in the protein with important functions and it can change the protein sequence and structure severely, it is more likely related to disease. So we established a method to predict deleterious SAPs based on both protein interaction network and traditional hybrid properties. Each SAP is represented by 472 features that include sequential features, structural features and network features. Maximum Relevance Minimum Redundancy (mRMR) method and Incremental Feature Selection (IFS) were applied to obtain the optimal feature set and the prediction model was Nearest Neighbor Algorithm (NNA). In jackknife cross-validation, 83.27% of SAPs were correctly predicted when the optimized 263 features were used. The optimized predictor with 263 features was also tested in an independent dataset and the accuracy was still 80.00%. In contrast, SIFT, a widely used predictor of deleterious SAPs based on sequential features, has a prediction accuracy of 71.05% on the same dataset. In our study, network features were found to be most important for accurate prediction and can significantly improve the prediction performance. Our results suggest that the protein interaction context could provide important clues to help better illustrate SAP's functional association. This research will facilitate the post genome-wide association studies. Public Library of Science 2010-07-30 /pmc/articles/PMC2912763/ /pubmed/20689580 http://dx.doi.org/10.1371/journal.pone.0011900 Text en Huang et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Huang, Tao
Wang, Ping
Ye, Zhi-Qiang
Xu, Heng
He, Zhisong
Feng, Kai-Yan
Hu, LeLe
Cui, WeiRen
Wang, Kai
Dong, Xiao
Xie, Lu
Kong, Xiangyin
Cai, Yu-Dong
Li, Yixue
Prediction of Deleterious Non-Synonymous SNPs Based on Protein Interaction Network and Hybrid Properties
title Prediction of Deleterious Non-Synonymous SNPs Based on Protein Interaction Network and Hybrid Properties
title_full Prediction of Deleterious Non-Synonymous SNPs Based on Protein Interaction Network and Hybrid Properties
title_fullStr Prediction of Deleterious Non-Synonymous SNPs Based on Protein Interaction Network and Hybrid Properties
title_full_unstemmed Prediction of Deleterious Non-Synonymous SNPs Based on Protein Interaction Network and Hybrid Properties
title_short Prediction of Deleterious Non-Synonymous SNPs Based on Protein Interaction Network and Hybrid Properties
title_sort prediction of deleterious non-synonymous snps based on protein interaction network and hybrid properties
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2912763/
https://www.ncbi.nlm.nih.gov/pubmed/20689580
http://dx.doi.org/10.1371/journal.pone.0011900
work_keys_str_mv AT huangtao predictionofdeleteriousnonsynonymoussnpsbasedonproteininteractionnetworkandhybridproperties
AT wangping predictionofdeleteriousnonsynonymoussnpsbasedonproteininteractionnetworkandhybridproperties
AT yezhiqiang predictionofdeleteriousnonsynonymoussnpsbasedonproteininteractionnetworkandhybridproperties
AT xuheng predictionofdeleteriousnonsynonymoussnpsbasedonproteininteractionnetworkandhybridproperties
AT hezhisong predictionofdeleteriousnonsynonymoussnpsbasedonproteininteractionnetworkandhybridproperties
AT fengkaiyan predictionofdeleteriousnonsynonymoussnpsbasedonproteininteractionnetworkandhybridproperties
AT hulele predictionofdeleteriousnonsynonymoussnpsbasedonproteininteractionnetworkandhybridproperties
AT cuiweiren predictionofdeleteriousnonsynonymoussnpsbasedonproteininteractionnetworkandhybridproperties
AT wangkai predictionofdeleteriousnonsynonymoussnpsbasedonproteininteractionnetworkandhybridproperties
AT dongxiao predictionofdeleteriousnonsynonymoussnpsbasedonproteininteractionnetworkandhybridproperties
AT xielu predictionofdeleteriousnonsynonymoussnpsbasedonproteininteractionnetworkandhybridproperties
AT kongxiangyin predictionofdeleteriousnonsynonymoussnpsbasedonproteininteractionnetworkandhybridproperties
AT caiyudong predictionofdeleteriousnonsynonymoussnpsbasedonproteininteractionnetworkandhybridproperties
AT liyixue predictionofdeleteriousnonsynonymoussnpsbasedonproteininteractionnetworkandhybridproperties