Cargando…

mvPPT: A Highly Efficient and Sensitive Pathogenicity Prediction Tool for Missense Variants

Next-generation sequencing technologies both boost the discovery of variants in the human genome and exacerbate the challenges of pathogenic variant identification. In this study, we developed Pathogenicity Prediction Tool for missense variants (mvPPT), a highly sensitive and accurate missense varia...

Descripción completa

Detalles Bibliográficos
Autores principales: Tong, Shi-Yuan, Fan, Ke, Zhou, Zai-Wei, Liu, Lin-Yun, Zhang, Shu-Qing, Fu, Yinghui, Wang, Guang-Zhong, Zhu, Ying, Yu, Yong-Chun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10626173/
https://www.ncbi.nlm.nih.gov/pubmed/35940520
http://dx.doi.org/10.1016/j.gpb.2022.07.005
_version_ 1785131288166924288
author Tong, Shi-Yuan
Fan, Ke
Zhou, Zai-Wei
Liu, Lin-Yun
Zhang, Shu-Qing
Fu, Yinghui
Wang, Guang-Zhong
Zhu, Ying
Yu, Yong-Chun
author_facet Tong, Shi-Yuan
Fan, Ke
Zhou, Zai-Wei
Liu, Lin-Yun
Zhang, Shu-Qing
Fu, Yinghui
Wang, Guang-Zhong
Zhu, Ying
Yu, Yong-Chun
author_sort Tong, Shi-Yuan
collection PubMed
description Next-generation sequencing technologies both boost the discovery of variants in the human genome and exacerbate the challenges of pathogenic variant identification. In this study, we developed Pathogenicity Prediction Tool for missense variants (mvPPT), a highly sensitive and accurate missense variant classifier based on gradient boosting. mvPPT adopts high-confidence training sets with a wide spectrum of variant profiles, and extracts three categories of features, including scores from existing prediction tools, frequencies (allele frequencies, amino acid frequencies, and genotype frequencies), and genomic context. Compared with established predictors, mvPPT achieves superior performance in all test sets, regardless of data source. In addition, our study also provides guidance for training set and feature selection strategies, as well as reveals highly relevant features, which may further provide biological insights into variant pathogenicity. mvPPT is freely available at http://www.mvppt.club/.
format Online
Article
Text
id pubmed-10626173
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-106261732023-11-07 mvPPT: A Highly Efficient and Sensitive Pathogenicity Prediction Tool for Missense Variants Tong, Shi-Yuan Fan, Ke Zhou, Zai-Wei Liu, Lin-Yun Zhang, Shu-Qing Fu, Yinghui Wang, Guang-Zhong Zhu, Ying Yu, Yong-Chun Genomics Proteomics Bioinformatics Application Note Next-generation sequencing technologies both boost the discovery of variants in the human genome and exacerbate the challenges of pathogenic variant identification. In this study, we developed Pathogenicity Prediction Tool for missense variants (mvPPT), a highly sensitive and accurate missense variant classifier based on gradient boosting. mvPPT adopts high-confidence training sets with a wide spectrum of variant profiles, and extracts three categories of features, including scores from existing prediction tools, frequencies (allele frequencies, amino acid frequencies, and genotype frequencies), and genomic context. Compared with established predictors, mvPPT achieves superior performance in all test sets, regardless of data source. In addition, our study also provides guidance for training set and feature selection strategies, as well as reveals highly relevant features, which may further provide biological insights into variant pathogenicity. mvPPT is freely available at http://www.mvppt.club/. Elsevier 2023-04 2022-08-05 /pmc/articles/PMC10626173/ /pubmed/35940520 http://dx.doi.org/10.1016/j.gpb.2022.07.005 Text en © 2023 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Application Note
Tong, Shi-Yuan
Fan, Ke
Zhou, Zai-Wei
Liu, Lin-Yun
Zhang, Shu-Qing
Fu, Yinghui
Wang, Guang-Zhong
Zhu, Ying
Yu, Yong-Chun
mvPPT: A Highly Efficient and Sensitive Pathogenicity Prediction Tool for Missense Variants
title mvPPT: A Highly Efficient and Sensitive Pathogenicity Prediction Tool for Missense Variants
title_full mvPPT: A Highly Efficient and Sensitive Pathogenicity Prediction Tool for Missense Variants
title_fullStr mvPPT: A Highly Efficient and Sensitive Pathogenicity Prediction Tool for Missense Variants
title_full_unstemmed mvPPT: A Highly Efficient and Sensitive Pathogenicity Prediction Tool for Missense Variants
title_short mvPPT: A Highly Efficient and Sensitive Pathogenicity Prediction Tool for Missense Variants
title_sort mvppt: a highly efficient and sensitive pathogenicity prediction tool for missense variants
topic Application Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10626173/
https://www.ncbi.nlm.nih.gov/pubmed/35940520
http://dx.doi.org/10.1016/j.gpb.2022.07.005
work_keys_str_mv AT tongshiyuan mvpptahighlyefficientandsensitivepathogenicitypredictiontoolformissensevariants
AT fanke mvpptahighlyefficientandsensitivepathogenicitypredictiontoolformissensevariants
AT zhouzaiwei mvpptahighlyefficientandsensitivepathogenicitypredictiontoolformissensevariants
AT liulinyun mvpptahighlyefficientandsensitivepathogenicitypredictiontoolformissensevariants
AT zhangshuqing mvpptahighlyefficientandsensitivepathogenicitypredictiontoolformissensevariants
AT fuyinghui mvpptahighlyefficientandsensitivepathogenicitypredictiontoolformissensevariants
AT wangguangzhong mvpptahighlyefficientandsensitivepathogenicitypredictiontoolformissensevariants
AT zhuying mvpptahighlyefficientandsensitivepathogenicitypredictiontoolformissensevariants
AT yuyongchun mvpptahighlyefficientandsensitivepathogenicitypredictiontoolformissensevariants