Cargando…
TP53_PROF: a machine learning model to predict impact of missense mutations in TP53
Correctly identifying the true driver mutations in a patient’s tumor is a major challenge in precision oncology. Most efforts address frequent mutations, leaving medium- and low-frequency variants mostly unaddressed. For TP53, this identification is crucial for both somatic and germline mutations, w...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8921628/ https://www.ncbi.nlm.nih.gov/pubmed/35043155 http://dx.doi.org/10.1093/bib/bbab524 |
_version_ | 1784669361024270336 |
---|---|
author | Ben-Cohen, Gil Doffe, Flora Devir, Michal Leroy, Bernard Soussi, Thierry Rosenberg, Shai |
author_facet | Ben-Cohen, Gil Doffe, Flora Devir, Michal Leroy, Bernard Soussi, Thierry Rosenberg, Shai |
author_sort | Ben-Cohen, Gil |
collection | PubMed |
description | Correctly identifying the true driver mutations in a patient’s tumor is a major challenge in precision oncology. Most efforts address frequent mutations, leaving medium- and low-frequency variants mostly unaddressed. For TP53, this identification is crucial for both somatic and germline mutations, with the latter associated with the Li-Fraumeni syndrome (LFS), a multiorgan cancer predisposition. We present TP53_PROF (prediction of functionality), a gene specific machine learning model to predict the functional consequences of every possible missense mutation in TP53, integrating human cell- and yeast-based functional assays scores along with computational scores. Variants were labeled for the training set using well-defined criteria of prevalence in four cancer genomics databases. The model’s predictions provided accuracy of 96.5%. They were validated experimentally, and were compared to population data, LFS datasets, ClinVar annotations and to TCGA survival data. Very high accuracy was shown through all methods of validation. TP53_PROF allows accurate classification of TP53 missense mutations applicable for clinical practice. Our gene specific approach integrated machine learning, highly reliable features and biological knowledge, to create an unprecedented, thoroughly validated and clinically oriented classification model. This approach currently addresses TP53 mutations and will be applied in the future to other important cancer genes. |
format | Online Article Text |
id | pubmed-8921628 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-89216282022-03-15 TP53_PROF: a machine learning model to predict impact of missense mutations in TP53 Ben-Cohen, Gil Doffe, Flora Devir, Michal Leroy, Bernard Soussi, Thierry Rosenberg, Shai Brief Bioinform Problem Solving Protocol Correctly identifying the true driver mutations in a patient’s tumor is a major challenge in precision oncology. Most efforts address frequent mutations, leaving medium- and low-frequency variants mostly unaddressed. For TP53, this identification is crucial for both somatic and germline mutations, with the latter associated with the Li-Fraumeni syndrome (LFS), a multiorgan cancer predisposition. We present TP53_PROF (prediction of functionality), a gene specific machine learning model to predict the functional consequences of every possible missense mutation in TP53, integrating human cell- and yeast-based functional assays scores along with computational scores. Variants were labeled for the training set using well-defined criteria of prevalence in four cancer genomics databases. The model’s predictions provided accuracy of 96.5%. They were validated experimentally, and were compared to population data, LFS datasets, ClinVar annotations and to TCGA survival data. Very high accuracy was shown through all methods of validation. TP53_PROF allows accurate classification of TP53 missense mutations applicable for clinical practice. Our gene specific approach integrated machine learning, highly reliable features and biological knowledge, to create an unprecedented, thoroughly validated and clinically oriented classification model. This approach currently addresses TP53 mutations and will be applied in the future to other important cancer genes. Oxford University Press 2022-01-18 /pmc/articles/PMC8921628/ /pubmed/35043155 http://dx.doi.org/10.1093/bib/bbab524 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Problem Solving Protocol Ben-Cohen, Gil Doffe, Flora Devir, Michal Leroy, Bernard Soussi, Thierry Rosenberg, Shai TP53_PROF: a machine learning model to predict impact of missense mutations in TP53 |
title | TP53_PROF: a machine learning model to predict impact of missense mutations in TP53 |
title_full | TP53_PROF: a machine learning model to predict impact of missense mutations in TP53 |
title_fullStr | TP53_PROF: a machine learning model to predict impact of missense mutations in TP53 |
title_full_unstemmed | TP53_PROF: a machine learning model to predict impact of missense mutations in TP53 |
title_short | TP53_PROF: a machine learning model to predict impact of missense mutations in TP53 |
title_sort | tp53_prof: a machine learning model to predict impact of missense mutations in tp53 |
topic | Problem Solving Protocol |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8921628/ https://www.ncbi.nlm.nih.gov/pubmed/35043155 http://dx.doi.org/10.1093/bib/bbab524 |
work_keys_str_mv | AT bencohengil tp53profamachinelearningmodeltopredictimpactofmissensemutationsintp53 AT doffeflora tp53profamachinelearningmodeltopredictimpactofmissensemutationsintp53 AT devirmichal tp53profamachinelearningmodeltopredictimpactofmissensemutationsintp53 AT leroybernard tp53profamachinelearningmodeltopredictimpactofmissensemutationsintp53 AT soussithierry tp53profamachinelearningmodeltopredictimpactofmissensemutationsintp53 AT rosenbergshai tp53profamachinelearningmodeltopredictimpactofmissensemutationsintp53 |