Cargando…

TP53_PROF: a machine learning model to predict impact of missense mutations in TP53

Correctly identifying the true driver mutations in a patient’s tumor is a major challenge in precision oncology. Most efforts address frequent mutations, leaving medium- and low-frequency variants mostly unaddressed. For TP53, this identification is crucial for both somatic and germline mutations, w...

Descripción completa

Detalles Bibliográficos
Autores principales: Ben-Cohen, Gil, Doffe, Flora, Devir, Michal, Leroy, Bernard, Soussi, Thierry, Rosenberg, Shai
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8921628/
https://www.ncbi.nlm.nih.gov/pubmed/35043155
http://dx.doi.org/10.1093/bib/bbab524
_version_ 1784669361024270336
author Ben-Cohen, Gil
Doffe, Flora
Devir, Michal
Leroy, Bernard
Soussi, Thierry
Rosenberg, Shai
author_facet Ben-Cohen, Gil
Doffe, Flora
Devir, Michal
Leroy, Bernard
Soussi, Thierry
Rosenberg, Shai
author_sort Ben-Cohen, Gil
collection PubMed
description Correctly identifying the true driver mutations in a patient’s tumor is a major challenge in precision oncology. Most efforts address frequent mutations, leaving medium- and low-frequency variants mostly unaddressed. For TP53, this identification is crucial for both somatic and germline mutations, with the latter associated with the Li-Fraumeni syndrome (LFS), a multiorgan cancer predisposition. We present TP53_PROF (prediction of functionality), a gene specific machine learning model to predict the functional consequences of every possible missense mutation in TP53, integrating human cell- and yeast-based functional assays scores along with computational scores. Variants were labeled for the training set using well-defined criteria of prevalence in four cancer genomics databases. The model’s predictions provided accuracy of 96.5%. They were validated experimentally, and were compared to population data, LFS datasets, ClinVar annotations and to TCGA survival data. Very high accuracy was shown through all methods of validation. TP53_PROF allows accurate classification of TP53 missense mutations applicable for clinical practice. Our gene specific approach integrated machine learning, highly reliable features and biological knowledge, to create an unprecedented, thoroughly validated and clinically oriented classification model. This approach currently addresses TP53 mutations and will be applied in the future to other important cancer genes.
format Online
Article
Text
id pubmed-8921628
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-89216282022-03-15 TP53_PROF: a machine learning model to predict impact of missense mutations in TP53 Ben-Cohen, Gil Doffe, Flora Devir, Michal Leroy, Bernard Soussi, Thierry Rosenberg, Shai Brief Bioinform Problem Solving Protocol Correctly identifying the true driver mutations in a patient’s tumor is a major challenge in precision oncology. Most efforts address frequent mutations, leaving medium- and low-frequency variants mostly unaddressed. For TP53, this identification is crucial for both somatic and germline mutations, with the latter associated with the Li-Fraumeni syndrome (LFS), a multiorgan cancer predisposition. We present TP53_PROF (prediction of functionality), a gene specific machine learning model to predict the functional consequences of every possible missense mutation in TP53, integrating human cell- and yeast-based functional assays scores along with computational scores. Variants were labeled for the training set using well-defined criteria of prevalence in four cancer genomics databases. The model’s predictions provided accuracy of 96.5%. They were validated experimentally, and were compared to population data, LFS datasets, ClinVar annotations and to TCGA survival data. Very high accuracy was shown through all methods of validation. TP53_PROF allows accurate classification of TP53 missense mutations applicable for clinical practice. Our gene specific approach integrated machine learning, highly reliable features and biological knowledge, to create an unprecedented, thoroughly validated and clinically oriented classification model. This approach currently addresses TP53 mutations and will be applied in the future to other important cancer genes. Oxford University Press 2022-01-18 /pmc/articles/PMC8921628/ /pubmed/35043155 http://dx.doi.org/10.1093/bib/bbab524 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Problem Solving Protocol
Ben-Cohen, Gil
Doffe, Flora
Devir, Michal
Leroy, Bernard
Soussi, Thierry
Rosenberg, Shai
TP53_PROF: a machine learning model to predict impact of missense mutations in TP53
title TP53_PROF: a machine learning model to predict impact of missense mutations in TP53
title_full TP53_PROF: a machine learning model to predict impact of missense mutations in TP53
title_fullStr TP53_PROF: a machine learning model to predict impact of missense mutations in TP53
title_full_unstemmed TP53_PROF: a machine learning model to predict impact of missense mutations in TP53
title_short TP53_PROF: a machine learning model to predict impact of missense mutations in TP53
title_sort tp53_prof: a machine learning model to predict impact of missense mutations in tp53
topic Problem Solving Protocol
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8921628/
https://www.ncbi.nlm.nih.gov/pubmed/35043155
http://dx.doi.org/10.1093/bib/bbab524
work_keys_str_mv AT bencohengil tp53profamachinelearningmodeltopredictimpactofmissensemutationsintp53
AT doffeflora tp53profamachinelearningmodeltopredictimpactofmissensemutationsintp53
AT devirmichal tp53profamachinelearningmodeltopredictimpactofmissensemutationsintp53
AT leroybernard tp53profamachinelearningmodeltopredictimpactofmissensemutationsintp53
AT soussithierry tp53profamachinelearningmodeltopredictimpactofmissensemutationsintp53
AT rosenbergshai tp53profamachinelearningmodeltopredictimpactofmissensemutationsintp53