Cargando…

Simple Linear Cancer Risk Prediction Models With Novel Features Outperform Complex Approaches

The ability to accurately predict an individual's risk for cancer is critical to the implementation of precision prevention measures. Current cancer risk predictions are frequently made with simple models that use a few proven risk factors, such as the Gail model for breast cancer, which are ea...

Descripción completa

Detalles Bibliográficos
Autores principales:	Kulm, Scott, Kofman, Lior, Mezey, Jason, Elemento, Olivier
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Wolters Kluwer Health 2022
Materias:	ORIGINAL REPORTS
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8920463/ https://www.ncbi.nlm.nih.gov/pubmed/35239414 http://dx.doi.org/10.1200/CCI.21.00166

_version_	1784669133324943360
author	Kulm, Scott Kofman, Lior Mezey, Jason Elemento, Olivier
author_facet	Kulm, Scott Kofman, Lior Mezey, Jason Elemento, Olivier
author_sort	Kulm, Scott
collection	PubMed
description	The ability to accurately predict an individual's risk for cancer is critical to the implementation of precision prevention measures. Current cancer risk predictions are frequently made with simple models that use a few proven risk factors, such as the Gail model for breast cancer, which are easy to interpret, but may theoretically be less accurate than advanced machine learning (ML) models. METHODS: With the UK Biobank, a large prospective study, we developed models that predicted 13 cancer diagnoses within a 10-year time span. ML and linear models fit with all features, linear models fit with 10 features, and externally developed QCancer models, which are available to more than 4,000 general practices, were assessed. RESULTS: The average area under the receiver operator curve (AUC) of the linear models (0.722, SE = 0.015) was greater than the average AUC of the ML models (0.720, SE = 0.016) when all 931 features were used. Linear models with only 10 features generated an average AUC of 0.706 (SE 0.015), which was comparable to the complex models using all features and greater than the average AUC of the QCancer models (0.684, SE 0.021). The high performance of the 10-feature linear model may be caused by the consideration of often omitted feature types, including census records and genetic information. CONCLUSION: The high performance of the 10-feature linear models indicate that unbiased selection of diverse features, not ML models, may lead to impressively accurate predictions, possibly enabling personalized screening schedules that increase cancer survival.
format	Online Article Text
id	pubmed-8920463
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Wolters Kluwer Health
record_format	MEDLINE/PubMed
spelling	pubmed-89204632023-03-03 Simple Linear Cancer Risk Prediction Models With Novel Features Outperform Complex Approaches Kulm, Scott Kofman, Lior Mezey, Jason Elemento, Olivier JCO Clin Cancer Inform ORIGINAL REPORTS The ability to accurately predict an individual's risk for cancer is critical to the implementation of precision prevention measures. Current cancer risk predictions are frequently made with simple models that use a few proven risk factors, such as the Gail model for breast cancer, which are easy to interpret, but may theoretically be less accurate than advanced machine learning (ML) models. METHODS: With the UK Biobank, a large prospective study, we developed models that predicted 13 cancer diagnoses within a 10-year time span. ML and linear models fit with all features, linear models fit with 10 features, and externally developed QCancer models, which are available to more than 4,000 general practices, were assessed. RESULTS: The average area under the receiver operator curve (AUC) of the linear models (0.722, SE = 0.015) was greater than the average AUC of the ML models (0.720, SE = 0.016) when all 931 features were used. Linear models with only 10 features generated an average AUC of 0.706 (SE 0.015), which was comparable to the complex models using all features and greater than the average AUC of the QCancer models (0.684, SE 0.021). The high performance of the 10-feature linear model may be caused by the consideration of often omitted feature types, including census records and genetic information. CONCLUSION: The high performance of the 10-feature linear models indicate that unbiased selection of diverse features, not ML models, may lead to impressively accurate predictions, possibly enabling personalized screening schedules that increase cancer survival. Wolters Kluwer Health 2022-03-03 /pmc/articles/PMC8920463/ /pubmed/35239414 http://dx.doi.org/10.1200/CCI.21.00166 Text en © 2022 by American Society of Clinical Oncology https://creativecommons.org/licenses/by/4.0/Licensed under the Creative Commons Attribution 4.0 License: https://creativecommons.org/licenses/by/4.0/
spellingShingle	ORIGINAL REPORTS Kulm, Scott Kofman, Lior Mezey, Jason Elemento, Olivier Simple Linear Cancer Risk Prediction Models With Novel Features Outperform Complex Approaches
title	Simple Linear Cancer Risk Prediction Models With Novel Features Outperform Complex Approaches
title_full	Simple Linear Cancer Risk Prediction Models With Novel Features Outperform Complex Approaches
title_fullStr	Simple Linear Cancer Risk Prediction Models With Novel Features Outperform Complex Approaches
title_full_unstemmed	Simple Linear Cancer Risk Prediction Models With Novel Features Outperform Complex Approaches
title_short	Simple Linear Cancer Risk Prediction Models With Novel Features Outperform Complex Approaches
title_sort	simple linear cancer risk prediction models with novel features outperform complex approaches
topic	ORIGINAL REPORTS
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8920463/ https://www.ncbi.nlm.nih.gov/pubmed/35239414 http://dx.doi.org/10.1200/CCI.21.00166
work_keys_str_mv	AT kulmscott simplelinearcancerriskpredictionmodelswithnovelfeaturesoutperformcomplexapproaches AT kofmanlior simplelinearcancerriskpredictionmodelswithnovelfeaturesoutperformcomplexapproaches AT mezeyjason simplelinearcancerriskpredictionmodelswithnovelfeaturesoutperformcomplexapproaches AT elementoolivier simplelinearcancerriskpredictionmodelswithnovelfeaturesoutperformcomplexapproaches

Simple Linear Cancer Risk Prediction Models With Novel Features Outperform Complex Approaches

Ejemplares similares