Cargando…
Towards a data-driven system for personalized cervical cancer risk stratification
Mass-screening programs for cervical cancer prevention in the Nordic countries have been effective in reducing cancer incidence and mortality at the population level. Women who have been regularly diagnosed with normal screening exams represent a sub-population with a low risk of disease and distinc...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9287371/ https://www.ncbi.nlm.nih.gov/pubmed/35840652 http://dx.doi.org/10.1038/s41598-022-16361-6 |
_version_ | 1784748238572617728 |
---|---|
author | Langberg, Geir Severin R. E. Nygård, Jan F. Gogineni, Vinay Chakravarthi Nygård, Mari Grasmair, Markus Naumova, Valeriya |
author_facet | Langberg, Geir Severin R. E. Nygård, Jan F. Gogineni, Vinay Chakravarthi Nygård, Mari Grasmair, Markus Naumova, Valeriya |
author_sort | Langberg, Geir Severin R. E. |
collection | PubMed |
description | Mass-screening programs for cervical cancer prevention in the Nordic countries have been effective in reducing cancer incidence and mortality at the population level. Women who have been regularly diagnosed with normal screening exams represent a sub-population with a low risk of disease and distinctive screening strategies which avoid over-screening while identifying those with high-grade lesions are needed to improve the existing one-size-fits-all approach. Machine learning methods for more personalized cervical cancer risk estimation may be of great utility to screening programs shifting to more targeted screening. However, deriving personalized risk prediction models is challenging as effective screening has made cervical cancer rare and the exam results are strongly skewed towards normal. Moreover, changes in female lifestyle and screening habits over time can cause a non-stationary data distribution. In this paper, we treat cervical cancer risk prediction as a longitudinal forecasting problem. We define risk estimators by extending existing frameworks developed on cervical cancer screening data to incremental learning for longitudinal risk predictions and compare these estimators to machine learning methods popular in biomedical applications. As input to the prediction models, we utilize all the available data from the individual screening histories.Using data from the Cancer Registry of Norway, we find in numerical experiments that the models are strongly biased towards normal results due to imbalanced data. To identify females at risk of cancer development, we adapt an imbalanced classification strategy to non-stationary data. Using this strategy, we estimate the absolute risk from longitudinal model predictions and a hold-out set of screening data. Comparing absolute risk curves demonstrate that prediction models can closely reflect the absolute risk observed in the hold-out set. Such models have great potential for improving cervical cancer risk stratification for more personalized screening recommendations. |
format | Online Article Text |
id | pubmed-9287371 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-92873712022-07-17 Towards a data-driven system for personalized cervical cancer risk stratification Langberg, Geir Severin R. E. Nygård, Jan F. Gogineni, Vinay Chakravarthi Nygård, Mari Grasmair, Markus Naumova, Valeriya Sci Rep Article Mass-screening programs for cervical cancer prevention in the Nordic countries have been effective in reducing cancer incidence and mortality at the population level. Women who have been regularly diagnosed with normal screening exams represent a sub-population with a low risk of disease and distinctive screening strategies which avoid over-screening while identifying those with high-grade lesions are needed to improve the existing one-size-fits-all approach. Machine learning methods for more personalized cervical cancer risk estimation may be of great utility to screening programs shifting to more targeted screening. However, deriving personalized risk prediction models is challenging as effective screening has made cervical cancer rare and the exam results are strongly skewed towards normal. Moreover, changes in female lifestyle and screening habits over time can cause a non-stationary data distribution. In this paper, we treat cervical cancer risk prediction as a longitudinal forecasting problem. We define risk estimators by extending existing frameworks developed on cervical cancer screening data to incremental learning for longitudinal risk predictions and compare these estimators to machine learning methods popular in biomedical applications. As input to the prediction models, we utilize all the available data from the individual screening histories.Using data from the Cancer Registry of Norway, we find in numerical experiments that the models are strongly biased towards normal results due to imbalanced data. To identify females at risk of cancer development, we adapt an imbalanced classification strategy to non-stationary data. Using this strategy, we estimate the absolute risk from longitudinal model predictions and a hold-out set of screening data. Comparing absolute risk curves demonstrate that prediction models can closely reflect the absolute risk observed in the hold-out set. Such models have great potential for improving cervical cancer risk stratification for more personalized screening recommendations. Nature Publishing Group UK 2022-07-15 /pmc/articles/PMC9287371/ /pubmed/35840652 http://dx.doi.org/10.1038/s41598-022-16361-6 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Langberg, Geir Severin R. E. Nygård, Jan F. Gogineni, Vinay Chakravarthi Nygård, Mari Grasmair, Markus Naumova, Valeriya Towards a data-driven system for personalized cervical cancer risk stratification |
title | Towards a data-driven system for personalized cervical cancer risk stratification |
title_full | Towards a data-driven system for personalized cervical cancer risk stratification |
title_fullStr | Towards a data-driven system for personalized cervical cancer risk stratification |
title_full_unstemmed | Towards a data-driven system for personalized cervical cancer risk stratification |
title_short | Towards a data-driven system for personalized cervical cancer risk stratification |
title_sort | towards a data-driven system for personalized cervical cancer risk stratification |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9287371/ https://www.ncbi.nlm.nih.gov/pubmed/35840652 http://dx.doi.org/10.1038/s41598-022-16361-6 |
work_keys_str_mv | AT langberggeirseverinre towardsadatadrivensystemforpersonalizedcervicalcancerriskstratification AT nygardjanf towardsadatadrivensystemforpersonalizedcervicalcancerriskstratification AT goginenivinaychakravarthi towardsadatadrivensystemforpersonalizedcervicalcancerriskstratification AT nygardmari towardsadatadrivensystemforpersonalizedcervicalcancerriskstratification AT grasmairmarkus towardsadatadrivensystemforpersonalizedcervicalcancerriskstratification AT naumovavaleriya towardsadatadrivensystemforpersonalizedcervicalcancerriskstratification |