Cargando…

Towards a data-driven system for personalized cervical cancer risk stratification

Mass-screening programs for cervical cancer prevention in the Nordic countries have been effective in reducing cancer incidence and mortality at the population level. Women who have been regularly diagnosed with normal screening exams represent a sub-population with a low risk of disease and distinc...

Descripción completa

Detalles Bibliográficos
Autores principales: Langberg, Geir Severin R. E., Nygård, Jan F., Gogineni, Vinay Chakravarthi, Nygård, Mari, Grasmair, Markus, Naumova, Valeriya
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9287371/
https://www.ncbi.nlm.nih.gov/pubmed/35840652
http://dx.doi.org/10.1038/s41598-022-16361-6
_version_ 1784748238572617728
author Langberg, Geir Severin R. E.
Nygård, Jan F.
Gogineni, Vinay Chakravarthi
Nygård, Mari
Grasmair, Markus
Naumova, Valeriya
author_facet Langberg, Geir Severin R. E.
Nygård, Jan F.
Gogineni, Vinay Chakravarthi
Nygård, Mari
Grasmair, Markus
Naumova, Valeriya
author_sort Langberg, Geir Severin R. E.
collection PubMed
description Mass-screening programs for cervical cancer prevention in the Nordic countries have been effective in reducing cancer incidence and mortality at the population level. Women who have been regularly diagnosed with normal screening exams represent a sub-population with a low risk of disease and distinctive screening strategies which avoid over-screening while identifying those with high-grade lesions are needed to improve the existing one-size-fits-all approach. Machine learning methods for more personalized cervical cancer risk estimation may be of great utility to screening programs shifting to more targeted screening. However, deriving personalized risk prediction models is challenging as effective screening has made cervical cancer rare and the exam results are strongly skewed towards normal. Moreover, changes in female lifestyle and screening habits over time can cause a non-stationary data distribution. In this paper, we treat cervical cancer risk prediction as a longitudinal forecasting problem. We define risk estimators by extending existing frameworks developed on cervical cancer screening data to incremental learning for longitudinal risk predictions and compare these estimators to machine learning methods popular in biomedical applications. As input to the prediction models, we utilize all the available data from the individual screening histories.Using data from the Cancer Registry of Norway, we find in numerical experiments that the models are strongly biased towards normal results due to imbalanced data. To identify females at risk of cancer development, we adapt an imbalanced classification strategy to non-stationary data. Using this strategy, we estimate the absolute risk from longitudinal model predictions and a hold-out set of screening data. Comparing absolute risk curves demonstrate that prediction models can closely reflect the absolute risk observed in the hold-out set. Such models have great potential for improving cervical cancer risk stratification for more personalized screening recommendations.
format Online
Article
Text
id pubmed-9287371
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-92873712022-07-17 Towards a data-driven system for personalized cervical cancer risk stratification Langberg, Geir Severin R. E. Nygård, Jan F. Gogineni, Vinay Chakravarthi Nygård, Mari Grasmair, Markus Naumova, Valeriya Sci Rep Article Mass-screening programs for cervical cancer prevention in the Nordic countries have been effective in reducing cancer incidence and mortality at the population level. Women who have been regularly diagnosed with normal screening exams represent a sub-population with a low risk of disease and distinctive screening strategies which avoid over-screening while identifying those with high-grade lesions are needed to improve the existing one-size-fits-all approach. Machine learning methods for more personalized cervical cancer risk estimation may be of great utility to screening programs shifting to more targeted screening. However, deriving personalized risk prediction models is challenging as effective screening has made cervical cancer rare and the exam results are strongly skewed towards normal. Moreover, changes in female lifestyle and screening habits over time can cause a non-stationary data distribution. In this paper, we treat cervical cancer risk prediction as a longitudinal forecasting problem. We define risk estimators by extending existing frameworks developed on cervical cancer screening data to incremental learning for longitudinal risk predictions and compare these estimators to machine learning methods popular in biomedical applications. As input to the prediction models, we utilize all the available data from the individual screening histories.Using data from the Cancer Registry of Norway, we find in numerical experiments that the models are strongly biased towards normal results due to imbalanced data. To identify females at risk of cancer development, we adapt an imbalanced classification strategy to non-stationary data. Using this strategy, we estimate the absolute risk from longitudinal model predictions and a hold-out set of screening data. Comparing absolute risk curves demonstrate that prediction models can closely reflect the absolute risk observed in the hold-out set. Such models have great potential for improving cervical cancer risk stratification for more personalized screening recommendations. Nature Publishing Group UK 2022-07-15 /pmc/articles/PMC9287371/ /pubmed/35840652 http://dx.doi.org/10.1038/s41598-022-16361-6 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Langberg, Geir Severin R. E.
Nygård, Jan F.
Gogineni, Vinay Chakravarthi
Nygård, Mari
Grasmair, Markus
Naumova, Valeriya
Towards a data-driven system for personalized cervical cancer risk stratification
title Towards a data-driven system for personalized cervical cancer risk stratification
title_full Towards a data-driven system for personalized cervical cancer risk stratification
title_fullStr Towards a data-driven system for personalized cervical cancer risk stratification
title_full_unstemmed Towards a data-driven system for personalized cervical cancer risk stratification
title_short Towards a data-driven system for personalized cervical cancer risk stratification
title_sort towards a data-driven system for personalized cervical cancer risk stratification
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9287371/
https://www.ncbi.nlm.nih.gov/pubmed/35840652
http://dx.doi.org/10.1038/s41598-022-16361-6
work_keys_str_mv AT langberggeirseverinre towardsadatadrivensystemforpersonalizedcervicalcancerriskstratification
AT nygardjanf towardsadatadrivensystemforpersonalizedcervicalcancerriskstratification
AT goginenivinaychakravarthi towardsadatadrivensystemforpersonalizedcervicalcancerriskstratification
AT nygardmari towardsadatadrivensystemforpersonalizedcervicalcancerriskstratification
AT grasmairmarkus towardsadatadrivensystemforpersonalizedcervicalcancerriskstratification
AT naumovavaleriya towardsadatadrivensystemforpersonalizedcervicalcancerriskstratification