Cargando…

Use of Virus Genotypes in Machine Learning Diagnostic Prediction Models for Cervical Cancer in Women With High-Risk Human Papillomavirus Infection

IMPORTANCE: High-risk human papillomavirus (hrHPV) is recognized as an etiologic agent for cervical cancer, and hrHPV DNA testing is recommended as the preferred method of cervical cancer screening in recent World Health Organization guidelines. Cervical cancer prediction models may be useful for sc...

Descripción completa

Detalles Bibliográficos
Autores principales: Xiao, Ting, Wang, Chunhua, Yang, Mei, Yang, Jun, Xu, Xiaohan, Shen, Liang, Yang, Zhou, Xing, Hui, Ou, Chun-Quan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Medical Association 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10398410/
https://www.ncbi.nlm.nih.gov/pubmed/37531108
http://dx.doi.org/10.1001/jamanetworkopen.2023.26890
_version_ 1785084049961779200
author Xiao, Ting
Wang, Chunhua
Yang, Mei
Yang, Jun
Xu, Xiaohan
Shen, Liang
Yang, Zhou
Xing, Hui
Ou, Chun-Quan
author_facet Xiao, Ting
Wang, Chunhua
Yang, Mei
Yang, Jun
Xu, Xiaohan
Shen, Liang
Yang, Zhou
Xing, Hui
Ou, Chun-Quan
author_sort Xiao, Ting
collection PubMed
description IMPORTANCE: High-risk human papillomavirus (hrHPV) is recognized as an etiologic agent for cervical cancer, and hrHPV DNA testing is recommended as the preferred method of cervical cancer screening in recent World Health Organization guidelines. Cervical cancer prediction models may be useful for screening and monitoring, particularly in low-resource settings with unavailable cytological and colposcopic examination results, but previous studies did not include women infected with hrHPV. OBJECTIVES: To develop and validate a cervical cancer prediction model that includes women positive for hrHPV infection and examine whether the inclusion of HPV genotypes improves the cervical cancer prediction ability. DESIGN, SETTING, AND PARTICIPANTS: This diagnostic study included diagnostic data from 314 587 women collected from 136 primary care centers in China between January 15, 2017, and February 28, 2018. The data set was separated geographically into data from 100 primary care centers in 6 districts for model development (training data set) and 36 centers in 3 districts for model validation. A total of 24 391 women identified with positive hrHPV test results in the cervical cancer screening program were included in the study. Data were analyzed from January 1, 2022, to July 14, 2022. MAIN OUTCOMES AND MEASURES: Cervical intraepithelial neoplasia grade 3 or worse (CIN3+) was the primary outcome, and cervical intraepithelial neoplasia grade 2 or worse (CIN2+) was the secondary outcome. The ability of the prediction models to discriminate CIN3+ and CIN2+ was evaluated using the area under the receiver operating characteristic curve (AUROC), sensitivity, specificity, positive likelihood ratio, and negative likelihood ratio. The calibration and clinical utility of the models were assessed using calibration plots and decision curves, respectively. RESULTS: After excluding women without screening outcomes, the study included 21 720 women (median [IQR] age, 50 [44-55] years). Of 14 553 women in the training data set, 349 (2.4%) received a diagnosis of CIN3+ and 673 (4.6%) of CIN2+. Of 7167 women in the validation set, 167 (2.3%) received a diagnosis of CIN3+ and 228 (3.2%) of CIN2+. Including HPV genotype in the model improved the AUROC by 35.9% for CIN3+ and 41.7% for CIN2+. With HPV genotype, epidemiological factors, and pelvic examination as predictors, the stacking model had an AUROC of 0.87 (95% CI, 0.84-0.90) for predicting CIN3+. The sensitivity was 80.1%, specificity was 83.4%, positive likelihood ratio was 4.83, and negative likelihood ratio was 0.24. The model for predicting CIN2+ had an AUROC of 0.85 (95% CI, 0.82-0.88), with a sensitivity of 80.4%, specificity of 81.0%, positive likelihood ratio of 4.23, and negative likelihood ratio of 0.24. The decision curve analysis indicated that the stacking model provided a superior standardized net benefit when the threshold probability for clinical decision was lower than 23% for CIN3+ and lower than 17% for CIN2+. CONCLUSIONS AND RELEVANCE: This diagnostic study found that inclusion of HPV genotypes markedly improved the ability of a stacking model to predict cervical cancer among women who tested positive for hrHPV infection. This prediction model may be an important tool for screening and monitoring cervical cancer, particularly in low-resource settings.
format Online
Article
Text
id pubmed-10398410
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher American Medical Association
record_format MEDLINE/PubMed
spelling pubmed-103984102023-08-04 Use of Virus Genotypes in Machine Learning Diagnostic Prediction Models for Cervical Cancer in Women With High-Risk Human Papillomavirus Infection Xiao, Ting Wang, Chunhua Yang, Mei Yang, Jun Xu, Xiaohan Shen, Liang Yang, Zhou Xing, Hui Ou, Chun-Quan JAMA Netw Open Original Investigation IMPORTANCE: High-risk human papillomavirus (hrHPV) is recognized as an etiologic agent for cervical cancer, and hrHPV DNA testing is recommended as the preferred method of cervical cancer screening in recent World Health Organization guidelines. Cervical cancer prediction models may be useful for screening and monitoring, particularly in low-resource settings with unavailable cytological and colposcopic examination results, but previous studies did not include women infected with hrHPV. OBJECTIVES: To develop and validate a cervical cancer prediction model that includes women positive for hrHPV infection and examine whether the inclusion of HPV genotypes improves the cervical cancer prediction ability. DESIGN, SETTING, AND PARTICIPANTS: This diagnostic study included diagnostic data from 314 587 women collected from 136 primary care centers in China between January 15, 2017, and February 28, 2018. The data set was separated geographically into data from 100 primary care centers in 6 districts for model development (training data set) and 36 centers in 3 districts for model validation. A total of 24 391 women identified with positive hrHPV test results in the cervical cancer screening program were included in the study. Data were analyzed from January 1, 2022, to July 14, 2022. MAIN OUTCOMES AND MEASURES: Cervical intraepithelial neoplasia grade 3 or worse (CIN3+) was the primary outcome, and cervical intraepithelial neoplasia grade 2 or worse (CIN2+) was the secondary outcome. The ability of the prediction models to discriminate CIN3+ and CIN2+ was evaluated using the area under the receiver operating characteristic curve (AUROC), sensitivity, specificity, positive likelihood ratio, and negative likelihood ratio. The calibration and clinical utility of the models were assessed using calibration plots and decision curves, respectively. RESULTS: After excluding women without screening outcomes, the study included 21 720 women (median [IQR] age, 50 [44-55] years). Of 14 553 women in the training data set, 349 (2.4%) received a diagnosis of CIN3+ and 673 (4.6%) of CIN2+. Of 7167 women in the validation set, 167 (2.3%) received a diagnosis of CIN3+ and 228 (3.2%) of CIN2+. Including HPV genotype in the model improved the AUROC by 35.9% for CIN3+ and 41.7% for CIN2+. With HPV genotype, epidemiological factors, and pelvic examination as predictors, the stacking model had an AUROC of 0.87 (95% CI, 0.84-0.90) for predicting CIN3+. The sensitivity was 80.1%, specificity was 83.4%, positive likelihood ratio was 4.83, and negative likelihood ratio was 0.24. The model for predicting CIN2+ had an AUROC of 0.85 (95% CI, 0.82-0.88), with a sensitivity of 80.4%, specificity of 81.0%, positive likelihood ratio of 4.23, and negative likelihood ratio of 0.24. The decision curve analysis indicated that the stacking model provided a superior standardized net benefit when the threshold probability for clinical decision was lower than 23% for CIN3+ and lower than 17% for CIN2+. CONCLUSIONS AND RELEVANCE: This diagnostic study found that inclusion of HPV genotypes markedly improved the ability of a stacking model to predict cervical cancer among women who tested positive for hrHPV infection. This prediction model may be an important tool for screening and monitoring cervical cancer, particularly in low-resource settings. American Medical Association 2023-08-02 /pmc/articles/PMC10398410/ /pubmed/37531108 http://dx.doi.org/10.1001/jamanetworkopen.2023.26890 Text en Copyright 2023 Xiao T et al. JAMA Network Open. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the CC-BY License.
spellingShingle Original Investigation
Xiao, Ting
Wang, Chunhua
Yang, Mei
Yang, Jun
Xu, Xiaohan
Shen, Liang
Yang, Zhou
Xing, Hui
Ou, Chun-Quan
Use of Virus Genotypes in Machine Learning Diagnostic Prediction Models for Cervical Cancer in Women With High-Risk Human Papillomavirus Infection
title Use of Virus Genotypes in Machine Learning Diagnostic Prediction Models for Cervical Cancer in Women With High-Risk Human Papillomavirus Infection
title_full Use of Virus Genotypes in Machine Learning Diagnostic Prediction Models for Cervical Cancer in Women With High-Risk Human Papillomavirus Infection
title_fullStr Use of Virus Genotypes in Machine Learning Diagnostic Prediction Models for Cervical Cancer in Women With High-Risk Human Papillomavirus Infection
title_full_unstemmed Use of Virus Genotypes in Machine Learning Diagnostic Prediction Models for Cervical Cancer in Women With High-Risk Human Papillomavirus Infection
title_short Use of Virus Genotypes in Machine Learning Diagnostic Prediction Models for Cervical Cancer in Women With High-Risk Human Papillomavirus Infection
title_sort use of virus genotypes in machine learning diagnostic prediction models for cervical cancer in women with high-risk human papillomavirus infection
topic Original Investigation
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10398410/
https://www.ncbi.nlm.nih.gov/pubmed/37531108
http://dx.doi.org/10.1001/jamanetworkopen.2023.26890
work_keys_str_mv AT xiaoting useofvirusgenotypesinmachinelearningdiagnosticpredictionmodelsforcervicalcancerinwomenwithhighriskhumanpapillomavirusinfection
AT wangchunhua useofvirusgenotypesinmachinelearningdiagnosticpredictionmodelsforcervicalcancerinwomenwithhighriskhumanpapillomavirusinfection
AT yangmei useofvirusgenotypesinmachinelearningdiagnosticpredictionmodelsforcervicalcancerinwomenwithhighriskhumanpapillomavirusinfection
AT yangjun useofvirusgenotypesinmachinelearningdiagnosticpredictionmodelsforcervicalcancerinwomenwithhighriskhumanpapillomavirusinfection
AT xuxiaohan useofvirusgenotypesinmachinelearningdiagnosticpredictionmodelsforcervicalcancerinwomenwithhighriskhumanpapillomavirusinfection
AT shenliang useofvirusgenotypesinmachinelearningdiagnosticpredictionmodelsforcervicalcancerinwomenwithhighriskhumanpapillomavirusinfection
AT yangzhou useofvirusgenotypesinmachinelearningdiagnosticpredictionmodelsforcervicalcancerinwomenwithhighriskhumanpapillomavirusinfection
AT xinghui useofvirusgenotypesinmachinelearningdiagnosticpredictionmodelsforcervicalcancerinwomenwithhighriskhumanpapillomavirusinfection
AT ouchunquan useofvirusgenotypesinmachinelearningdiagnosticpredictionmodelsforcervicalcancerinwomenwithhighriskhumanpapillomavirusinfection