Data mining-based model and risk prediction of colorectal cancer by using secondary health data: A systematic review


Bibliographic Details
Main Authors: Liang, Hailun; Yang, Lei; Tao, Lei; Shi, Leiyu; Yang, Wuyang; Bai, Jiawei; Zheng, Da; Wang, Ning; Ji, Jiafu
Format: Online Article Text
Language: English
Published: AME Publishing Company 2020
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7219096/
https://www.ncbi.nlm.nih.gov/pubmed/32410801
http://dx.doi.org/10.21147/j.issn.1000-9604.2020.02.11
author Liang, Hailun
Yang, Lei
Tao, Lei
Shi, Leiyu
Yang, Wuyang
Bai, Jiawei
Zheng, Da
Wang, Ning
Ji, Jiafu
collection PubMed
description OBJECTIVE: Prevention and early detection of colorectal cancer (CRC) can increase the chances of successful treatment and reduce burden. Various data mining technologies have been utilized to strengthen the early detection of CRC in primary care. Evidence synthesis on the model’s effectiveness is scant. This systematic review synthesizes studies that examine the effect of data mining on improving risk prediction of CRC. METHODS: The PRISMA framework guided the conduct of this study. We obtained papers via PubMed, Cochrane Library, EMBASE and Google Scholar. Quality appraisal was performed using Downs and Black’s quality checklist. To evaluate the performance of included models, the values of specificity and sensitivity were compared, the values of area under the curve (AUC) were plotted, and the median of overall AUC of included studies was computed. RESULTS: A total of 316 studies were reviewed for full text. Seven articles were included. Included studies implement techniques including artificial neural networks, Bayesian networks and decision trees. Six articles reported the overall model accuracy. Overall, the median AUC is 0.8243 [interquartile range (IQR): 0.8050−0.8886]. In the two articles that reported comparison results with traditional models, the data mining method performed better than the traditional models, with the best AUC improvement of 10.7%. CONCLUSIONS: The adoption of data mining technologies for CRC detection is at an early stage. The limited number of included articles and the heterogeneity of those studies imply that more rigorous research is needed to further investigate the techniques’ effects.
format Online
Article
Text
id pubmed-7219096
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher AME Publishing Company
record_format MEDLINE/PubMed
spelling pubmed-7219096 2020-05-14 Data mining-based model and risk prediction of colorectal cancer by using secondary health data: A systematic review. Chin J Cancer Res. Original Article.
AME Publishing Company 2020-04 /pmc/articles/PMC7219096/ /pubmed/32410801 http://dx.doi.org/10.21147/j.issn.1000-9604.2020.02.11 Text en Copyright © 2020 Chinese Journal of Cancer Research. All rights reserved. This work is licensed under a Creative Commons Attribution-Non Commercial-Share Alike 4.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/4.0/
title Data mining-based model and risk prediction of colorectal cancer by using secondary health data: A systematic review
topic Original Article