Cargando…
Data mining-based model and risk prediction of colorectal cancer by using secondary health data: A systematic review
OBJECTIVE: Prevention and early detection of colorectal cancer (CRC) can increase the chances of successful treatment and reduce burden. Various data mining technologies have been utilized to strengthen the early detection of CRC in primary care. Evidence synthesis on the model’s effectiveness is sc...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
AME Publishing Company
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7219096/ https://www.ncbi.nlm.nih.gov/pubmed/32410801 http://dx.doi.org/10.21147/j.issn.1000-9604.2020.02.11 |
_version_ | 1783532927900975104 |
---|---|
author | Liang, Hailun Yang, Lei Tao, Lei Shi, Leiyu Yang, Wuyang Bai, Jiawei Zheng, Da Wang, Ning Ji, Jiafu |
author_facet | Liang, Hailun Yang, Lei Tao, Lei Shi, Leiyu Yang, Wuyang Bai, Jiawei Zheng, Da Wang, Ning Ji, Jiafu |
author_sort | Liang, Hailun |
collection | PubMed |
description | OBJECTIVE: Prevention and early detection of colorectal cancer (CRC) can increase the chances of successful treatment and reduce burden. Various data mining technologies have been utilized to strengthen the early detection of CRC in primary care. Evidence synthesis on the model’s effectiveness is scant. This systematic review synthesizes studies that examine the effect of data mining on improving risk prediction of CRC. METHODS: The PRISMA framework guided the conduct of this study. We obtained papers via PubMed, Cochrane Library, EMBASE and Google Scholar. Quality appraisal was performed using Downs and Black’s quality checklist. To evaluate the performance of included models, the values of specificity and sensitivity were comparted, the values of area under the curve (AUC) were plotted, and the median of overall AUC of included studies was computed. RESULTS: A total of 316 studies were reviewed for full text. Seven articles were included. Included studies implement techniques including artificial neural networks, Bayesian networks and decision trees. Six articles reported the overall model accuracy. Overall, the median AUC is 0.8243 [interquartile range (IQR): 0.8050−0.8886]. In the two articles that reported comparison results with traditional models, the data mining method performed better than the traditional models, with the best AUC improvement of 10.7%. CONCLUSIONS: The adoption of data mining technologies for CRC detection is at an early stage. Limited numbers of included articles and heterogeneity of those studies implied that more rigorous research is expected to further investigate the techniques’ effects. |
format | Online Article Text |
id | pubmed-7219096 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | AME Publishing Company |
record_format | MEDLINE/PubMed |
spelling | pubmed-72190962020-05-14 Data mining-based model and risk prediction of colorectal cancer by using secondary health data: A systematic review Liang, Hailun Yang, Lei Tao, Lei Shi, Leiyu Yang, Wuyang Bai, Jiawei Zheng, Da Wang, Ning Ji, Jiafu Chin J Cancer Res Original Article OBJECTIVE: Prevention and early detection of colorectal cancer (CRC) can increase the chances of successful treatment and reduce burden. Various data mining technologies have been utilized to strengthen the early detection of CRC in primary care. Evidence synthesis on the model’s effectiveness is scant. This systematic review synthesizes studies that examine the effect of data mining on improving risk prediction of CRC. METHODS: The PRISMA framework guided the conduct of this study. We obtained papers via PubMed, Cochrane Library, EMBASE and Google Scholar. Quality appraisal was performed using Downs and Black’s quality checklist. To evaluate the performance of included models, the values of specificity and sensitivity were comparted, the values of area under the curve (AUC) were plotted, and the median of overall AUC of included studies was computed. RESULTS: A total of 316 studies were reviewed for full text. Seven articles were included. Included studies implement techniques including artificial neural networks, Bayesian networks and decision trees. Six articles reported the overall model accuracy. Overall, the median AUC is 0.8243 [interquartile range (IQR): 0.8050−0.8886]. In the two articles that reported comparison results with traditional models, the data mining method performed better than the traditional models, with the best AUC improvement of 10.7%. CONCLUSIONS: The adoption of data mining technologies for CRC detection is at an early stage. Limited numbers of included articles and heterogeneity of those studies implied that more rigorous research is expected to further investigate the techniques’ effects. AME Publishing Company 2020-04 /pmc/articles/PMC7219096/ /pubmed/32410801 http://dx.doi.org/10.21147/j.issn.1000-9604.2020.02.11 Text en Copyright © 2020 Chinese Journal of Cancer Research. All rights reserved. http://creativecommons.org/licenses/by-nc-sa/4.0/ This work is licensed under a Creative Commons Attribution-Non Commercial-Share Alike 4.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/4.0/ |
spellingShingle | Original Article Liang, Hailun Yang, Lei Tao, Lei Shi, Leiyu Yang, Wuyang Bai, Jiawei Zheng, Da Wang, Ning Ji, Jiafu Data mining-based model and risk prediction of colorectal cancer by using secondary health data: A systematic review |
title | Data mining-based model and risk prediction of colorectal cancer by using secondary health data: A systematic review |
title_full | Data mining-based model and risk prediction of colorectal cancer by using secondary health data: A systematic review |
title_fullStr | Data mining-based model and risk prediction of colorectal cancer by using secondary health data: A systematic review |
title_full_unstemmed | Data mining-based model and risk prediction of colorectal cancer by using secondary health data: A systematic review |
title_short | Data mining-based model and risk prediction of colorectal cancer by using secondary health data: A systematic review |
title_sort | data mining-based model and risk prediction of colorectal cancer by using secondary health data: a systematic review |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7219096/ https://www.ncbi.nlm.nih.gov/pubmed/32410801 http://dx.doi.org/10.21147/j.issn.1000-9604.2020.02.11 |
work_keys_str_mv | AT lianghailun dataminingbasedmodelandriskpredictionofcolorectalcancerbyusingsecondaryhealthdataasystematicreview AT yanglei dataminingbasedmodelandriskpredictionofcolorectalcancerbyusingsecondaryhealthdataasystematicreview AT taolei dataminingbasedmodelandriskpredictionofcolorectalcancerbyusingsecondaryhealthdataasystematicreview AT shileiyu dataminingbasedmodelandriskpredictionofcolorectalcancerbyusingsecondaryhealthdataasystematicreview AT yangwuyang dataminingbasedmodelandriskpredictionofcolorectalcancerbyusingsecondaryhealthdataasystematicreview AT baijiawei dataminingbasedmodelandriskpredictionofcolorectalcancerbyusingsecondaryhealthdataasystematicreview AT zhengda dataminingbasedmodelandriskpredictionofcolorectalcancerbyusingsecondaryhealthdataasystematicreview AT wangning dataminingbasedmodelandriskpredictionofcolorectalcancerbyusingsecondaryhealthdataasystematicreview AT jijiafu dataminingbasedmodelandriskpredictionofcolorectalcancerbyusingsecondaryhealthdataasystematicreview |