Cargando…
Computational Discrimination of Breast Cancer for Korean Women Based on Epidemiologic Data Only
Breast cancer is the second leading cancer for Korean women and its incidence rate has been increasing annually. If early diagnosis were implemented with epidemiologic data, the women could easily assess breast cancer risk using internet. National Cancer Institute in the United States has released a...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
The Korean Academy of Medical Sciences
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4520931/ https://www.ncbi.nlm.nih.gov/pubmed/26240478 http://dx.doi.org/10.3346/jkms.2015.30.8.1025 |
_version_ | 1782383731931086848 |
---|---|
author | Lee, Chiwon Lee, Jung Chan Park, Boyoung Bae, Jonghee Lim, Min Hyuk Kang, Daehee Yoo, Keun-Young Park, Sue K. Kim, Youdan Kim, Sungwan |
author_facet | Lee, Chiwon Lee, Jung Chan Park, Boyoung Bae, Jonghee Lim, Min Hyuk Kang, Daehee Yoo, Keun-Young Park, Sue K. Kim, Youdan Kim, Sungwan |
author_sort | Lee, Chiwon |
collection | PubMed |
description | Breast cancer is the second leading cancer for Korean women and its incidence rate has been increasing annually. If early diagnosis were implemented with epidemiologic data, the women could easily assess breast cancer risk using internet. National Cancer Institute in the United States has released a Web-based Breast Cancer Risk Assessment Tool based on Gail model. However, it is inapplicable directly to Korean women since breast cancer risk is dependent on race. Also, it shows low accuracy (58%-59%). In this study, breast cancer discrimination models for Korean women are developed using only epidemiological case-control data (n = 4,574). The models are configured by different classification techniques: support vector machine, artificial neural network, and Bayesian network. A 1,000-time repeated random sub-sampling validation is performed for diverse parameter conditions, respectively. The performance is evaluated and compared as an area under the receiver operating characteristic curve (AUC). According to age group and classification techniques, AUC, accuracy, sensitivity, specificity, and calculation time of all models were calculated and compared. Although the support vector machine took the longest calculation time, the highest classification performance has been achieved in the case of women older than 50 yr (AUC = 64%). The proposed model is dependent on demographic characteristics, reproductive factors, and lifestyle habits without using any clinical or genetic test. It is expected that the model could be implemented as a web-based discrimination tool for breast cancer. This tool can encourage potential breast cancer prone women to go the hospital for diagnostic tests. GRAPHICAL ABSTRACT: [Image: see text] |
format | Online Article Text |
id | pubmed-4520931 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | The Korean Academy of Medical Sciences |
record_format | MEDLINE/PubMed |
spelling | pubmed-45209312015-08-03 Computational Discrimination of Breast Cancer for Korean Women Based on Epidemiologic Data Only Lee, Chiwon Lee, Jung Chan Park, Boyoung Bae, Jonghee Lim, Min Hyuk Kang, Daehee Yoo, Keun-Young Park, Sue K. Kim, Youdan Kim, Sungwan J Korean Med Sci Original Article Breast cancer is the second leading cancer for Korean women and its incidence rate has been increasing annually. If early diagnosis were implemented with epidemiologic data, the women could easily assess breast cancer risk using internet. National Cancer Institute in the United States has released a Web-based Breast Cancer Risk Assessment Tool based on Gail model. However, it is inapplicable directly to Korean women since breast cancer risk is dependent on race. Also, it shows low accuracy (58%-59%). In this study, breast cancer discrimination models for Korean women are developed using only epidemiological case-control data (n = 4,574). The models are configured by different classification techniques: support vector machine, artificial neural network, and Bayesian network. A 1,000-time repeated random sub-sampling validation is performed for diverse parameter conditions, respectively. The performance is evaluated and compared as an area under the receiver operating characteristic curve (AUC). According to age group and classification techniques, AUC, accuracy, sensitivity, specificity, and calculation time of all models were calculated and compared. Although the support vector machine took the longest calculation time, the highest classification performance has been achieved in the case of women older than 50 yr (AUC = 64%). The proposed model is dependent on demographic characteristics, reproductive factors, and lifestyle habits without using any clinical or genetic test. It is expected that the model could be implemented as a web-based discrimination tool for breast cancer. This tool can encourage potential breast cancer prone women to go the hospital for diagnostic tests. GRAPHICAL ABSTRACT: [Image: see text] The Korean Academy of Medical Sciences 2015-08 2015-07-15 /pmc/articles/PMC4520931/ /pubmed/26240478 http://dx.doi.org/10.3346/jkms.2015.30.8.1025 Text en © 2015 The Korean Academy of Medical Sciences. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Article Lee, Chiwon Lee, Jung Chan Park, Boyoung Bae, Jonghee Lim, Min Hyuk Kang, Daehee Yoo, Keun-Young Park, Sue K. Kim, Youdan Kim, Sungwan Computational Discrimination of Breast Cancer for Korean Women Based on Epidemiologic Data Only |
title | Computational Discrimination of Breast Cancer for Korean Women Based on Epidemiologic Data Only |
title_full | Computational Discrimination of Breast Cancer for Korean Women Based on Epidemiologic Data Only |
title_fullStr | Computational Discrimination of Breast Cancer for Korean Women Based on Epidemiologic Data Only |
title_full_unstemmed | Computational Discrimination of Breast Cancer for Korean Women Based on Epidemiologic Data Only |
title_short | Computational Discrimination of Breast Cancer for Korean Women Based on Epidemiologic Data Only |
title_sort | computational discrimination of breast cancer for korean women based on epidemiologic data only |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4520931/ https://www.ncbi.nlm.nih.gov/pubmed/26240478 http://dx.doi.org/10.3346/jkms.2015.30.8.1025 |
work_keys_str_mv | AT leechiwon computationaldiscriminationofbreastcancerforkoreanwomenbasedonepidemiologicdataonly AT leejungchan computationaldiscriminationofbreastcancerforkoreanwomenbasedonepidemiologicdataonly AT parkboyoung computationaldiscriminationofbreastcancerforkoreanwomenbasedonepidemiologicdataonly AT baejonghee computationaldiscriminationofbreastcancerforkoreanwomenbasedonepidemiologicdataonly AT limminhyuk computationaldiscriminationofbreastcancerforkoreanwomenbasedonepidemiologicdataonly AT kangdaehee computationaldiscriminationofbreastcancerforkoreanwomenbasedonepidemiologicdataonly AT yookeunyoung computationaldiscriminationofbreastcancerforkoreanwomenbasedonepidemiologicdataonly AT parksuek computationaldiscriminationofbreastcancerforkoreanwomenbasedonepidemiologicdataonly AT kimyoudan computationaldiscriminationofbreastcancerforkoreanwomenbasedonepidemiologicdataonly AT kimsungwan computationaldiscriminationofbreastcancerforkoreanwomenbasedonepidemiologicdataonly |