Cargando…

Computational Discrimination of Breast Cancer for Korean Women Based on Epidemiologic Data Only

Breast cancer is the second leading cancer for Korean women and its incidence rate has been increasing annually. If early diagnosis were implemented with epidemiologic data, the women could easily assess breast cancer risk using internet. National Cancer Institute in the United States has released a...

Descripción completa

Detalles Bibliográficos
Autores principales: Lee, Chiwon, Lee, Jung Chan, Park, Boyoung, Bae, Jonghee, Lim, Min Hyuk, Kang, Daehee, Yoo, Keun-Young, Park, Sue K., Kim, Youdan, Kim, Sungwan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Korean Academy of Medical Sciences 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4520931/
https://www.ncbi.nlm.nih.gov/pubmed/26240478
http://dx.doi.org/10.3346/jkms.2015.30.8.1025
_version_ 1782383731931086848
author Lee, Chiwon
Lee, Jung Chan
Park, Boyoung
Bae, Jonghee
Lim, Min Hyuk
Kang, Daehee
Yoo, Keun-Young
Park, Sue K.
Kim, Youdan
Kim, Sungwan
author_facet Lee, Chiwon
Lee, Jung Chan
Park, Boyoung
Bae, Jonghee
Lim, Min Hyuk
Kang, Daehee
Yoo, Keun-Young
Park, Sue K.
Kim, Youdan
Kim, Sungwan
author_sort Lee, Chiwon
collection PubMed
description Breast cancer is the second leading cancer for Korean women and its incidence rate has been increasing annually. If early diagnosis were implemented with epidemiologic data, the women could easily assess breast cancer risk using internet. National Cancer Institute in the United States has released a Web-based Breast Cancer Risk Assessment Tool based on Gail model. However, it is inapplicable directly to Korean women since breast cancer risk is dependent on race. Also, it shows low accuracy (58%-59%). In this study, breast cancer discrimination models for Korean women are developed using only epidemiological case-control data (n = 4,574). The models are configured by different classification techniques: support vector machine, artificial neural network, and Bayesian network. A 1,000-time repeated random sub-sampling validation is performed for diverse parameter conditions, respectively. The performance is evaluated and compared as an area under the receiver operating characteristic curve (AUC). According to age group and classification techniques, AUC, accuracy, sensitivity, specificity, and calculation time of all models were calculated and compared. Although the support vector machine took the longest calculation time, the highest classification performance has been achieved in the case of women older than 50 yr (AUC = 64%). The proposed model is dependent on demographic characteristics, reproductive factors, and lifestyle habits without using any clinical or genetic test. It is expected that the model could be implemented as a web-based discrimination tool for breast cancer. This tool can encourage potential breast cancer prone women to go the hospital for diagnostic tests. GRAPHICAL ABSTRACT: [Image: see text]
format Online
Article
Text
id pubmed-4520931
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher The Korean Academy of Medical Sciences
record_format MEDLINE/PubMed
spelling pubmed-45209312015-08-03 Computational Discrimination of Breast Cancer for Korean Women Based on Epidemiologic Data Only Lee, Chiwon Lee, Jung Chan Park, Boyoung Bae, Jonghee Lim, Min Hyuk Kang, Daehee Yoo, Keun-Young Park, Sue K. Kim, Youdan Kim, Sungwan J Korean Med Sci Original Article Breast cancer is the second leading cancer for Korean women and its incidence rate has been increasing annually. If early diagnosis were implemented with epidemiologic data, the women could easily assess breast cancer risk using internet. National Cancer Institute in the United States has released a Web-based Breast Cancer Risk Assessment Tool based on Gail model. However, it is inapplicable directly to Korean women since breast cancer risk is dependent on race. Also, it shows low accuracy (58%-59%). In this study, breast cancer discrimination models for Korean women are developed using only epidemiological case-control data (n = 4,574). The models are configured by different classification techniques: support vector machine, artificial neural network, and Bayesian network. A 1,000-time repeated random sub-sampling validation is performed for diverse parameter conditions, respectively. The performance is evaluated and compared as an area under the receiver operating characteristic curve (AUC). According to age group and classification techniques, AUC, accuracy, sensitivity, specificity, and calculation time of all models were calculated and compared. Although the support vector machine took the longest calculation time, the highest classification performance has been achieved in the case of women older than 50 yr (AUC = 64%). The proposed model is dependent on demographic characteristics, reproductive factors, and lifestyle habits without using any clinical or genetic test. It is expected that the model could be implemented as a web-based discrimination tool for breast cancer. This tool can encourage potential breast cancer prone women to go the hospital for diagnostic tests. GRAPHICAL ABSTRACT: [Image: see text] The Korean Academy of Medical Sciences 2015-08 2015-07-15 /pmc/articles/PMC4520931/ /pubmed/26240478 http://dx.doi.org/10.3346/jkms.2015.30.8.1025 Text en © 2015 The Korean Academy of Medical Sciences. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Lee, Chiwon
Lee, Jung Chan
Park, Boyoung
Bae, Jonghee
Lim, Min Hyuk
Kang, Daehee
Yoo, Keun-Young
Park, Sue K.
Kim, Youdan
Kim, Sungwan
Computational Discrimination of Breast Cancer for Korean Women Based on Epidemiologic Data Only
title Computational Discrimination of Breast Cancer for Korean Women Based on Epidemiologic Data Only
title_full Computational Discrimination of Breast Cancer for Korean Women Based on Epidemiologic Data Only
title_fullStr Computational Discrimination of Breast Cancer for Korean Women Based on Epidemiologic Data Only
title_full_unstemmed Computational Discrimination of Breast Cancer for Korean Women Based on Epidemiologic Data Only
title_short Computational Discrimination of Breast Cancer for Korean Women Based on Epidemiologic Data Only
title_sort computational discrimination of breast cancer for korean women based on epidemiologic data only
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4520931/
https://www.ncbi.nlm.nih.gov/pubmed/26240478
http://dx.doi.org/10.3346/jkms.2015.30.8.1025
work_keys_str_mv AT leechiwon computationaldiscriminationofbreastcancerforkoreanwomenbasedonepidemiologicdataonly
AT leejungchan computationaldiscriminationofbreastcancerforkoreanwomenbasedonepidemiologicdataonly
AT parkboyoung computationaldiscriminationofbreastcancerforkoreanwomenbasedonepidemiologicdataonly
AT baejonghee computationaldiscriminationofbreastcancerforkoreanwomenbasedonepidemiologicdataonly
AT limminhyuk computationaldiscriminationofbreastcancerforkoreanwomenbasedonepidemiologicdataonly
AT kangdaehee computationaldiscriminationofbreastcancerforkoreanwomenbasedonepidemiologicdataonly
AT yookeunyoung computationaldiscriminationofbreastcancerforkoreanwomenbasedonepidemiologicdataonly
AT parksuek computationaldiscriminationofbreastcancerforkoreanwomenbasedonepidemiologicdataonly
AT kimyoudan computationaldiscriminationofbreastcancerforkoreanwomenbasedonepidemiologicdataonly
AT kimsungwan computationaldiscriminationofbreastcancerforkoreanwomenbasedonepidemiologicdataonly