Cargando…

Analysis of Hormone Receptor Status in Primary and Recurrent Breast Cancer Via Data Mining Pathology Reports

BACKGROUND: Hormone receptors of breast cancer, such as estrogen receptor (ER), progesterone receptor (PR), and human epidermal growth factor receptor 2 (Her-2), are important prognostic factors for breast cancer. OBJECTIVE: The current study aimed to develop a method to retrieve the statistics of h...

Descripción completa

Detalles Bibliográficos
Autores principales: Chang, Kai-Po, Chu, Yen-Wei, Wang, John
Formato: Online Artículo Texto
Lenguaje:English
Publicado: De Gruyter 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6401490/
https://www.ncbi.nlm.nih.gov/pubmed/30847396
http://dx.doi.org/10.1515/med-2019-0013
_version_ 1783400146341462016
author Chang, Kai-Po
Chu, Yen-Wei
Wang, John
author_facet Chang, Kai-Po
Chu, Yen-Wei
Wang, John
author_sort Chang, Kai-Po
collection PubMed
description BACKGROUND: Hormone receptors of breast cancer, such as estrogen receptor (ER), progesterone receptor (PR), and human epidermal growth factor receptor 2 (Her-2), are important prognostic factors for breast cancer. OBJECTIVE: The current study aimed to develop a method to retrieve the statistics of hormone receptor expression status, documented in pathology reports, given their importance in research for primary and recurrent breast cancer, and quality management of pathology laboratories. METHOD: A two-stage text mining approach via regular expression-based word/phrase matching, was developed to retrieve the data. RESULTS: The method achieved a sensitivity of 98.8%, 98.7% and 98.4% for extraction of ER, PR, and Her-2 results. The hormone expression status from 3679 primary and 44 recurrent breast cancer cases was successfully retrieved with the method. Statistical analysis of these data showed that the recurrent disease had a significantly lower positivity rate for ER (54.5% vs 76.5%, p=0.001278) than primary breast cancer and a higher positivity rate for Her-2 (48.8% vs 16.2%, p=9.79e-8). These results corroborated the previous literature. CONCLUSION: Text mining on pathology reports using the developed method may benefit research of primary and recurrent breast cancer.
format Online
Article
Text
id pubmed-6401490
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher De Gruyter
record_format MEDLINE/PubMed
spelling pubmed-64014902019-03-07 Analysis of Hormone Receptor Status in Primary and Recurrent Breast Cancer Via Data Mining Pathology Reports Chang, Kai-Po Chu, Yen-Wei Wang, John Open Med (Wars) Research Article BACKGROUND: Hormone receptors of breast cancer, such as estrogen receptor (ER), progesterone receptor (PR), and human epidermal growth factor receptor 2 (Her-2), are important prognostic factors for breast cancer. OBJECTIVE: The current study aimed to develop a method to retrieve the statistics of hormone receptor expression status, documented in pathology reports, given their importance in research for primary and recurrent breast cancer, and quality management of pathology laboratories. METHOD: A two-stage text mining approach via regular expression-based word/phrase matching, was developed to retrieve the data. RESULTS: The method achieved a sensitivity of 98.8%, 98.7% and 98.4% for extraction of ER, PR, and Her-2 results. The hormone expression status from 3679 primary and 44 recurrent breast cancer cases was successfully retrieved with the method. Statistical analysis of these data showed that the recurrent disease had a significantly lower positivity rate for ER (54.5% vs 76.5%, p=0.001278) than primary breast cancer and a higher positivity rate for Her-2 (48.8% vs 16.2%, p=9.79e-8). These results corroborated the previous literature. CONCLUSION: Text mining on pathology reports using the developed method may benefit research of primary and recurrent breast cancer. De Gruyter 2019-02-20 /pmc/articles/PMC6401490/ /pubmed/30847396 http://dx.doi.org/10.1515/med-2019-0013 Text en © 2019 Kai-Po Chang et al. published by De Gruyter http://creativecommons.org/licenses/by-nc-nd/4.0 This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.
spellingShingle Research Article
Chang, Kai-Po
Chu, Yen-Wei
Wang, John
Analysis of Hormone Receptor Status in Primary and Recurrent Breast Cancer Via Data Mining Pathology Reports
title Analysis of Hormone Receptor Status in Primary and Recurrent Breast Cancer Via Data Mining Pathology Reports
title_full Analysis of Hormone Receptor Status in Primary and Recurrent Breast Cancer Via Data Mining Pathology Reports
title_fullStr Analysis of Hormone Receptor Status in Primary and Recurrent Breast Cancer Via Data Mining Pathology Reports
title_full_unstemmed Analysis of Hormone Receptor Status in Primary and Recurrent Breast Cancer Via Data Mining Pathology Reports
title_short Analysis of Hormone Receptor Status in Primary and Recurrent Breast Cancer Via Data Mining Pathology Reports
title_sort analysis of hormone receptor status in primary and recurrent breast cancer via data mining pathology reports
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6401490/
https://www.ncbi.nlm.nih.gov/pubmed/30847396
http://dx.doi.org/10.1515/med-2019-0013
work_keys_str_mv AT changkaipo analysisofhormonereceptorstatusinprimaryandrecurrentbreastcancerviadataminingpathologyreports
AT chuyenwei analysisofhormonereceptorstatusinprimaryandrecurrentbreastcancerviadataminingpathologyreports
AT wangjohn analysisofhormonereceptorstatusinprimaryandrecurrentbreastcancerviadataminingpathologyreports