Cargando…
Analysis of Hormone Receptor Status in Primary and Recurrent Breast Cancer Via Data Mining Pathology Reports
BACKGROUND: Hormone receptors of breast cancer, such as estrogen receptor (ER), progesterone receptor (PR), and human epidermal growth factor receptor 2 (Her-2), are important prognostic factors for breast cancer. OBJECTIVE: The current study aimed to develop a method to retrieve the statistics of h...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
De Gruyter
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6401490/ https://www.ncbi.nlm.nih.gov/pubmed/30847396 http://dx.doi.org/10.1515/med-2019-0013 |
_version_ | 1783400146341462016 |
---|---|
author | Chang, Kai-Po Chu, Yen-Wei Wang, John |
author_facet | Chang, Kai-Po Chu, Yen-Wei Wang, John |
author_sort | Chang, Kai-Po |
collection | PubMed |
description | BACKGROUND: Hormone receptors of breast cancer, such as estrogen receptor (ER), progesterone receptor (PR), and human epidermal growth factor receptor 2 (Her-2), are important prognostic factors for breast cancer. OBJECTIVE: The current study aimed to develop a method to retrieve the statistics of hormone receptor expression status, documented in pathology reports, given their importance in research for primary and recurrent breast cancer, and quality management of pathology laboratories. METHOD: A two-stage text mining approach via regular expression-based word/phrase matching, was developed to retrieve the data. RESULTS: The method achieved a sensitivity of 98.8%, 98.7% and 98.4% for extraction of ER, PR, and Her-2 results. The hormone expression status from 3679 primary and 44 recurrent breast cancer cases was successfully retrieved with the method. Statistical analysis of these data showed that the recurrent disease had a significantly lower positivity rate for ER (54.5% vs 76.5%, p=0.001278) than primary breast cancer and a higher positivity rate for Her-2 (48.8% vs 16.2%, p=9.79e-8). These results corroborated the previous literature. CONCLUSION: Text mining on pathology reports using the developed method may benefit research of primary and recurrent breast cancer. |
format | Online Article Text |
id | pubmed-6401490 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | De Gruyter |
record_format | MEDLINE/PubMed |
spelling | pubmed-64014902019-03-07 Analysis of Hormone Receptor Status in Primary and Recurrent Breast Cancer Via Data Mining Pathology Reports Chang, Kai-Po Chu, Yen-Wei Wang, John Open Med (Wars) Research Article BACKGROUND: Hormone receptors of breast cancer, such as estrogen receptor (ER), progesterone receptor (PR), and human epidermal growth factor receptor 2 (Her-2), are important prognostic factors for breast cancer. OBJECTIVE: The current study aimed to develop a method to retrieve the statistics of hormone receptor expression status, documented in pathology reports, given their importance in research for primary and recurrent breast cancer, and quality management of pathology laboratories. METHOD: A two-stage text mining approach via regular expression-based word/phrase matching, was developed to retrieve the data. RESULTS: The method achieved a sensitivity of 98.8%, 98.7% and 98.4% for extraction of ER, PR, and Her-2 results. The hormone expression status from 3679 primary and 44 recurrent breast cancer cases was successfully retrieved with the method. Statistical analysis of these data showed that the recurrent disease had a significantly lower positivity rate for ER (54.5% vs 76.5%, p=0.001278) than primary breast cancer and a higher positivity rate for Her-2 (48.8% vs 16.2%, p=9.79e-8). These results corroborated the previous literature. CONCLUSION: Text mining on pathology reports using the developed method may benefit research of primary and recurrent breast cancer. De Gruyter 2019-02-20 /pmc/articles/PMC6401490/ /pubmed/30847396 http://dx.doi.org/10.1515/med-2019-0013 Text en © 2019 Kai-Po Chang et al. published by De Gruyter http://creativecommons.org/licenses/by-nc-nd/4.0 This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License. |
spellingShingle | Research Article Chang, Kai-Po Chu, Yen-Wei Wang, John Analysis of Hormone Receptor Status in Primary and Recurrent Breast Cancer Via Data Mining Pathology Reports |
title | Analysis of Hormone Receptor Status in Primary and Recurrent Breast Cancer Via Data Mining Pathology Reports |
title_full | Analysis of Hormone Receptor Status in Primary and Recurrent Breast Cancer Via Data Mining Pathology Reports |
title_fullStr | Analysis of Hormone Receptor Status in Primary and Recurrent Breast Cancer Via Data Mining Pathology Reports |
title_full_unstemmed | Analysis of Hormone Receptor Status in Primary and Recurrent Breast Cancer Via Data Mining Pathology Reports |
title_short | Analysis of Hormone Receptor Status in Primary and Recurrent Breast Cancer Via Data Mining Pathology Reports |
title_sort | analysis of hormone receptor status in primary and recurrent breast cancer via data mining pathology reports |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6401490/ https://www.ncbi.nlm.nih.gov/pubmed/30847396 http://dx.doi.org/10.1515/med-2019-0013 |
work_keys_str_mv | AT changkaipo analysisofhormonereceptorstatusinprimaryandrecurrentbreastcancerviadataminingpathologyreports AT chuyenwei analysisofhormonereceptorstatusinprimaryandrecurrentbreastcancerviadataminingpathologyreports AT wangjohn analysisofhormonereceptorstatusinprimaryandrecurrentbreastcancerviadataminingpathologyreports |