Cargando…

Developing Customizable Cancer Information Extraction Modules for Pathology Reports Using CLAMP

Natural language processing (NLP) technologies have been successfully applied to cancer research by enabling automated phenotypic information extraction from narratives in electronic health records (EHRs) such as pathology reports; however, developing customized NLP solutions requires substantial ef...

Descripción completa

Detalles Bibliográficos
Autores principales: Soysal, Ergin, Warner, Jeremy L., Wang, Jingqi, Jiang, Min, Harvey, Krysten, Jain, Sandeep Kumar, Dong, Xiao, Song, Hsing-Yi, Siddhanamatha, Harish, Wang, Liwei, Dai, Qi, Chen, Qingxia, Du, Xianglin, Tao, Cui, Yang, Ping, Denny, Joshua Charles, Liu, Hongfang, Xu, Hua
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7359882/
https://www.ncbi.nlm.nih.gov/pubmed/31438083
http://dx.doi.org/10.3233/SHTI190383
_version_ 1783559130160562176
author Soysal, Ergin
Warner, Jeremy L.
Wang, Jingqi
Jiang, Min
Harvey, Krysten
Jain, Sandeep Kumar
Dong, Xiao
Song, Hsing-Yi
Siddhanamatha, Harish
Wang, Liwei
Dai, Qi
Chen, Qingxia
Du, Xianglin
Tao, Cui
Yang, Ping
Denny, Joshua Charles
Liu, Hongfang
Xu, Hua
author_facet Soysal, Ergin
Warner, Jeremy L.
Wang, Jingqi
Jiang, Min
Harvey, Krysten
Jain, Sandeep Kumar
Dong, Xiao
Song, Hsing-Yi
Siddhanamatha, Harish
Wang, Liwei
Dai, Qi
Chen, Qingxia
Du, Xianglin
Tao, Cui
Yang, Ping
Denny, Joshua Charles
Liu, Hongfang
Xu, Hua
author_sort Soysal, Ergin
collection PubMed
description Natural language processing (NLP) technologies have been successfully applied to cancer research by enabling automated phenotypic information extraction from narratives in electronic health records (EHRs) such as pathology reports; however, developing customized NLP solutions requires substantial effort. To facilitate the adoption of NLP in cancer research, we have developed a set of customizable modules for extracting comprehensive types of cancer-related information in pathology reports (e.g., tumor size, tumor stage, and biomarkers), by leveraging the existing CLAMP system, which provides user-friendly interfaces for building customized NLP solutions for individual needs. Evaluation using annotated data at Vanderbilt University Medical Center showed that CLAMP-Cancer could extract diverse types of cancer information with good F-measures (0.80-0.98). We then applied CLAMP-Cancer to an information extraction task at Mayo Clinic and showed that we can quickly build a customized NLP system with comparable performance with an existing system at Mayo Clinic. CLAMP-Cancer is freely available for academic use.
format Online
Article
Text
id pubmed-7359882
institution National Center for Biotechnology Information
language English
publishDate 2019
record_format MEDLINE/PubMed
spelling pubmed-73598822020-07-14 Developing Customizable Cancer Information Extraction Modules for Pathology Reports Using CLAMP Soysal, Ergin Warner, Jeremy L. Wang, Jingqi Jiang, Min Harvey, Krysten Jain, Sandeep Kumar Dong, Xiao Song, Hsing-Yi Siddhanamatha, Harish Wang, Liwei Dai, Qi Chen, Qingxia Du, Xianglin Tao, Cui Yang, Ping Denny, Joshua Charles Liu, Hongfang Xu, Hua Stud Health Technol Inform Article Natural language processing (NLP) technologies have been successfully applied to cancer research by enabling automated phenotypic information extraction from narratives in electronic health records (EHRs) such as pathology reports; however, developing customized NLP solutions requires substantial effort. To facilitate the adoption of NLP in cancer research, we have developed a set of customizable modules for extracting comprehensive types of cancer-related information in pathology reports (e.g., tumor size, tumor stage, and biomarkers), by leveraging the existing CLAMP system, which provides user-friendly interfaces for building customized NLP solutions for individual needs. Evaluation using annotated data at Vanderbilt University Medical Center showed that CLAMP-Cancer could extract diverse types of cancer information with good F-measures (0.80-0.98). We then applied CLAMP-Cancer to an information extraction task at Mayo Clinic and showed that we can quickly build a customized NLP system with comparable performance with an existing system at Mayo Clinic. CLAMP-Cancer is freely available for academic use. 2019-08-21 /pmc/articles/PMC7359882/ /pubmed/31438083 http://dx.doi.org/10.3233/SHTI190383 Text en This article is published online with Open Access by IOS Press and distributed under the terms of the Creative Commons Attribution Non-Commercial License 4.0 (CC BY-NC 4.0). http://creativecommons.org/licenses/by-nc/4.0/
spellingShingle Article
Soysal, Ergin
Warner, Jeremy L.
Wang, Jingqi
Jiang, Min
Harvey, Krysten
Jain, Sandeep Kumar
Dong, Xiao
Song, Hsing-Yi
Siddhanamatha, Harish
Wang, Liwei
Dai, Qi
Chen, Qingxia
Du, Xianglin
Tao, Cui
Yang, Ping
Denny, Joshua Charles
Liu, Hongfang
Xu, Hua
Developing Customizable Cancer Information Extraction Modules for Pathology Reports Using CLAMP
title Developing Customizable Cancer Information Extraction Modules for Pathology Reports Using CLAMP
title_full Developing Customizable Cancer Information Extraction Modules for Pathology Reports Using CLAMP
title_fullStr Developing Customizable Cancer Information Extraction Modules for Pathology Reports Using CLAMP
title_full_unstemmed Developing Customizable Cancer Information Extraction Modules for Pathology Reports Using CLAMP
title_short Developing Customizable Cancer Information Extraction Modules for Pathology Reports Using CLAMP
title_sort developing customizable cancer information extraction modules for pathology reports using clamp
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7359882/
https://www.ncbi.nlm.nih.gov/pubmed/31438083
http://dx.doi.org/10.3233/SHTI190383
work_keys_str_mv AT soysalergin developingcustomizablecancerinformationextractionmodulesforpathologyreportsusingclamp
AT warnerjeremyl developingcustomizablecancerinformationextractionmodulesforpathologyreportsusingclamp
AT wangjingqi developingcustomizablecancerinformationextractionmodulesforpathologyreportsusingclamp
AT jiangmin developingcustomizablecancerinformationextractionmodulesforpathologyreportsusingclamp
AT harveykrysten developingcustomizablecancerinformationextractionmodulesforpathologyreportsusingclamp
AT jainsandeepkumar developingcustomizablecancerinformationextractionmodulesforpathologyreportsusingclamp
AT dongxiao developingcustomizablecancerinformationextractionmodulesforpathologyreportsusingclamp
AT songhsingyi developingcustomizablecancerinformationextractionmodulesforpathologyreportsusingclamp
AT siddhanamathaharish developingcustomizablecancerinformationextractionmodulesforpathologyreportsusingclamp
AT wangliwei developingcustomizablecancerinformationextractionmodulesforpathologyreportsusingclamp
AT daiqi developingcustomizablecancerinformationextractionmodulesforpathologyreportsusingclamp
AT chenqingxia developingcustomizablecancerinformationextractionmodulesforpathologyreportsusingclamp
AT duxianglin developingcustomizablecancerinformationextractionmodulesforpathologyreportsusingclamp
AT taocui developingcustomizablecancerinformationextractionmodulesforpathologyreportsusingclamp
AT yangping developingcustomizablecancerinformationextractionmodulesforpathologyreportsusingclamp
AT dennyjoshuacharles developingcustomizablecancerinformationextractionmodulesforpathologyreportsusingclamp
AT liuhongfang developingcustomizablecancerinformationextractionmodulesforpathologyreportsusingclamp
AT xuhua developingcustomizablecancerinformationextractionmodulesforpathologyreportsusingclamp