Cargando…

A common type system for clinical natural language processing

BACKGROUND: One challenge in reusing clinical data stored in electronic medical records is that these data are heterogenous. Clinical Natural Language Processing (NLP) plays an important role in transforming information in clinical text to a standard representation that is comparable and interoperab...

Descripción completa

Detalles Bibliográficos
Autores principales: Wu, Stephen T, Kaggal, Vinod C, Dligach, Dmitriy, Masanz, James J, Chen, Pei, Becker, Lee, Chapman, Wendy W, Savova, Guergana K, Liu, Hongfang, Chute, Christopher G
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3575354/
https://www.ncbi.nlm.nih.gov/pubmed/23286462
http://dx.doi.org/10.1186/2041-1480-4-1
_version_ 1782259709032529920
author Wu, Stephen T
Kaggal, Vinod C
Dligach, Dmitriy
Masanz, James J
Chen, Pei
Becker, Lee
Chapman, Wendy W
Savova, Guergana K
Liu, Hongfang
Chute, Christopher G
author_facet Wu, Stephen T
Kaggal, Vinod C
Dligach, Dmitriy
Masanz, James J
Chen, Pei
Becker, Lee
Chapman, Wendy W
Savova, Guergana K
Liu, Hongfang
Chute, Christopher G
author_sort Wu, Stephen T
collection PubMed
description BACKGROUND: One challenge in reusing clinical data stored in electronic medical records is that these data are heterogenous. Clinical Natural Language Processing (NLP) plays an important role in transforming information in clinical text to a standard representation that is comparable and interoperable. Information may be processed and shared when a type system specifies the allowable data structures. Therefore, we aim to define a common type system for clinical NLP that enables interoperability between structured and unstructured data generated in different clinical settings. RESULTS: We describe a common type system for clinical NLP that has an end target of deep semantics based on Clinical Element Models (CEMs), thus interoperating with structured data and accommodating diverse NLP approaches. The type system has been implemented in UIMA (Unstructured Information Management Architecture) and is fully functional in a popular open-source clinical NLP system, cTAKES (clinical Text Analysis and Knowledge Extraction System) versions 2.0 and later. CONCLUSIONS: We have created a type system that targets deep semantics, thereby allowing for NLP systems to encapsulate knowledge from text and share it alongside heterogenous clinical data sources. Rather than surface semantics that are typically the end product of NLP algorithms, CEM-based semantics explicitly build in deep clinical semantics as the point of interoperability with more structured data types.
format Online
Article
Text
id pubmed-3575354
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-35753542013-02-19 A common type system for clinical natural language processing Wu, Stephen T Kaggal, Vinod C Dligach, Dmitriy Masanz, James J Chen, Pei Becker, Lee Chapman, Wendy W Savova, Guergana K Liu, Hongfang Chute, Christopher G J Biomed Semantics Research BACKGROUND: One challenge in reusing clinical data stored in electronic medical records is that these data are heterogenous. Clinical Natural Language Processing (NLP) plays an important role in transforming information in clinical text to a standard representation that is comparable and interoperable. Information may be processed and shared when a type system specifies the allowable data structures. Therefore, we aim to define a common type system for clinical NLP that enables interoperability between structured and unstructured data generated in different clinical settings. RESULTS: We describe a common type system for clinical NLP that has an end target of deep semantics based on Clinical Element Models (CEMs), thus interoperating with structured data and accommodating diverse NLP approaches. The type system has been implemented in UIMA (Unstructured Information Management Architecture) and is fully functional in a popular open-source clinical NLP system, cTAKES (clinical Text Analysis and Knowledge Extraction System) versions 2.0 and later. CONCLUSIONS: We have created a type system that targets deep semantics, thereby allowing for NLP systems to encapsulate knowledge from text and share it alongside heterogenous clinical data sources. Rather than surface semantics that are typically the end product of NLP algorithms, CEM-based semantics explicitly build in deep clinical semantics as the point of interoperability with more structured data types. BioMed Central 2013-01-03 /pmc/articles/PMC3575354/ /pubmed/23286462 http://dx.doi.org/10.1186/2041-1480-4-1 Text en Copyright ©2013 Wu et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Wu, Stephen T
Kaggal, Vinod C
Dligach, Dmitriy
Masanz, James J
Chen, Pei
Becker, Lee
Chapman, Wendy W
Savova, Guergana K
Liu, Hongfang
Chute, Christopher G
A common type system for clinical natural language processing
title A common type system for clinical natural language processing
title_full A common type system for clinical natural language processing
title_fullStr A common type system for clinical natural language processing
title_full_unstemmed A common type system for clinical natural language processing
title_short A common type system for clinical natural language processing
title_sort common type system for clinical natural language processing
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3575354/
https://www.ncbi.nlm.nih.gov/pubmed/23286462
http://dx.doi.org/10.1186/2041-1480-4-1
work_keys_str_mv AT wustephent acommontypesystemforclinicalnaturallanguageprocessing
AT kaggalvinodc acommontypesystemforclinicalnaturallanguageprocessing
AT dligachdmitriy acommontypesystemforclinicalnaturallanguageprocessing
AT masanzjamesj acommontypesystemforclinicalnaturallanguageprocessing
AT chenpei acommontypesystemforclinicalnaturallanguageprocessing
AT beckerlee acommontypesystemforclinicalnaturallanguageprocessing
AT chapmanwendyw acommontypesystemforclinicalnaturallanguageprocessing
AT savovaguerganak acommontypesystemforclinicalnaturallanguageprocessing
AT liuhongfang acommontypesystemforclinicalnaturallanguageprocessing
AT chutechristopherg acommontypesystemforclinicalnaturallanguageprocessing
AT wustephent commontypesystemforclinicalnaturallanguageprocessing
AT kaggalvinodc commontypesystemforclinicalnaturallanguageprocessing
AT dligachdmitriy commontypesystemforclinicalnaturallanguageprocessing
AT masanzjamesj commontypesystemforclinicalnaturallanguageprocessing
AT chenpei commontypesystemforclinicalnaturallanguageprocessing
AT beckerlee commontypesystemforclinicalnaturallanguageprocessing
AT chapmanwendyw commontypesystemforclinicalnaturallanguageprocessing
AT savovaguerganak commontypesystemforclinicalnaturallanguageprocessing
AT liuhongfang commontypesystemforclinicalnaturallanguageprocessing
AT chutechristopherg commontypesystemforclinicalnaturallanguageprocessing