Cargando…

A context-blocks model for identifying clinical relationships in patient records

BACKGROUND: Patient records contain valuable information regarding explanation of diagnosis, progression of disease, prescription and/or effectiveness of treatment, and more. Automatic recognition of clinically important concepts and the identification of relationships between those concepts in pati...

Descripción completa

Detalles Bibliográficos
Autores principales: Islamaj Doğan, Rezarta, Névéol, Aurélie, Lu, Zhiyong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3111589/
https://www.ncbi.nlm.nih.gov/pubmed/21658290
http://dx.doi.org/10.1186/1471-2105-12-S3-S3
_version_ 1782205649797513216
author Islamaj Doğan, Rezarta
Névéol, Aurélie
Lu, Zhiyong
author_facet Islamaj Doğan, Rezarta
Névéol, Aurélie
Lu, Zhiyong
author_sort Islamaj Doğan, Rezarta
collection PubMed
description BACKGROUND: Patient records contain valuable information regarding explanation of diagnosis, progression of disease, prescription and/or effectiveness of treatment, and more. Automatic recognition of clinically important concepts and the identification of relationships between those concepts in patient records are preliminary steps for many important applications in medical informatics, ranging from quality of care to hypothesis generation. METHODS: In this work we describe an approach that facilitates the automatic recognition of eight relationships defined between medical problems, treatments and tests. Unlike the traditional bag-of-words representation, in this work, we represent a relationship with a scheme of five distinct context-blocks determined by the position of concepts in the text. As a preliminary step to relationship recognition, and in order to provide an end-to-end system, we also addressed the automatic extraction of medical problems, treatments and tests. Our approach combined the outcome of a statistical model for concept recognition and simple natural language processing features in a conditional random fields model. A set of 826 patient records from the 4th i2b2 challenge was used for training and evaluating the system. RESULTS: Results show that our concept recognition system achieved an F-measure of 0.870 for exact span concept detection. Moreover the context-block representation of relationships was more successful (F-Measure = 0.775) at identifying relationships than bag-of-words (F-Measure = 0.402). Most importantly, the performance of the end-to-end system of relationship extraction using automatically extracted concepts (F-Measure = 0.704) was comparable to that obtained using manually annotated concepts (F-Measure = 0.711), and their difference was not statistically significant. CONCLUSIONS: We extracted important clinical relationships from text in an automated manner, starting with concept recognition, and ending with relationship identification. The advantage of the context-blocks representation scheme was the correct management of word position information, which may be critical in identifying certain relationships. Our results may serve as benchmark for comparison to other systems developed on i2b2 challenge data. Finally, our system may serve as a preliminary step for other discovery tasks in medical informatics.
format Online
Article
Text
id pubmed-3111589
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-31115892011-06-11 A context-blocks model for identifying clinical relationships in patient records Islamaj Doğan, Rezarta Névéol, Aurélie Lu, Zhiyong BMC Bioinformatics Research BACKGROUND: Patient records contain valuable information regarding explanation of diagnosis, progression of disease, prescription and/or effectiveness of treatment, and more. Automatic recognition of clinically important concepts and the identification of relationships between those concepts in patient records are preliminary steps for many important applications in medical informatics, ranging from quality of care to hypothesis generation. METHODS: In this work we describe an approach that facilitates the automatic recognition of eight relationships defined between medical problems, treatments and tests. Unlike the traditional bag-of-words representation, in this work, we represent a relationship with a scheme of five distinct context-blocks determined by the position of concepts in the text. As a preliminary step to relationship recognition, and in order to provide an end-to-end system, we also addressed the automatic extraction of medical problems, treatments and tests. Our approach combined the outcome of a statistical model for concept recognition and simple natural language processing features in a conditional random fields model. A set of 826 patient records from the 4th i2b2 challenge was used for training and evaluating the system. RESULTS: Results show that our concept recognition system achieved an F-measure of 0.870 for exact span concept detection. Moreover the context-block representation of relationships was more successful (F-Measure = 0.775) at identifying relationships than bag-of-words (F-Measure = 0.402). Most importantly, the performance of the end-to-end system of relationship extraction using automatically extracted concepts (F-Measure = 0.704) was comparable to that obtained using manually annotated concepts (F-Measure = 0.711), and their difference was not statistically significant. CONCLUSIONS: We extracted important clinical relationships from text in an automated manner, starting with concept recognition, and ending with relationship identification. The advantage of the context-blocks representation scheme was the correct management of word position information, which may be critical in identifying certain relationships. Our results may serve as benchmark for comparison to other systems developed on i2b2 challenge data. Finally, our system may serve as a preliminary step for other discovery tasks in medical informatics. BioMed Central 2011-06-09 /pmc/articles/PMC3111589/ /pubmed/21658290 http://dx.doi.org/10.1186/1471-2105-12-S3-S3 Text en This article is in the public domain. This article is in the public domain.
spellingShingle Research
Islamaj Doğan, Rezarta
Névéol, Aurélie
Lu, Zhiyong
A context-blocks model for identifying clinical relationships in patient records
title A context-blocks model for identifying clinical relationships in patient records
title_full A context-blocks model for identifying clinical relationships in patient records
title_fullStr A context-blocks model for identifying clinical relationships in patient records
title_full_unstemmed A context-blocks model for identifying clinical relationships in patient records
title_short A context-blocks model for identifying clinical relationships in patient records
title_sort context-blocks model for identifying clinical relationships in patient records
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3111589/
https://www.ncbi.nlm.nih.gov/pubmed/21658290
http://dx.doi.org/10.1186/1471-2105-12-S3-S3
work_keys_str_mv AT islamajdoganrezarta acontextblocksmodelforidentifyingclinicalrelationshipsinpatientrecords
AT neveolaurelie acontextblocksmodelforidentifyingclinicalrelationshipsinpatientrecords
AT luzhiyong acontextblocksmodelforidentifyingclinicalrelationshipsinpatientrecords
AT islamajdoganrezarta contextblocksmodelforidentifyingclinicalrelationshipsinpatientrecords
AT neveolaurelie contextblocksmodelforidentifyingclinicalrelationshipsinpatientrecords
AT luzhiyong contextblocksmodelforidentifyingclinicalrelationshipsinpatientrecords