Cargando…

A context-blocks model for identifying clinical relationships in patient records

BACKGROUND: Patient records contain valuable information regarding explanation of diagnosis, progression of disease, prescription and/or effectiveness of treatment, and more. Automatic recognition of clinically important concepts and the identification of relationships between those concepts in pati...

Descripción completa

Detalles Bibliográficos
Autores principales:	Islamaj Doğan, Rezarta, Névéol, Aurélie, Lu, Zhiyong
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2011
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3111589/ https://www.ncbi.nlm.nih.gov/pubmed/21658290 http://dx.doi.org/10.1186/1471-2105-12-S3-S3

_version_	1782205649797513216
author	Islamaj Doğan, Rezarta Névéol, Aurélie Lu, Zhiyong
author_facet	Islamaj Doğan, Rezarta Névéol, Aurélie Lu, Zhiyong
author_sort	Islamaj Doğan, Rezarta
collection	PubMed
description	BACKGROUND: Patient records contain valuable information regarding explanation of diagnosis, progression of disease, prescription and/or effectiveness of treatment, and more. Automatic recognition of clinically important concepts and the identification of relationships between those concepts in patient records are preliminary steps for many important applications in medical informatics, ranging from quality of care to hypothesis generation. METHODS: In this work we describe an approach that facilitates the automatic recognition of eight relationships defined between medical problems, treatments and tests. Unlike the traditional bag-of-words representation, in this work, we represent a relationship with a scheme of five distinct context-blocks determined by the position of concepts in the text. As a preliminary step to relationship recognition, and in order to provide an end-to-end system, we also addressed the automatic extraction of medical problems, treatments and tests. Our approach combined the outcome of a statistical model for concept recognition and simple natural language processing features in a conditional random fields model. A set of 826 patient records from the 4th i2b2 challenge was used for training and evaluating the system. RESULTS: Results show that our concept recognition system achieved an F-measure of 0.870 for exact span concept detection. Moreover the context-block representation of relationships was more successful (F-Measure = 0.775) at identifying relationships than bag-of-words (F-Measure = 0.402). Most importantly, the performance of the end-to-end system of relationship extraction using automatically extracted concepts (F-Measure = 0.704) was comparable to that obtained using manually annotated concepts (F-Measure = 0.711), and their difference was not statistically significant. CONCLUSIONS: We extracted important clinical relationships from text in an automated manner, starting with concept recognition, and ending with relationship identification. The advantage of the context-blocks representation scheme was the correct management of word position information, which may be critical in identifying certain relationships. Our results may serve as benchmark for comparison to other systems developed on i2b2 challenge data. Finally, our system may serve as a preliminary step for other discovery tasks in medical informatics.
format	Online Article Text
id	pubmed-3111589
institution	National Center for Biotechnology Information
language	English
publishDate	2011
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-31115892011-06-11 A context-blocks model for identifying clinical relationships in patient records Islamaj Doğan, Rezarta Névéol, Aurélie Lu, Zhiyong BMC Bioinformatics Research BACKGROUND: Patient records contain valuable information regarding explanation of diagnosis, progression of disease, prescription and/or effectiveness of treatment, and more. Automatic recognition of clinically important concepts and the identification of relationships between those concepts in patient records are preliminary steps for many important applications in medical informatics, ranging from quality of care to hypothesis generation. METHODS: In this work we describe an approach that facilitates the automatic recognition of eight relationships defined between medical problems, treatments and tests. Unlike the traditional bag-of-words representation, in this work, we represent a relationship with a scheme of five distinct context-blocks determined by the position of concepts in the text. As a preliminary step to relationship recognition, and in order to provide an end-to-end system, we also addressed the automatic extraction of medical problems, treatments and tests. Our approach combined the outcome of a statistical model for concept recognition and simple natural language processing features in a conditional random fields model. A set of 826 patient records from the 4th i2b2 challenge was used for training and evaluating the system. RESULTS: Results show that our concept recognition system achieved an F-measure of 0.870 for exact span concept detection. Moreover the context-block representation of relationships was more successful (F-Measure = 0.775) at identifying relationships than bag-of-words (F-Measure = 0.402). Most importantly, the performance of the end-to-end system of relationship extraction using automatically extracted concepts (F-Measure = 0.704) was comparable to that obtained using manually annotated concepts (F-Measure = 0.711), and their difference was not statistically significant. CONCLUSIONS: We extracted important clinical relationships from text in an automated manner, starting with concept recognition, and ending with relationship identification. The advantage of the context-blocks representation scheme was the correct management of word position information, which may be critical in identifying certain relationships. Our results may serve as benchmark for comparison to other systems developed on i2b2 challenge data. Finally, our system may serve as a preliminary step for other discovery tasks in medical informatics. BioMed Central 2011-06-09 /pmc/articles/PMC3111589/ /pubmed/21658290 http://dx.doi.org/10.1186/1471-2105-12-S3-S3 Text en This article is in the public domain. This article is in the public domain.
spellingShingle	Research Islamaj Doğan, Rezarta Névéol, Aurélie Lu, Zhiyong A context-blocks model for identifying clinical relationships in patient records
title	A context-blocks model for identifying clinical relationships in patient records
title_full	A context-blocks model for identifying clinical relationships in patient records
title_fullStr	A context-blocks model for identifying clinical relationships in patient records
title_full_unstemmed	A context-blocks model for identifying clinical relationships in patient records
title_short	A context-blocks model for identifying clinical relationships in patient records
title_sort	context-blocks model for identifying clinical relationships in patient records
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3111589/ https://www.ncbi.nlm.nih.gov/pubmed/21658290 http://dx.doi.org/10.1186/1471-2105-12-S3-S3
work_keys_str_mv	AT islamajdoganrezarta acontextblocksmodelforidentifyingclinicalrelationshipsinpatientrecords AT neveolaurelie acontextblocksmodelforidentifyingclinicalrelationshipsinpatientrecords AT luzhiyong acontextblocksmodelforidentifyingclinicalrelationshipsinpatientrecords AT islamajdoganrezarta contextblocksmodelforidentifyingclinicalrelationshipsinpatientrecords AT neveolaurelie contextblocksmodelforidentifyingclinicalrelationshipsinpatientrecords AT luzhiyong contextblocksmodelforidentifyingclinicalrelationshipsinpatientrecords

A context-blocks model for identifying clinical relationships in patient records

Ejemplares similares