Cargando…

A Metadata Extraction Approach for Clinical Case Reports to Enable Advanced Understanding of Biomedical Concepts

Clinical case reports (CCRs) are a valuable means of sharing observations and insights in medicine. The form of these documents varies, and their content includes descriptions of numerous, novel disease presentations and treatments. Thus far, the text data within CCRs is largely unstructured, requir...

Descripción completa

Detalles Bibliográficos
Autores principales: Caufield, John Harry, Liem, David A., Garlid, Anders O., Zhou, Yijiang, Watson, Karol, Bui, Alex A. T., Wang, Wei, Ping, Peipei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MyJove Corporation 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6235242/
https://www.ncbi.nlm.nih.gov/pubmed/30295669
http://dx.doi.org/10.3791/58392
_version_ 1783370842251460608
author Caufield, John Harry
Liem, David A.
Garlid, Anders O.
Zhou, Yijiang
Watson, Karol
Bui, Alex A. T.
Wang, Wei
Ping, Peipei
author_facet Caufield, John Harry
Liem, David A.
Garlid, Anders O.
Zhou, Yijiang
Watson, Karol
Bui, Alex A. T.
Wang, Wei
Ping, Peipei
author_sort Caufield, John Harry
collection PubMed
description Clinical case reports (CCRs) are a valuable means of sharing observations and insights in medicine. The form of these documents varies, and their content includes descriptions of numerous, novel disease presentations and treatments. Thus far, the text data within CCRs is largely unstructured, requiring significant human and computational effort to render these data useful for in-depth analysis. In this protocol, we describe methods for identifying metadata corresponding to specific biomedical concepts frequently observed within CCRs. We provide a metadata template as a guide for document annotation, recognizing that imposing structure on CCRs may be pursued by combinations of manual and automated effort. The approach presented here is appropriate for organization of concept-related text from a large literature corpus (e.g., thousands of CCRs) but may be easily adapted to facilitate more focused tasks or small sets of reports. The resulting structured text data includes sufficient semantic context to support a variety of subsequent text analysis workflows: meta-analyses to determine how to maximize CCR detail, epidemiological studies of rare diseases, and the development of models of medical language may all be made more realizable and manageable through the use of structured text data.
format Online
Article
Text
id pubmed-6235242
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher MyJove Corporation
record_format MEDLINE/PubMed
spelling pubmed-62352422018-11-20 A Metadata Extraction Approach for Clinical Case Reports to Enable Advanced Understanding of Biomedical Concepts Caufield, John Harry Liem, David A. Garlid, Anders O. Zhou, Yijiang Watson, Karol Bui, Alex A. T. Wang, Wei Ping, Peipei J Vis Exp This Month in JoVE Clinical case reports (CCRs) are a valuable means of sharing observations and insights in medicine. The form of these documents varies, and their content includes descriptions of numerous, novel disease presentations and treatments. Thus far, the text data within CCRs is largely unstructured, requiring significant human and computational effort to render these data useful for in-depth analysis. In this protocol, we describe methods for identifying metadata corresponding to specific biomedical concepts frequently observed within CCRs. We provide a metadata template as a guide for document annotation, recognizing that imposing structure on CCRs may be pursued by combinations of manual and automated effort. The approach presented here is appropriate for organization of concept-related text from a large literature corpus (e.g., thousands of CCRs) but may be easily adapted to facilitate more focused tasks or small sets of reports. The resulting structured text data includes sufficient semantic context to support a variety of subsequent text analysis workflows: meta-analyses to determine how to maximize CCR detail, epidemiological studies of rare diseases, and the development of models of medical language may all be made more realizable and manageable through the use of structured text data. MyJove Corporation 2018-09-20 /pmc/articles/PMC6235242/ /pubmed/30295669 http://dx.doi.org/10.3791/58392 Text en Copyright © 2018, Journal of Visualized Experiments http://creativecommons.org/licenses/by-nc-nd/3.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs 3.0 Unported License. To view a copy of this license, visithttp://creativecommons.org/licenses/by-nc-nd/3.0/
spellingShingle This Month in JoVE
Caufield, John Harry
Liem, David A.
Garlid, Anders O.
Zhou, Yijiang
Watson, Karol
Bui, Alex A. T.
Wang, Wei
Ping, Peipei
A Metadata Extraction Approach for Clinical Case Reports to Enable Advanced Understanding of Biomedical Concepts
title A Metadata Extraction Approach for Clinical Case Reports to Enable Advanced Understanding of Biomedical Concepts
title_full A Metadata Extraction Approach for Clinical Case Reports to Enable Advanced Understanding of Biomedical Concepts
title_fullStr A Metadata Extraction Approach for Clinical Case Reports to Enable Advanced Understanding of Biomedical Concepts
title_full_unstemmed A Metadata Extraction Approach for Clinical Case Reports to Enable Advanced Understanding of Biomedical Concepts
title_short A Metadata Extraction Approach for Clinical Case Reports to Enable Advanced Understanding of Biomedical Concepts
title_sort metadata extraction approach for clinical case reports to enable advanced understanding of biomedical concepts
topic This Month in JoVE
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6235242/
https://www.ncbi.nlm.nih.gov/pubmed/30295669
http://dx.doi.org/10.3791/58392
work_keys_str_mv AT caufieldjohnharry ametadataextractionapproachforclinicalcasereportstoenableadvancedunderstandingofbiomedicalconcepts
AT liemdavida ametadataextractionapproachforclinicalcasereportstoenableadvancedunderstandingofbiomedicalconcepts
AT garlidanderso ametadataextractionapproachforclinicalcasereportstoenableadvancedunderstandingofbiomedicalconcepts
AT zhouyijiang ametadataextractionapproachforclinicalcasereportstoenableadvancedunderstandingofbiomedicalconcepts
AT watsonkarol ametadataextractionapproachforclinicalcasereportstoenableadvancedunderstandingofbiomedicalconcepts
AT buialexat ametadataextractionapproachforclinicalcasereportstoenableadvancedunderstandingofbiomedicalconcepts
AT wangwei ametadataextractionapproachforclinicalcasereportstoenableadvancedunderstandingofbiomedicalconcepts
AT pingpeipei ametadataextractionapproachforclinicalcasereportstoenableadvancedunderstandingofbiomedicalconcepts
AT caufieldjohnharry metadataextractionapproachforclinicalcasereportstoenableadvancedunderstandingofbiomedicalconcepts
AT liemdavida metadataextractionapproachforclinicalcasereportstoenableadvancedunderstandingofbiomedicalconcepts
AT garlidanderso metadataextractionapproachforclinicalcasereportstoenableadvancedunderstandingofbiomedicalconcepts
AT zhouyijiang metadataextractionapproachforclinicalcasereportstoenableadvancedunderstandingofbiomedicalconcepts
AT watsonkarol metadataextractionapproachforclinicalcasereportstoenableadvancedunderstandingofbiomedicalconcepts
AT buialexat metadataextractionapproachforclinicalcasereportstoenableadvancedunderstandingofbiomedicalconcepts
AT wangwei metadataextractionapproachforclinicalcasereportstoenableadvancedunderstandingofbiomedicalconcepts
AT pingpeipei metadataextractionapproachforclinicalcasereportstoenableadvancedunderstandingofbiomedicalconcepts