Cargando…

tbiExtractor: A framework for extracting traumatic brain injury common data elements from radiology reports

BACKGROUND: The manual extraction of valuable data from electronic medical records is cumbersome, error-prone, and inconsistent. By automating extraction in conjunction with standardized terminology, the quality and consistency of data utilized for research and clinical purposes would be substantial...

Descripción completa

Detalles Bibliográficos
Autores principales: Mahan, Margaret, Rafter, Daniel, Casey, Hannah, Engelking, Marta, Abdallah, Tessneem, Truwit, Charles, Oswood, Mark, Samadani, Uzma
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7329124/
https://www.ncbi.nlm.nih.gov/pubmed/32609723
http://dx.doi.org/10.1371/journal.pone.0214775
_version_ 1783552855376920576
author Mahan, Margaret
Rafter, Daniel
Casey, Hannah
Engelking, Marta
Abdallah, Tessneem
Truwit, Charles
Oswood, Mark
Samadani, Uzma
author_facet Mahan, Margaret
Rafter, Daniel
Casey, Hannah
Engelking, Marta
Abdallah, Tessneem
Truwit, Charles
Oswood, Mark
Samadani, Uzma
author_sort Mahan, Margaret
collection PubMed
description BACKGROUND: The manual extraction of valuable data from electronic medical records is cumbersome, error-prone, and inconsistent. By automating extraction in conjunction with standardized terminology, the quality and consistency of data utilized for research and clinical purposes would be substantially improved. Here, we set out to develop and validate a framework to extract pertinent clinical conditions for traumatic brain injury (TBI) from computed tomography (CT) reports. METHODS: We developed tbiExtractor, which extends pyConTextNLP, a regular expression algorithm using negation detection and contextual features, to create a framework for extracting TBI common data elements from radiology reports. The algorithm inputs radiology reports and outputs a structured summary containing 27 clinical findings with their respective annotations. Development and validation of the algorithm was completed using two physician annotators as the gold standard. RESULTS: tbiExtractor displayed high sensitivity (0.92–0.94) and specificity (0.99) when compared to the gold standard. The algorithm also demonstrated a high equivalence (94.6%) with the annotators. A majority of clinical findings (85%) had minimal errors (F1 Score ≥ 0.80). When compared to annotators, tbiExtractor extracted information in significantly less time (0.3 sec vs 1.7 min per report). CONCLUSION: tbiExtractor is a validated algorithm for extraction of TBI common data elements from radiology reports. This automation reduces the time spent to extract structured data and improves the consistency of data extracted. Lastly, tbiExtractor can be used to stratify subjects into groups based on visible damage by partitioning the annotations of the pertinent clinical conditions on a radiology report.
format Online
Article
Text
id pubmed-7329124
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-73291242020-07-14 tbiExtractor: A framework for extracting traumatic brain injury common data elements from radiology reports Mahan, Margaret Rafter, Daniel Casey, Hannah Engelking, Marta Abdallah, Tessneem Truwit, Charles Oswood, Mark Samadani, Uzma PLoS One Research Article BACKGROUND: The manual extraction of valuable data from electronic medical records is cumbersome, error-prone, and inconsistent. By automating extraction in conjunction with standardized terminology, the quality and consistency of data utilized for research and clinical purposes would be substantially improved. Here, we set out to develop and validate a framework to extract pertinent clinical conditions for traumatic brain injury (TBI) from computed tomography (CT) reports. METHODS: We developed tbiExtractor, which extends pyConTextNLP, a regular expression algorithm using negation detection and contextual features, to create a framework for extracting TBI common data elements from radiology reports. The algorithm inputs radiology reports and outputs a structured summary containing 27 clinical findings with their respective annotations. Development and validation of the algorithm was completed using two physician annotators as the gold standard. RESULTS: tbiExtractor displayed high sensitivity (0.92–0.94) and specificity (0.99) when compared to the gold standard. The algorithm also demonstrated a high equivalence (94.6%) with the annotators. A majority of clinical findings (85%) had minimal errors (F1 Score ≥ 0.80). When compared to annotators, tbiExtractor extracted information in significantly less time (0.3 sec vs 1.7 min per report). CONCLUSION: tbiExtractor is a validated algorithm for extraction of TBI common data elements from radiology reports. This automation reduces the time spent to extract structured data and improves the consistency of data extracted. Lastly, tbiExtractor can be used to stratify subjects into groups based on visible damage by partitioning the annotations of the pertinent clinical conditions on a radiology report. Public Library of Science 2020-07-01 /pmc/articles/PMC7329124/ /pubmed/32609723 http://dx.doi.org/10.1371/journal.pone.0214775 Text en https://creativecommons.org/publicdomain/zero/1.0/ This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 (https://creativecommons.org/publicdomain/zero/1.0/) public domain dedication.
spellingShingle Research Article
Mahan, Margaret
Rafter, Daniel
Casey, Hannah
Engelking, Marta
Abdallah, Tessneem
Truwit, Charles
Oswood, Mark
Samadani, Uzma
tbiExtractor: A framework for extracting traumatic brain injury common data elements from radiology reports
title tbiExtractor: A framework for extracting traumatic brain injury common data elements from radiology reports
title_full tbiExtractor: A framework for extracting traumatic brain injury common data elements from radiology reports
title_fullStr tbiExtractor: A framework for extracting traumatic brain injury common data elements from radiology reports
title_full_unstemmed tbiExtractor: A framework for extracting traumatic brain injury common data elements from radiology reports
title_short tbiExtractor: A framework for extracting traumatic brain injury common data elements from radiology reports
title_sort tbiextractor: a framework for extracting traumatic brain injury common data elements from radiology reports
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7329124/
https://www.ncbi.nlm.nih.gov/pubmed/32609723
http://dx.doi.org/10.1371/journal.pone.0214775
work_keys_str_mv AT mahanmargaret tbiextractoraframeworkforextractingtraumaticbraininjurycommondataelementsfromradiologyreports
AT rafterdaniel tbiextractoraframeworkforextractingtraumaticbraininjurycommondataelementsfromradiologyreports
AT caseyhannah tbiextractoraframeworkforextractingtraumaticbraininjurycommondataelementsfromradiologyreports
AT engelkingmarta tbiextractoraframeworkforextractingtraumaticbraininjurycommondataelementsfromradiologyreports
AT abdallahtessneem tbiextractoraframeworkforextractingtraumaticbraininjurycommondataelementsfromradiologyreports
AT truwitcharles tbiextractoraframeworkforextractingtraumaticbraininjurycommondataelementsfromradiologyreports
AT oswoodmark tbiextractoraframeworkforextractingtraumaticbraininjurycommondataelementsfromradiologyreports
AT samadaniuzma tbiextractoraframeworkforextractingtraumaticbraininjurycommondataelementsfromradiologyreports