Cargando…

Electronic case report forms generation from pathology reports by ARGO, automatic record generator for onco-hematology

The unstructured nature of Real-World (RW) data from onco-hematological patients and the scarce accessibility to integrated systems restrain the use of RW information for research purposes. Natural Language Processing (NLP) might help in transposing unstructured reports into standardized electronic...

Descripción completa

Detalles Bibliográficos
Autores principales: Zaccaria, Gian Maria, Colella, Vito, Colucci, Simona, Clemente, Felice, Pavone, Fabio, Vegliante, Maria Carmela, Esposito, Flavia, Opinto, Giuseppina, Scattone, Anna, Loseto, Giacomo, Minoia, Carla, Rossini, Bernardo, Quinto, Angela Maria, Angiulli, Vito, Grieco, Luigi Alfredo, Fama, Angelo, Ferrero, Simone, Moia, Riccardo, Di Rocco, Alice, Quaglia, Francesca Maria, Tabanelli, Valentina, Guarini, Attilio, Ciavarella, Sabino
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8664934/
https://www.ncbi.nlm.nih.gov/pubmed/34893665
http://dx.doi.org/10.1038/s41598-021-03204-z
_version_ 1784613944596365312
author Zaccaria, Gian Maria
Colella, Vito
Colucci, Simona
Clemente, Felice
Pavone, Fabio
Vegliante, Maria Carmela
Esposito, Flavia
Opinto, Giuseppina
Scattone, Anna
Loseto, Giacomo
Minoia, Carla
Rossini, Bernardo
Quinto, Angela Maria
Angiulli, Vito
Grieco, Luigi Alfredo
Fama, Angelo
Ferrero, Simone
Moia, Riccardo
Di Rocco, Alice
Quaglia, Francesca Maria
Tabanelli, Valentina
Guarini, Attilio
Ciavarella, Sabino
author_facet Zaccaria, Gian Maria
Colella, Vito
Colucci, Simona
Clemente, Felice
Pavone, Fabio
Vegliante, Maria Carmela
Esposito, Flavia
Opinto, Giuseppina
Scattone, Anna
Loseto, Giacomo
Minoia, Carla
Rossini, Bernardo
Quinto, Angela Maria
Angiulli, Vito
Grieco, Luigi Alfredo
Fama, Angelo
Ferrero, Simone
Moia, Riccardo
Di Rocco, Alice
Quaglia, Francesca Maria
Tabanelli, Valentina
Guarini, Attilio
Ciavarella, Sabino
author_sort Zaccaria, Gian Maria
collection PubMed
description The unstructured nature of Real-World (RW) data from onco-hematological patients and the scarce accessibility to integrated systems restrain the use of RW information for research purposes. Natural Language Processing (NLP) might help in transposing unstructured reports into standardized electronic health records. We exploited NLP to develop an automated tool, named ARGO (Automatic Record Generator for Onco-hematology) to recognize information from pathology reports and populate electronic case report forms (eCRFs) pre-implemented by REDCap. ARGO was applied to hemo-lymphopathology reports of diffuse large B-cell, follicular, and mantle cell lymphomas, and assessed for accuracy (A), precision (P), recall (R) and F1-score (F) on internal (n = 239) and external (n = 93) report series. 326 (98.2%) reports were converted into corresponding eCRFs. Overall, ARGO showed high performance in capturing (1) identification report number (all metrics > 90%), (2) biopsy date (all metrics > 90% in both series), (3) specimen type (86.6% and 91.4% of A, 98.5% and 100.0% of P, 92.5% and 95.5% of F, and 87.2% and 91.4% of R for internal and external series, respectively), (4) diagnosis (100% of P with A, R and F of 90% in both series). We developed and validated a generalizable tool that generates structured eCRFs from real-life pathology reports.
format Online
Article
Text
id pubmed-8664934
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-86649342021-12-15 Electronic case report forms generation from pathology reports by ARGO, automatic record generator for onco-hematology Zaccaria, Gian Maria Colella, Vito Colucci, Simona Clemente, Felice Pavone, Fabio Vegliante, Maria Carmela Esposito, Flavia Opinto, Giuseppina Scattone, Anna Loseto, Giacomo Minoia, Carla Rossini, Bernardo Quinto, Angela Maria Angiulli, Vito Grieco, Luigi Alfredo Fama, Angelo Ferrero, Simone Moia, Riccardo Di Rocco, Alice Quaglia, Francesca Maria Tabanelli, Valentina Guarini, Attilio Ciavarella, Sabino Sci Rep Article The unstructured nature of Real-World (RW) data from onco-hematological patients and the scarce accessibility to integrated systems restrain the use of RW information for research purposes. Natural Language Processing (NLP) might help in transposing unstructured reports into standardized electronic health records. We exploited NLP to develop an automated tool, named ARGO (Automatic Record Generator for Onco-hematology) to recognize information from pathology reports and populate electronic case report forms (eCRFs) pre-implemented by REDCap. ARGO was applied to hemo-lymphopathology reports of diffuse large B-cell, follicular, and mantle cell lymphomas, and assessed for accuracy (A), precision (P), recall (R) and F1-score (F) on internal (n = 239) and external (n = 93) report series. 326 (98.2%) reports were converted into corresponding eCRFs. Overall, ARGO showed high performance in capturing (1) identification report number (all metrics > 90%), (2) biopsy date (all metrics > 90% in both series), (3) specimen type (86.6% and 91.4% of A, 98.5% and 100.0% of P, 92.5% and 95.5% of F, and 87.2% and 91.4% of R for internal and external series, respectively), (4) diagnosis (100% of P with A, R and F of 90% in both series). We developed and validated a generalizable tool that generates structured eCRFs from real-life pathology reports. Nature Publishing Group UK 2021-12-10 /pmc/articles/PMC8664934/ /pubmed/34893665 http://dx.doi.org/10.1038/s41598-021-03204-z Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Zaccaria, Gian Maria
Colella, Vito
Colucci, Simona
Clemente, Felice
Pavone, Fabio
Vegliante, Maria Carmela
Esposito, Flavia
Opinto, Giuseppina
Scattone, Anna
Loseto, Giacomo
Minoia, Carla
Rossini, Bernardo
Quinto, Angela Maria
Angiulli, Vito
Grieco, Luigi Alfredo
Fama, Angelo
Ferrero, Simone
Moia, Riccardo
Di Rocco, Alice
Quaglia, Francesca Maria
Tabanelli, Valentina
Guarini, Attilio
Ciavarella, Sabino
Electronic case report forms generation from pathology reports by ARGO, automatic record generator for onco-hematology
title Electronic case report forms generation from pathology reports by ARGO, automatic record generator for onco-hematology
title_full Electronic case report forms generation from pathology reports by ARGO, automatic record generator for onco-hematology
title_fullStr Electronic case report forms generation from pathology reports by ARGO, automatic record generator for onco-hematology
title_full_unstemmed Electronic case report forms generation from pathology reports by ARGO, automatic record generator for onco-hematology
title_short Electronic case report forms generation from pathology reports by ARGO, automatic record generator for onco-hematology
title_sort electronic case report forms generation from pathology reports by argo, automatic record generator for onco-hematology
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8664934/
https://www.ncbi.nlm.nih.gov/pubmed/34893665
http://dx.doi.org/10.1038/s41598-021-03204-z
work_keys_str_mv AT zaccariagianmaria electroniccasereportformsgenerationfrompathologyreportsbyargoautomaticrecordgeneratorforoncohematology
AT colellavito electroniccasereportformsgenerationfrompathologyreportsbyargoautomaticrecordgeneratorforoncohematology
AT coluccisimona electroniccasereportformsgenerationfrompathologyreportsbyargoautomaticrecordgeneratorforoncohematology
AT clementefelice electroniccasereportformsgenerationfrompathologyreportsbyargoautomaticrecordgeneratorforoncohematology
AT pavonefabio electroniccasereportformsgenerationfrompathologyreportsbyargoautomaticrecordgeneratorforoncohematology
AT vegliantemariacarmela electroniccasereportformsgenerationfrompathologyreportsbyargoautomaticrecordgeneratorforoncohematology
AT espositoflavia electroniccasereportformsgenerationfrompathologyreportsbyargoautomaticrecordgeneratorforoncohematology
AT opintogiuseppina electroniccasereportformsgenerationfrompathologyreportsbyargoautomaticrecordgeneratorforoncohematology
AT scattoneanna electroniccasereportformsgenerationfrompathologyreportsbyargoautomaticrecordgeneratorforoncohematology
AT losetogiacomo electroniccasereportformsgenerationfrompathologyreportsbyargoautomaticrecordgeneratorforoncohematology
AT minoiacarla electroniccasereportformsgenerationfrompathologyreportsbyargoautomaticrecordgeneratorforoncohematology
AT rossinibernardo electroniccasereportformsgenerationfrompathologyreportsbyargoautomaticrecordgeneratorforoncohematology
AT quintoangelamaria electroniccasereportformsgenerationfrompathologyreportsbyargoautomaticrecordgeneratorforoncohematology
AT angiullivito electroniccasereportformsgenerationfrompathologyreportsbyargoautomaticrecordgeneratorforoncohematology
AT griecoluigialfredo electroniccasereportformsgenerationfrompathologyreportsbyargoautomaticrecordgeneratorforoncohematology
AT famaangelo electroniccasereportformsgenerationfrompathologyreportsbyargoautomaticrecordgeneratorforoncohematology
AT ferrerosimone electroniccasereportformsgenerationfrompathologyreportsbyargoautomaticrecordgeneratorforoncohematology
AT moiariccardo electroniccasereportformsgenerationfrompathologyreportsbyargoautomaticrecordgeneratorforoncohematology
AT diroccoalice electroniccasereportformsgenerationfrompathologyreportsbyargoautomaticrecordgeneratorforoncohematology
AT quagliafrancescamaria electroniccasereportformsgenerationfrompathologyreportsbyargoautomaticrecordgeneratorforoncohematology
AT tabanellivalentina electroniccasereportformsgenerationfrompathologyreportsbyargoautomaticrecordgeneratorforoncohematology
AT guariniattilio electroniccasereportformsgenerationfrompathologyreportsbyargoautomaticrecordgeneratorforoncohematology
AT ciavarellasabino electroniccasereportformsgenerationfrompathologyreportsbyargoautomaticrecordgeneratorforoncohematology