Cargando…
Electronic case report forms generation from pathology reports by ARGO, automatic record generator for onco-hematology
The unstructured nature of Real-World (RW) data from onco-hematological patients and the scarce accessibility to integrated systems restrain the use of RW information for research purposes. Natural Language Processing (NLP) might help in transposing unstructured reports into standardized electronic...
Autores principales: | , , , , , , , , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8664934/ https://www.ncbi.nlm.nih.gov/pubmed/34893665 http://dx.doi.org/10.1038/s41598-021-03204-z |
_version_ | 1784613944596365312 |
---|---|
author | Zaccaria, Gian Maria Colella, Vito Colucci, Simona Clemente, Felice Pavone, Fabio Vegliante, Maria Carmela Esposito, Flavia Opinto, Giuseppina Scattone, Anna Loseto, Giacomo Minoia, Carla Rossini, Bernardo Quinto, Angela Maria Angiulli, Vito Grieco, Luigi Alfredo Fama, Angelo Ferrero, Simone Moia, Riccardo Di Rocco, Alice Quaglia, Francesca Maria Tabanelli, Valentina Guarini, Attilio Ciavarella, Sabino |
author_facet | Zaccaria, Gian Maria Colella, Vito Colucci, Simona Clemente, Felice Pavone, Fabio Vegliante, Maria Carmela Esposito, Flavia Opinto, Giuseppina Scattone, Anna Loseto, Giacomo Minoia, Carla Rossini, Bernardo Quinto, Angela Maria Angiulli, Vito Grieco, Luigi Alfredo Fama, Angelo Ferrero, Simone Moia, Riccardo Di Rocco, Alice Quaglia, Francesca Maria Tabanelli, Valentina Guarini, Attilio Ciavarella, Sabino |
author_sort | Zaccaria, Gian Maria |
collection | PubMed |
description | The unstructured nature of Real-World (RW) data from onco-hematological patients and the scarce accessibility to integrated systems restrain the use of RW information for research purposes. Natural Language Processing (NLP) might help in transposing unstructured reports into standardized electronic health records. We exploited NLP to develop an automated tool, named ARGO (Automatic Record Generator for Onco-hematology) to recognize information from pathology reports and populate electronic case report forms (eCRFs) pre-implemented by REDCap. ARGO was applied to hemo-lymphopathology reports of diffuse large B-cell, follicular, and mantle cell lymphomas, and assessed for accuracy (A), precision (P), recall (R) and F1-score (F) on internal (n = 239) and external (n = 93) report series. 326 (98.2%) reports were converted into corresponding eCRFs. Overall, ARGO showed high performance in capturing (1) identification report number (all metrics > 90%), (2) biopsy date (all metrics > 90% in both series), (3) specimen type (86.6% and 91.4% of A, 98.5% and 100.0% of P, 92.5% and 95.5% of F, and 87.2% and 91.4% of R for internal and external series, respectively), (4) diagnosis (100% of P with A, R and F of 90% in both series). We developed and validated a generalizable tool that generates structured eCRFs from real-life pathology reports. |
format | Online Article Text |
id | pubmed-8664934 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-86649342021-12-15 Electronic case report forms generation from pathology reports by ARGO, automatic record generator for onco-hematology Zaccaria, Gian Maria Colella, Vito Colucci, Simona Clemente, Felice Pavone, Fabio Vegliante, Maria Carmela Esposito, Flavia Opinto, Giuseppina Scattone, Anna Loseto, Giacomo Minoia, Carla Rossini, Bernardo Quinto, Angela Maria Angiulli, Vito Grieco, Luigi Alfredo Fama, Angelo Ferrero, Simone Moia, Riccardo Di Rocco, Alice Quaglia, Francesca Maria Tabanelli, Valentina Guarini, Attilio Ciavarella, Sabino Sci Rep Article The unstructured nature of Real-World (RW) data from onco-hematological patients and the scarce accessibility to integrated systems restrain the use of RW information for research purposes. Natural Language Processing (NLP) might help in transposing unstructured reports into standardized electronic health records. We exploited NLP to develop an automated tool, named ARGO (Automatic Record Generator for Onco-hematology) to recognize information from pathology reports and populate electronic case report forms (eCRFs) pre-implemented by REDCap. ARGO was applied to hemo-lymphopathology reports of diffuse large B-cell, follicular, and mantle cell lymphomas, and assessed for accuracy (A), precision (P), recall (R) and F1-score (F) on internal (n = 239) and external (n = 93) report series. 326 (98.2%) reports were converted into corresponding eCRFs. Overall, ARGO showed high performance in capturing (1) identification report number (all metrics > 90%), (2) biopsy date (all metrics > 90% in both series), (3) specimen type (86.6% and 91.4% of A, 98.5% and 100.0% of P, 92.5% and 95.5% of F, and 87.2% and 91.4% of R for internal and external series, respectively), (4) diagnosis (100% of P with A, R and F of 90% in both series). We developed and validated a generalizable tool that generates structured eCRFs from real-life pathology reports. Nature Publishing Group UK 2021-12-10 /pmc/articles/PMC8664934/ /pubmed/34893665 http://dx.doi.org/10.1038/s41598-021-03204-z Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Zaccaria, Gian Maria Colella, Vito Colucci, Simona Clemente, Felice Pavone, Fabio Vegliante, Maria Carmela Esposito, Flavia Opinto, Giuseppina Scattone, Anna Loseto, Giacomo Minoia, Carla Rossini, Bernardo Quinto, Angela Maria Angiulli, Vito Grieco, Luigi Alfredo Fama, Angelo Ferrero, Simone Moia, Riccardo Di Rocco, Alice Quaglia, Francesca Maria Tabanelli, Valentina Guarini, Attilio Ciavarella, Sabino Electronic case report forms generation from pathology reports by ARGO, automatic record generator for onco-hematology |
title | Electronic case report forms generation from pathology reports by ARGO, automatic record generator for onco-hematology |
title_full | Electronic case report forms generation from pathology reports by ARGO, automatic record generator for onco-hematology |
title_fullStr | Electronic case report forms generation from pathology reports by ARGO, automatic record generator for onco-hematology |
title_full_unstemmed | Electronic case report forms generation from pathology reports by ARGO, automatic record generator for onco-hematology |
title_short | Electronic case report forms generation from pathology reports by ARGO, automatic record generator for onco-hematology |
title_sort | electronic case report forms generation from pathology reports by argo, automatic record generator for onco-hematology |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8664934/ https://www.ncbi.nlm.nih.gov/pubmed/34893665 http://dx.doi.org/10.1038/s41598-021-03204-z |
work_keys_str_mv | AT zaccariagianmaria electroniccasereportformsgenerationfrompathologyreportsbyargoautomaticrecordgeneratorforoncohematology AT colellavito electroniccasereportformsgenerationfrompathologyreportsbyargoautomaticrecordgeneratorforoncohematology AT coluccisimona electroniccasereportformsgenerationfrompathologyreportsbyargoautomaticrecordgeneratorforoncohematology AT clementefelice electroniccasereportformsgenerationfrompathologyreportsbyargoautomaticrecordgeneratorforoncohematology AT pavonefabio electroniccasereportformsgenerationfrompathologyreportsbyargoautomaticrecordgeneratorforoncohematology AT vegliantemariacarmela electroniccasereportformsgenerationfrompathologyreportsbyargoautomaticrecordgeneratorforoncohematology AT espositoflavia electroniccasereportformsgenerationfrompathologyreportsbyargoautomaticrecordgeneratorforoncohematology AT opintogiuseppina electroniccasereportformsgenerationfrompathologyreportsbyargoautomaticrecordgeneratorforoncohematology AT scattoneanna electroniccasereportformsgenerationfrompathologyreportsbyargoautomaticrecordgeneratorforoncohematology AT losetogiacomo electroniccasereportformsgenerationfrompathologyreportsbyargoautomaticrecordgeneratorforoncohematology AT minoiacarla electroniccasereportformsgenerationfrompathologyreportsbyargoautomaticrecordgeneratorforoncohematology AT rossinibernardo electroniccasereportformsgenerationfrompathologyreportsbyargoautomaticrecordgeneratorforoncohematology AT quintoangelamaria electroniccasereportformsgenerationfrompathologyreportsbyargoautomaticrecordgeneratorforoncohematology AT angiullivito electroniccasereportformsgenerationfrompathologyreportsbyargoautomaticrecordgeneratorforoncohematology AT griecoluigialfredo electroniccasereportformsgenerationfrompathologyreportsbyargoautomaticrecordgeneratorforoncohematology AT famaangelo electroniccasereportformsgenerationfrompathologyreportsbyargoautomaticrecordgeneratorforoncohematology AT ferrerosimone electroniccasereportformsgenerationfrompathologyreportsbyargoautomaticrecordgeneratorforoncohematology AT moiariccardo electroniccasereportformsgenerationfrompathologyreportsbyargoautomaticrecordgeneratorforoncohematology AT diroccoalice electroniccasereportformsgenerationfrompathologyreportsbyargoautomaticrecordgeneratorforoncohematology AT quagliafrancescamaria electroniccasereportformsgenerationfrompathologyreportsbyargoautomaticrecordgeneratorforoncohematology AT tabanellivalentina electroniccasereportformsgenerationfrompathologyreportsbyargoautomaticrecordgeneratorforoncohematology AT guariniattilio electroniccasereportformsgenerationfrompathologyreportsbyargoautomaticrecordgeneratorforoncohematology AT ciavarellasabino electroniccasereportformsgenerationfrompathologyreportsbyargoautomaticrecordgeneratorforoncohematology |