Cargando…

Integration of Cancer Registry Data into the Text Information Extraction System: Leveraging the Structured Data Import Tool

INTRODUCTION/BACKGROUND: Cancer registries in the US collect timely and systematic data on new cancer cases, extent of disease, staging, biomarker status, treatment, survival, and mortality of cancer cases. Existing methodologies for accessing local cancer registry data for research are time-consumi...

Descripción completa

Detalles Bibliográficos
Autores principales: Linkov, Faina, Silverstein, Jonathan C., Davis, Michael, Crocker, Brenda, Hao, Degan, Schneider, Althea, Schwenk, Melissa, Winters, Sharon, Zelnis, Joyce, Lee, Adrian V., Becich, Michael J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Medknow Publications & Media Pvt Ltd 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6319041/
https://www.ncbi.nlm.nih.gov/pubmed/30662793
http://dx.doi.org/10.4103/jpi.jpi_38_18
_version_ 1783384999683162112
author Linkov, Faina
Silverstein, Jonathan C.
Davis, Michael
Crocker, Brenda
Hao, Degan
Schneider, Althea
Schwenk, Melissa
Winters, Sharon
Zelnis, Joyce
Lee, Adrian V.
Becich, Michael J.
author_facet Linkov, Faina
Silverstein, Jonathan C.
Davis, Michael
Crocker, Brenda
Hao, Degan
Schneider, Althea
Schwenk, Melissa
Winters, Sharon
Zelnis, Joyce
Lee, Adrian V.
Becich, Michael J.
author_sort Linkov, Faina
collection PubMed
description INTRODUCTION/BACKGROUND: Cancer registries in the US collect timely and systematic data on new cancer cases, extent of disease, staging, biomarker status, treatment, survival, and mortality of cancer cases. Existing methodologies for accessing local cancer registry data for research are time-consuming and often rely on the manual merging of data by staff registrars. In addition, existing registries do not provide direct access to these data nor do they routinely provide linkage to discrete electronic health record (EHR) data, reports, or imaging data. Automation of such linkage can provide an impressive data resource and make valuable data available for translational cancer research. METHODS: The UPMC Network Cancer Registry collects highly structured, longitudinal data on all reportable cancer patients, from the point of the diagnosis throughout treatment and follow-up/outcomes. Using commercial registry software, we collect data in compliance with standards governed by the North American Association of Central Cancer Registries. This standardization ensures that the data are highly structured with standard coding and collection methods, which support data exchange among central cancer registries and the Centers for Disease Control and Prevention. RESULTS: At the UPMC Hillman Cancer Center and University of Pittsburgh, we explored the feasibility of linking this well-curated, structured cancer registry data with unstructured text (i.e., pathology and radiology reports), using the Text Information Extraction System (TIES). We used the TIES platform to integrate breast cancer cases from the UPMC Network Cancer Registry system and then combine these data with other EHR data as a pilot use case that can be replicated for other cancers. CONCLUSIONS: As a result of this integration, we now have a single searchable repository of information for breast cancer patients from the UPMC registry, combined with their pathology and radiology reports. The system that we developed is easily scalable to other health systems and cancer centers.
format Online
Article
Text
id pubmed-6319041
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Medknow Publications & Media Pvt Ltd
record_format MEDLINE/PubMed
spelling pubmed-63190412019-01-18 Integration of Cancer Registry Data into the Text Information Extraction System: Leveraging the Structured Data Import Tool Linkov, Faina Silverstein, Jonathan C. Davis, Michael Crocker, Brenda Hao, Degan Schneider, Althea Schwenk, Melissa Winters, Sharon Zelnis, Joyce Lee, Adrian V. Becich, Michael J. J Pathol Inform Technical Note INTRODUCTION/BACKGROUND: Cancer registries in the US collect timely and systematic data on new cancer cases, extent of disease, staging, biomarker status, treatment, survival, and mortality of cancer cases. Existing methodologies for accessing local cancer registry data for research are time-consuming and often rely on the manual merging of data by staff registrars. In addition, existing registries do not provide direct access to these data nor do they routinely provide linkage to discrete electronic health record (EHR) data, reports, or imaging data. Automation of such linkage can provide an impressive data resource and make valuable data available for translational cancer research. METHODS: The UPMC Network Cancer Registry collects highly structured, longitudinal data on all reportable cancer patients, from the point of the diagnosis throughout treatment and follow-up/outcomes. Using commercial registry software, we collect data in compliance with standards governed by the North American Association of Central Cancer Registries. This standardization ensures that the data are highly structured with standard coding and collection methods, which support data exchange among central cancer registries and the Centers for Disease Control and Prevention. RESULTS: At the UPMC Hillman Cancer Center and University of Pittsburgh, we explored the feasibility of linking this well-curated, structured cancer registry data with unstructured text (i.e., pathology and radiology reports), using the Text Information Extraction System (TIES). We used the TIES platform to integrate breast cancer cases from the UPMC Network Cancer Registry system and then combine these data with other EHR data as a pilot use case that can be replicated for other cancers. CONCLUSIONS: As a result of this integration, we now have a single searchable repository of information for breast cancer patients from the UPMC registry, combined with their pathology and radiology reports. The system that we developed is easily scalable to other health systems and cancer centers. Medknow Publications & Media Pvt Ltd 2018-12-24 /pmc/articles/PMC6319041/ /pubmed/30662793 http://dx.doi.org/10.4103/jpi.jpi_38_18 Text en Copyright: © 2018 Journal of Pathology Informatics http://creativecommons.org/licenses/by-nc-sa/4.0 This is an open access journal, and articles are distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 License, which allows others to remix, tweak, and build upon the work non-commercially, as long as appropriate credit is given and the new creations are licensed under the identical terms.
spellingShingle Technical Note
Linkov, Faina
Silverstein, Jonathan C.
Davis, Michael
Crocker, Brenda
Hao, Degan
Schneider, Althea
Schwenk, Melissa
Winters, Sharon
Zelnis, Joyce
Lee, Adrian V.
Becich, Michael J.
Integration of Cancer Registry Data into the Text Information Extraction System: Leveraging the Structured Data Import Tool
title Integration of Cancer Registry Data into the Text Information Extraction System: Leveraging the Structured Data Import Tool
title_full Integration of Cancer Registry Data into the Text Information Extraction System: Leveraging the Structured Data Import Tool
title_fullStr Integration of Cancer Registry Data into the Text Information Extraction System: Leveraging the Structured Data Import Tool
title_full_unstemmed Integration of Cancer Registry Data into the Text Information Extraction System: Leveraging the Structured Data Import Tool
title_short Integration of Cancer Registry Data into the Text Information Extraction System: Leveraging the Structured Data Import Tool
title_sort integration of cancer registry data into the text information extraction system: leveraging the structured data import tool
topic Technical Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6319041/
https://www.ncbi.nlm.nih.gov/pubmed/30662793
http://dx.doi.org/10.4103/jpi.jpi_38_18
work_keys_str_mv AT linkovfaina integrationofcancerregistrydataintothetextinformationextractionsystemleveragingthestructureddataimporttool
AT silversteinjonathanc integrationofcancerregistrydataintothetextinformationextractionsystemleveragingthestructureddataimporttool
AT davismichael integrationofcancerregistrydataintothetextinformationextractionsystemleveragingthestructureddataimporttool
AT crockerbrenda integrationofcancerregistrydataintothetextinformationextractionsystemleveragingthestructureddataimporttool
AT haodegan integrationofcancerregistrydataintothetextinformationextractionsystemleveragingthestructureddataimporttool
AT schneideralthea integrationofcancerregistrydataintothetextinformationextractionsystemleveragingthestructureddataimporttool
AT schwenkmelissa integrationofcancerregistrydataintothetextinformationextractionsystemleveragingthestructureddataimporttool
AT winterssharon integrationofcancerregistrydataintothetextinformationextractionsystemleveragingthestructureddataimporttool
AT zelnisjoyce integrationofcancerregistrydataintothetextinformationextractionsystemleveragingthestructureddataimporttool
AT leeadrianv integrationofcancerregistrydataintothetextinformationextractionsystemleveragingthestructureddataimporttool
AT becichmichaelj integrationofcancerregistrydataintothetextinformationextractionsystemleveragingthestructureddataimporttool