An expandable informatics framework for enhancing central cancer registries with digital pathology specimens, computational imaging tools, and advanced mining capabilities

BACKGROUND: Population-based state cancer registries are an authoritative source for cancer statistics in the United States. They routinely collect a variety of data, including patient demographics, primary tumor site, stage at diagnosis, first course of treatment, and survival, on every cancer case...

Bibliographic Details
Main authors: Foran, David J., Durbin, Eric B., Chen, Wenjin, Sadimin, Evita, Sharma, Ashish, Banerjee, Imon, Kurc, Tahsin, Li, Nan, Stroup, Antoinette M., Harris, Gerald, Gu, Annie, Schymura, Maria, Gupta, Rajarsi, Bremer, Erich, Balsamo, Joseph, DiPrima, Tammy, Wang, Feiqiao, Abousamra, Shahira, Samaras, Dimitris, Hands, Isaac, Ward, Kevin, Saltz, Joel H.
Format: Online Article Text
Language: English
Published: Elsevier 2022
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8794027/
https://www.ncbi.nlm.nih.gov/pubmed/35136672
http://dx.doi.org/10.4103/jpi.jpi_31_21
author Foran, David J.
Durbin, Eric B.
Chen, Wenjin
Sadimin, Evita
Sharma, Ashish
Banerjee, Imon
Kurc, Tahsin
Li, Nan
Stroup, Antoinette M.
Harris, Gerald
Gu, Annie
Schymura, Maria
Gupta, Rajarsi
Bremer, Erich
Balsamo, Joseph
DiPrima, Tammy
Wang, Feiqiao
Abousamra, Shahira
Samaras, Dimitris
Hands, Isaac
Ward, Kevin
Saltz, Joel H.
collection PubMed
description BACKGROUND: Population-based state cancer registries are an authoritative source for cancer statistics in the United States. They routinely collect a variety of data, including patient demographics, primary tumor site, stage at diagnosis, first course of treatment, and survival, on every cancer case that is reported across all U.S. states and territories. The goal of our project is to enrich NCI’s Surveillance, Epidemiology, and End Results (SEER) registry data with high-quality population-based biospecimen data in the form of digital pathology, machine-learning-based classifications, and quantitative histopathology imaging feature sets (referred to here as Pathomics features).
MATERIALS AND METHODS: As part of the project, the underlying informatics infrastructure was designed, tested, and implemented through close collaboration with several participating SEER registries to ensure consistency with registry processes, computational scalability, and the ability to support creation of population cohorts that span multiple sites. Using computational imaging algorithms and methods both to generate indices and to search for matches makes it possible to reduce inter- and intra-observer inconsistencies and to improve the objectivity with which large image repositories are interrogated.
RESULTS: Our team has created and continues to expand a well-curated repository of high-quality digitized pathology images corresponding to subjects whose data are routinely collected by the collaborating registries. Our team has systematically deployed and tested key visual analytic methods to facilitate automated creation of population cohorts for epidemiological studies, along with tools to support visualization of feature clusters and evaluation of whole-slide images. As part of these efforts, we are developing and optimizing advanced search and matching algorithms to facilitate automated, content-based retrieval of digitized specimens based on their underlying image features and staining characteristics.
CONCLUSION: To meet the challenges of this project, we established the analytic pipelines, methods, and workflows to support the expansion and management of a growing repository of high-quality digitized pathology and information-rich population cohorts containing objective imaging and clinical attributes. These resources facilitate studies that seek to discriminate among different subtypes of disease, stratify patient populations, and compare tumor characteristics within and across patient cohorts. We have also successfully developed a suite of tools based on a deep-learning method to perform quantitative characterizations of tumor regions, assess infiltrating lymphocyte distributions, and generate objective nuclear feature measurements. As part of these efforts, our team has implemented reliable methods that enable investigators to systematically search through large repositories to automatically retrieve digitized pathology specimens and correlated clinical data based on their computational signatures.
format Online
Article
Text
id pubmed-8794027
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-8794027 2022-02-07 J Pathol Inform Original Research Article Elsevier 2022-12-20 /pmc/articles/PMC8794027/ /pubmed/35136672 http://dx.doi.org/10.4103/jpi.jpi_31_21 Text en © 2022 Published by Elsevier Inc. on behalf of Association for Pathology Informatics.
https://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
title An expandable informatics framework for enhancing central cancer registries with digital pathology specimens, computational imaging tools, and advanced mining capabilities
topic Original Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8794027/
https://www.ncbi.nlm.nih.gov/pubmed/35136672
http://dx.doi.org/10.4103/jpi.jpi_31_21