Development of Training Materials for Pathologists to Provide Machine Learning Validation Data of Tumor-Infiltrating Lymphocytes in Breast Cancer

SIMPLE SUMMARY: The High Throughput Truthing project aims to develop a dataset of stromal tumor-infiltrating lymphocytes (sTILs) density evaluations in hematoxylin and eosin-stained invasive breast cancer specimens fit for a regulatory purpose. After completion of the pilot study, the analysis demon...

Bibliographic Details
Main Authors: Garcia, Victor; Elfer, Katherine; Peeters, Dieter J. E.; Ehinger, Anna; Werness, Bruce; Ly, Amy; Li, Xiaoxian; Hanna, Matthew G.; Blenman, Kim R. M.; Salgado, Roberto; Gallas, Brandon D.
Format: Online Article Text
Language: English
Published: MDPI 2022
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9139395/
https://www.ncbi.nlm.nih.gov/pubmed/35626070
http://dx.doi.org/10.3390/cancers14102467
_version_ 1784714849867005952
author Garcia, Victor
Elfer, Katherine
Peeters, Dieter J. E.
Ehinger, Anna
Werness, Bruce
Ly, Amy
Li, Xiaoxian
Hanna, Matthew G.
Blenman, Kim R. M.
Salgado, Roberto
Gallas, Brandon D.
author_sort Garcia, Victor
collection PubMed
description SIMPLE SUMMARY: The High Throughput Truthing project aims to develop a dataset of stromal tumor-infiltrating lymphocytes (sTILs) density evaluations in hematoxylin and eosin-stained invasive breast cancer specimens fit for a regulatory purpose. After completion of the pilot study, the analysis demonstrated inconsistencies and gaps in the training provided to pathologists. Select regions of interest (ROIs) were reviewed by an expert panel, who provided annotations and commentary on the challenges of the sTILs assessment. We used these annotations to develop a training document and reference standard for new training materials. These materials will train crowd-sourced pathologists to help create an algorithm validation dataset and contribute to sTILs evaluations in clinical practice.
ABSTRACT: The High Throughput Truthing project aims to develop a dataset for validating artificial intelligence and machine learning models (AI/ML) fit for regulatory purposes. The context of this AI/ML validation dataset is the reporting of stromal tumor-infiltrating lymphocytes (sTILs) density evaluations in hematoxylin and eosin-stained invasive breast cancer biopsy specimens. After completing the pilot study, we found notable variability in the sTILs estimates as well as inconsistencies and gaps in the training provided to pathologists. Using the pilot study data and an expert panel, we created custom training materials to improve pathologist annotation quality for the pivotal study. We categorized regions of interest (ROIs) based on their mean sTILs density and selected ROIs with the highest and lowest sTILs variability. In a series of eight one-hour sessions, the expert panel reviewed each ROI and provided verbal density estimates and comments on features that confounded the sTILs evaluation. We aggregated and shaped the comments to identify pitfalls and instructions to improve our training materials. From these selected ROIs, we created a training set and proficiency test set to improve pathologist training, with the goal of improving data collection for the pivotal study. We are not exploring AI/ML performance in this paper. Instead, we are creating materials that will train crowd-sourced pathologists to be the reference standard in a pivotal study to create an AI/ML model validation dataset. The issues discussed here are also important for clinicians to understand about the evaluation of sTILs in clinical practice and can provide insight to developers of AI/ML models.
format Online
Article
Text
id pubmed-9139395
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-9139395 2022-05-28 Development of Training Materials for Pathologists to Provide Machine Learning Validation Data of Tumor-Infiltrating Lymphocytes in Breast Cancer Garcia, Victor Elfer, Katherine Peeters, Dieter J. E. Ehinger, Anna Werness, Bruce Ly, Amy Li, Xiaoxian Hanna, Matthew G. Blenman, Kim R. M. Salgado, Roberto Gallas, Brandon D. Cancers (Basel) Article MDPI 2022-05-17 /pmc/articles/PMC9139395/ /pubmed/35626070 http://dx.doi.org/10.3390/cancers14102467 Text en © 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
title Development of Training Materials for Pathologists to Provide Machine Learning Validation Data of Tumor-Infiltrating Lymphocytes in Breast Cancer
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9139395/
https://www.ncbi.nlm.nih.gov/pubmed/35626070
http://dx.doi.org/10.3390/cancers14102467