Cargando…
Development of Training Materials for Pathologists to Provide Machine Learning Validation Data of Tumor-Infiltrating Lymphocytes in Breast Cancer
SIMPLE SUMMARY: The High Throughput Truthing project aims to develop a dataset of stromal tumor-infiltrating lymphocytes (sTILs) density evaluations in hematoxylin and eosin-stained invasive breast cancer specimens fit for a regulatory purpose. After completion of the pilot study, the analysis demon...
Autores principales: | , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9139395/ https://www.ncbi.nlm.nih.gov/pubmed/35626070 http://dx.doi.org/10.3390/cancers14102467 |
_version_ | 1784714849867005952 |
---|---|
author | Garcia, Victor Elfer, Katherine Peeters, Dieter J. E. Ehinger, Anna Werness, Bruce Ly, Amy Li, Xiaoxian Hanna, Matthew G. Blenman, Kim R. M. Salgado, Roberto Gallas, Brandon D. |
author_facet | Garcia, Victor Elfer, Katherine Peeters, Dieter J. E. Ehinger, Anna Werness, Bruce Ly, Amy Li, Xiaoxian Hanna, Matthew G. Blenman, Kim R. M. Salgado, Roberto Gallas, Brandon D. |
author_sort | Garcia, Victor |
collection | PubMed |
description | SIMPLE SUMMARY: The High Throughput Truthing project aims to develop a dataset of stromal tumor-infiltrating lymphocytes (sTILs) density evaluations in hematoxylin and eosin-stained invasive breast cancer specimens fit for a regulatory purpose. After completion of the pilot study, the analysis demonstrated inconsistencies and gaps in the provided training to pathologists. Select regions of interest (ROIs) were reviewed by an expert panel, who provided annotations and commentary on the challenges of the sTILs assessment. We used these annotations to develop a training document and reference standard for new training materials. These materials will train crowd-sourced pathologists to help create an algorithm validation dataset and contribute to sTILs evaluations in clinical practice. ABSTRACT: The High Throughput Truthing project aims to develop a dataset for validating artificial intelligence and machine learning models (AI/ML) fit for regulatory purposes. The context of this AI/ML validation dataset is the reporting of stromal tumor-infiltrating lymphocytes (sTILs) density evaluations in hematoxylin and eosin-stained invasive breast cancer biopsy specimens. After completing the pilot study, we found notable variability in the sTILs estimates as well as inconsistencies and gaps in the provided training to pathologists. Using the pilot study data and an expert panel, we created custom training materials to improve pathologist annotation quality for the pivotal study. We categorized regions of interest (ROIs) based on their mean sTILs density and selected ROIs with the highest and lowest sTILs variability. In a series of eight one-hour sessions, the expert panel reviewed each ROI and provided verbal density estimates and comments on features that confounded the sTILs evaluation. We aggregated and shaped the comments to identify pitfalls and instructions to improve our training materials. From these selected ROIs, we created a training set and proficiency test set to improve pathologist training with the goal to improve data collection for the pivotal study. We are not exploring AI/ML performance in this paper. Instead, we are creating materials that will train crowd-sourced pathologists to be the reference standard in a pivotal study to create an AI/ML model validation dataset. The issues discussed here are also important for clinicians to understand about the evaluation of sTILs in clinical practice and can provide insight to developers of AI/ML models. |
format | Online Article Text |
id | pubmed-9139395 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-91393952022-05-28 Development of Training Materials for Pathologists to Provide Machine Learning Validation Data of Tumor-Infiltrating Lymphocytes in Breast Cancer Garcia, Victor Elfer, Katherine Peeters, Dieter J. E. Ehinger, Anna Werness, Bruce Ly, Amy Li, Xiaoxian Hanna, Matthew G. Blenman, Kim R. M. Salgado, Roberto Gallas, Brandon D. Cancers (Basel) Article SIMPLE SUMMARY: The High Throughput Truthing project aims to develop a dataset of stromal tumor-infiltrating lymphocytes (sTILs) density evaluations in hematoxylin and eosin-stained invasive breast cancer specimens fit for a regulatory purpose. After completion of the pilot study, the analysis demonstrated inconsistencies and gaps in the provided training to pathologists. Select regions of interest (ROIs) were reviewed by an expert panel, who provided annotations and commentary on the challenges of the sTILs assessment. We used these annotations to develop a training document and reference standard for new training materials. These materials will train crowd-sourced pathologists to help create an algorithm validation dataset and contribute to sTILs evaluations in clinical practice. ABSTRACT: The High Throughput Truthing project aims to develop a dataset for validating artificial intelligence and machine learning models (AI/ML) fit for regulatory purposes. The context of this AI/ML validation dataset is the reporting of stromal tumor-infiltrating lymphocytes (sTILs) density evaluations in hematoxylin and eosin-stained invasive breast cancer biopsy specimens. After completing the pilot study, we found notable variability in the sTILs estimates as well as inconsistencies and gaps in the provided training to pathologists. Using the pilot study data and an expert panel, we created custom training materials to improve pathologist annotation quality for the pivotal study. We categorized regions of interest (ROIs) based on their mean sTILs density and selected ROIs with the highest and lowest sTILs variability. In a series of eight one-hour sessions, the expert panel reviewed each ROI and provided verbal density estimates and comments on features that confounded the sTILs evaluation. We aggregated and shaped the comments to identify pitfalls and instructions to improve our training materials. From these selected ROIs, we created a training set and proficiency test set to improve pathologist training with the goal to improve data collection for the pivotal study. We are not exploring AI/ML performance in this paper. Instead, we are creating materials that will train crowd-sourced pathologists to be the reference standard in a pivotal study to create an AI/ML model validation dataset. The issues discussed here are also important for clinicians to understand about the evaluation of sTILs in clinical practice and can provide insight to developers of AI/ML models. MDPI 2022-05-17 /pmc/articles/PMC9139395/ /pubmed/35626070 http://dx.doi.org/10.3390/cancers14102467 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Garcia, Victor Elfer, Katherine Peeters, Dieter J. E. Ehinger, Anna Werness, Bruce Ly, Amy Li, Xiaoxian Hanna, Matthew G. Blenman, Kim R. M. Salgado, Roberto Gallas, Brandon D. Development of Training Materials for Pathologists to Provide Machine Learning Validation Data of Tumor-Infiltrating Lymphocytes in Breast Cancer |
title | Development of Training Materials for Pathologists to Provide Machine Learning Validation Data of Tumor-Infiltrating Lymphocytes in Breast Cancer |
title_full | Development of Training Materials for Pathologists to Provide Machine Learning Validation Data of Tumor-Infiltrating Lymphocytes in Breast Cancer |
title_fullStr | Development of Training Materials for Pathologists to Provide Machine Learning Validation Data of Tumor-Infiltrating Lymphocytes in Breast Cancer |
title_full_unstemmed | Development of Training Materials for Pathologists to Provide Machine Learning Validation Data of Tumor-Infiltrating Lymphocytes in Breast Cancer |
title_short | Development of Training Materials for Pathologists to Provide Machine Learning Validation Data of Tumor-Infiltrating Lymphocytes in Breast Cancer |
title_sort | development of training materials for pathologists to provide machine learning validation data of tumor-infiltrating lymphocytes in breast cancer |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9139395/ https://www.ncbi.nlm.nih.gov/pubmed/35626070 http://dx.doi.org/10.3390/cancers14102467 |
work_keys_str_mv | AT garciavictor developmentoftrainingmaterialsforpathologiststoprovidemachinelearningvalidationdataoftumorinfiltratinglymphocytesinbreastcancer AT elferkatherine developmentoftrainingmaterialsforpathologiststoprovidemachinelearningvalidationdataoftumorinfiltratinglymphocytesinbreastcancer AT peetersdieterje developmentoftrainingmaterialsforpathologiststoprovidemachinelearningvalidationdataoftumorinfiltratinglymphocytesinbreastcancer AT ehingeranna developmentoftrainingmaterialsforpathologiststoprovidemachinelearningvalidationdataoftumorinfiltratinglymphocytesinbreastcancer AT wernessbruce developmentoftrainingmaterialsforpathologiststoprovidemachinelearningvalidationdataoftumorinfiltratinglymphocytesinbreastcancer AT lyamy developmentoftrainingmaterialsforpathologiststoprovidemachinelearningvalidationdataoftumorinfiltratinglymphocytesinbreastcancer AT lixiaoxian developmentoftrainingmaterialsforpathologiststoprovidemachinelearningvalidationdataoftumorinfiltratinglymphocytesinbreastcancer AT hannamatthewg developmentoftrainingmaterialsforpathologiststoprovidemachinelearningvalidationdataoftumorinfiltratinglymphocytesinbreastcancer AT blenmankimrm developmentoftrainingmaterialsforpathologiststoprovidemachinelearningvalidationdataoftumorinfiltratinglymphocytesinbreastcancer AT salgadoroberto developmentoftrainingmaterialsforpathologiststoprovidemachinelearningvalidationdataoftumorinfiltratinglymphocytesinbreastcancer AT gallasbrandond developmentoftrainingmaterialsforpathologiststoprovidemachinelearningvalidationdataoftumorinfiltratinglymphocytesinbreastcancer |