Cargando…

Stain normalization gives greater generalizability than stain jittering in neural network training for the classification of coeliac disease in duodenal biopsy whole slide images

Around 1% of the population of the UK and North America have a diagnosis of coeliac disease (CD), due to a damaging immune response to the small intestine. Assessing whether a patient has CD relies primarily on the examination of a duodenal biopsy, an unavoidably subjective process with poor inter-o...

Descripción completa

Detalles Bibliográficos
Autores principales: Schreiber, B.A., Denholm, J., Gilbey, J.D., Schönlieb, C.-B., Soilleux, E.J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10416012/
https://www.ncbi.nlm.nih.gov/pubmed/37577172
http://dx.doi.org/10.1016/j.jpi.2023.100324
Descripción
Sumario:Around 1% of the population of the UK and North America have a diagnosis of coeliac disease (CD), due to a damaging immune response to the small intestine. Assessing whether a patient has CD relies primarily on the examination of a duodenal biopsy, an unavoidably subjective process with poor inter-observer concordance. Wei et al. [11] developed a neural network-based method for diagnosing CD using a dataset of duodenal biopsy whole slide images (WSIs). As all training and validation data came from one source, there was no guarantee that their results would generalize to WSIs obtained from different scanners and laboratories. In this study, the effects of applying stain normalization and jittering to the training data were compared. We trained a deep neural network on 331 WSIs obtained with a Ventana scanner (WSIs; CD: [Formula: see text]; normal: [Formula: see text]) to classify presence of CD. In order to test the effects of stain processing when validating on WSIs scanned on varying scanners and from varying laboratories, the neural network was validated on 4 datasets: WSIs of slides scanned on a Ventana scanner (WSIs; CD: [Formula: see text]; normal: [Formula: see text]), WSIs of the same slides rescanned on a Hamamatsu scanner (WSIs; CD: [Formula: see text]; normal: [Formula: see text]), WSIs of the same slides rescanned on an Aperio scanner (WSIs; CD: [Formula: see text]; normal: [Formula: see text]), and WSIs of different slides scanned on an Aperio scanner (WSIs; CD: [Formula: see text]; normal: [Formula: see text]). Without stain processing, the F1 scores of the neural network were [Formula: see text] , [Formula: see text] , [Formula: see text] (,) and [Formula: see text] when validating on the Ventana validation WSIs, Hamamatsu and Aperio rescans of the Ventana validation WSIs, and Aperio WSIs from a different source respectively. With stain normalization, the performance of the neural network improved significantly with respective F1 scores [Formula: see text] , [Formula: see text] , [Formula: see text] , and [Formula: see text]. Stain jittering resulted in a better performance than stain normalization when validating on data from the same source F1 score [Formula: see text] , but resulted in poorer performance than stain normalization when validating on WSIs from different scanners (F1 scores [Formula: see text] , [Formula: see text] (,) and [Formula: see text]). This study shows the importance of stain processing, in particular stain normalization, when training machine learning models on duodenal biopsy WSIs to ensure generalizability between different scanners and laboratories.