Cargando…

Self-supervised deep learning for highly efficient spatial immunophenotyping

BACKGROUND: Efficient biomarker discovery and clinical translation depend on the fast and accurate analytical output from crucial technologies such as multiplex imaging. However, reliable cell classification often requires extensive annotations. Label-efficient strategies are urgently needed to reve...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Hanyun, AbdulJabbar, Khalid, Grunewald, Tami, Akarca, Ayse U., Hagos, Yeman, Sobhani, Faranak, Lecat, Catherine S.Y., Patel, Dominic, Lee, Lydia, Rodriguez-Justo, Manuel, Yong, Kwee, Ledermann, Jonathan A., Le Quesne, John, Hwang, E. Shelley, Marafioti, Teresa, Yuan, Yinyin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10493897/
https://www.ncbi.nlm.nih.gov/pubmed/37672979
http://dx.doi.org/10.1016/j.ebiom.2023.104769
_version_ 1785104568031379456
author Zhang, Hanyun
AbdulJabbar, Khalid
Grunewald, Tami
Akarca, Ayse U.
Hagos, Yeman
Sobhani, Faranak
Lecat, Catherine S.Y.
Patel, Dominic
Lee, Lydia
Rodriguez-Justo, Manuel
Yong, Kwee
Ledermann, Jonathan A.
Le Quesne, John
Hwang, E. Shelley
Marafioti, Teresa
Yuan, Yinyin
author_facet Zhang, Hanyun
AbdulJabbar, Khalid
Grunewald, Tami
Akarca, Ayse U.
Hagos, Yeman
Sobhani, Faranak
Lecat, Catherine S.Y.
Patel, Dominic
Lee, Lydia
Rodriguez-Justo, Manuel
Yong, Kwee
Ledermann, Jonathan A.
Le Quesne, John
Hwang, E. Shelley
Marafioti, Teresa
Yuan, Yinyin
author_sort Zhang, Hanyun
collection PubMed
description BACKGROUND: Efficient biomarker discovery and clinical translation depend on the fast and accurate analytical output from crucial technologies such as multiplex imaging. However, reliable cell classification often requires extensive annotations. Label-efficient strategies are urgently needed to reveal diverse cell distribution and spatial interactions in large-scale multiplex datasets. METHODS: This study proposed Self-supervised Learning for Antigen Detection (SANDI) for accurate cell phenotyping while mitigating the annotation burden. The model first learns intrinsic pairwise similarities in unlabelled cell images, followed by a classification step to map learnt features to cell labels using a small set of annotated references. We acquired four multiplex immunohistochemistry datasets and one imaging mass cytometry dataset, comprising 2825 to 15,258 single-cell images to train and test the model. FINDINGS: With 1% annotations (18–114 cells), SANDI achieved weighted F1-scores ranging from 0.82 to 0.98 across the five datasets, which was comparable to the fully supervised classifier trained on 1828–11,459 annotated cells (−0.002 to −0.053 of averaged weighted F1-score, Wilcoxon rank-sum test, P = 0.31). Leveraging the immune checkpoint markers stained in ovarian cancer slides, SANDI-based cell identification reveals spatial expulsion between PD1-expressing T helper cells and T regulatory cells, suggesting an interplay between PD1 expression and T regulatory cell-mediated immunosuppression. INTERPRETATION: By striking a fine balance between minimal expert guidance and the power of deep learning to learn similarity within abundant data, SANDI presents new opportunities for efficient, large-scale learning for histology multiplex imaging data. FUNDING: This study was funded by the 10.13039/100012139Royal Marsden/ICR National Institute of Health Research Biomedical Research Centre.
format Online
Article
Text
id pubmed-10493897
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-104938972023-09-12 Self-supervised deep learning for highly efficient spatial immunophenotyping Zhang, Hanyun AbdulJabbar, Khalid Grunewald, Tami Akarca, Ayse U. Hagos, Yeman Sobhani, Faranak Lecat, Catherine S.Y. Patel, Dominic Lee, Lydia Rodriguez-Justo, Manuel Yong, Kwee Ledermann, Jonathan A. Le Quesne, John Hwang, E. Shelley Marafioti, Teresa Yuan, Yinyin eBioMedicine Articles BACKGROUND: Efficient biomarker discovery and clinical translation depend on the fast and accurate analytical output from crucial technologies such as multiplex imaging. However, reliable cell classification often requires extensive annotations. Label-efficient strategies are urgently needed to reveal diverse cell distribution and spatial interactions in large-scale multiplex datasets. METHODS: This study proposed Self-supervised Learning for Antigen Detection (SANDI) for accurate cell phenotyping while mitigating the annotation burden. The model first learns intrinsic pairwise similarities in unlabelled cell images, followed by a classification step to map learnt features to cell labels using a small set of annotated references. We acquired four multiplex immunohistochemistry datasets and one imaging mass cytometry dataset, comprising 2825 to 15,258 single-cell images to train and test the model. FINDINGS: With 1% annotations (18–114 cells), SANDI achieved weighted F1-scores ranging from 0.82 to 0.98 across the five datasets, which was comparable to the fully supervised classifier trained on 1828–11,459 annotated cells (−0.002 to −0.053 of averaged weighted F1-score, Wilcoxon rank-sum test, P = 0.31). Leveraging the immune checkpoint markers stained in ovarian cancer slides, SANDI-based cell identification reveals spatial expulsion between PD1-expressing T helper cells and T regulatory cells, suggesting an interplay between PD1 expression and T regulatory cell-mediated immunosuppression. INTERPRETATION: By striking a fine balance between minimal expert guidance and the power of deep learning to learn similarity within abundant data, SANDI presents new opportunities for efficient, large-scale learning for histology multiplex imaging data. FUNDING: This study was funded by the 10.13039/100012139Royal Marsden/ICR National Institute of Health Research Biomedical Research Centre. Elsevier 2023-09-04 /pmc/articles/PMC10493897/ /pubmed/37672979 http://dx.doi.org/10.1016/j.ebiom.2023.104769 Text en © 2023 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Articles
Zhang, Hanyun
AbdulJabbar, Khalid
Grunewald, Tami
Akarca, Ayse U.
Hagos, Yeman
Sobhani, Faranak
Lecat, Catherine S.Y.
Patel, Dominic
Lee, Lydia
Rodriguez-Justo, Manuel
Yong, Kwee
Ledermann, Jonathan A.
Le Quesne, John
Hwang, E. Shelley
Marafioti, Teresa
Yuan, Yinyin
Self-supervised deep learning for highly efficient spatial immunophenotyping
title Self-supervised deep learning for highly efficient spatial immunophenotyping
title_full Self-supervised deep learning for highly efficient spatial immunophenotyping
title_fullStr Self-supervised deep learning for highly efficient spatial immunophenotyping
title_full_unstemmed Self-supervised deep learning for highly efficient spatial immunophenotyping
title_short Self-supervised deep learning for highly efficient spatial immunophenotyping
title_sort self-supervised deep learning for highly efficient spatial immunophenotyping
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10493897/
https://www.ncbi.nlm.nih.gov/pubmed/37672979
http://dx.doi.org/10.1016/j.ebiom.2023.104769
work_keys_str_mv AT zhanghanyun selfsuperviseddeeplearningforhighlyefficientspatialimmunophenotyping
AT abduljabbarkhalid selfsuperviseddeeplearningforhighlyefficientspatialimmunophenotyping
AT grunewaldtami selfsuperviseddeeplearningforhighlyefficientspatialimmunophenotyping
AT akarcaayseu selfsuperviseddeeplearningforhighlyefficientspatialimmunophenotyping
AT hagosyeman selfsuperviseddeeplearningforhighlyefficientspatialimmunophenotyping
AT sobhanifaranak selfsuperviseddeeplearningforhighlyefficientspatialimmunophenotyping
AT lecatcatherinesy selfsuperviseddeeplearningforhighlyefficientspatialimmunophenotyping
AT pateldominic selfsuperviseddeeplearningforhighlyefficientspatialimmunophenotyping
AT leelydia selfsuperviseddeeplearningforhighlyefficientspatialimmunophenotyping
AT rodriguezjustomanuel selfsuperviseddeeplearningforhighlyefficientspatialimmunophenotyping
AT yongkwee selfsuperviseddeeplearningforhighlyefficientspatialimmunophenotyping
AT ledermannjonathana selfsuperviseddeeplearningforhighlyefficientspatialimmunophenotyping
AT lequesnejohn selfsuperviseddeeplearningforhighlyefficientspatialimmunophenotyping
AT hwangeshelley selfsuperviseddeeplearningforhighlyefficientspatialimmunophenotyping
AT marafiotiteresa selfsuperviseddeeplearningforhighlyefficientspatialimmunophenotyping
AT yuanyinyin selfsuperviseddeeplearningforhighlyefficientspatialimmunophenotyping