Cargando…
Self-supervised deep learning for highly efficient spatial immunophenotyping
BACKGROUND: Efficient biomarker discovery and clinical translation depend on the fast and accurate analytical output from crucial technologies such as multiplex imaging. However, reliable cell classification often requires extensive annotations. Label-efficient strategies are urgently needed to reve...
Autores principales: | , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10493897/ https://www.ncbi.nlm.nih.gov/pubmed/37672979 http://dx.doi.org/10.1016/j.ebiom.2023.104769 |
_version_ | 1785104568031379456 |
---|---|
author | Zhang, Hanyun AbdulJabbar, Khalid Grunewald, Tami Akarca, Ayse U. Hagos, Yeman Sobhani, Faranak Lecat, Catherine S.Y. Patel, Dominic Lee, Lydia Rodriguez-Justo, Manuel Yong, Kwee Ledermann, Jonathan A. Le Quesne, John Hwang, E. Shelley Marafioti, Teresa Yuan, Yinyin |
author_facet | Zhang, Hanyun AbdulJabbar, Khalid Grunewald, Tami Akarca, Ayse U. Hagos, Yeman Sobhani, Faranak Lecat, Catherine S.Y. Patel, Dominic Lee, Lydia Rodriguez-Justo, Manuel Yong, Kwee Ledermann, Jonathan A. Le Quesne, John Hwang, E. Shelley Marafioti, Teresa Yuan, Yinyin |
author_sort | Zhang, Hanyun |
collection | PubMed |
description | BACKGROUND: Efficient biomarker discovery and clinical translation depend on the fast and accurate analytical output from crucial technologies such as multiplex imaging. However, reliable cell classification often requires extensive annotations. Label-efficient strategies are urgently needed to reveal diverse cell distribution and spatial interactions in large-scale multiplex datasets. METHODS: This study proposed Self-supervised Learning for Antigen Detection (SANDI) for accurate cell phenotyping while mitigating the annotation burden. The model first learns intrinsic pairwise similarities in unlabelled cell images, followed by a classification step to map learnt features to cell labels using a small set of annotated references. We acquired four multiplex immunohistochemistry datasets and one imaging mass cytometry dataset, comprising 2825 to 15,258 single-cell images to train and test the model. FINDINGS: With 1% annotations (18–114 cells), SANDI achieved weighted F1-scores ranging from 0.82 to 0.98 across the five datasets, which was comparable to the fully supervised classifier trained on 1828–11,459 annotated cells (−0.002 to −0.053 of averaged weighted F1-score, Wilcoxon rank-sum test, P = 0.31). Leveraging the immune checkpoint markers stained in ovarian cancer slides, SANDI-based cell identification reveals spatial expulsion between PD1-expressing T helper cells and T regulatory cells, suggesting an interplay between PD1 expression and T regulatory cell-mediated immunosuppression. INTERPRETATION: By striking a fine balance between minimal expert guidance and the power of deep learning to learn similarity within abundant data, SANDI presents new opportunities for efficient, large-scale learning for histology multiplex imaging data. FUNDING: This study was funded by the 10.13039/100012139Royal Marsden/ICR National Institute of Health Research Biomedical Research Centre. |
format | Online Article Text |
id | pubmed-10493897 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-104938972023-09-12 Self-supervised deep learning for highly efficient spatial immunophenotyping Zhang, Hanyun AbdulJabbar, Khalid Grunewald, Tami Akarca, Ayse U. Hagos, Yeman Sobhani, Faranak Lecat, Catherine S.Y. Patel, Dominic Lee, Lydia Rodriguez-Justo, Manuel Yong, Kwee Ledermann, Jonathan A. Le Quesne, John Hwang, E. Shelley Marafioti, Teresa Yuan, Yinyin eBioMedicine Articles BACKGROUND: Efficient biomarker discovery and clinical translation depend on the fast and accurate analytical output from crucial technologies such as multiplex imaging. However, reliable cell classification often requires extensive annotations. Label-efficient strategies are urgently needed to reveal diverse cell distribution and spatial interactions in large-scale multiplex datasets. METHODS: This study proposed Self-supervised Learning for Antigen Detection (SANDI) for accurate cell phenotyping while mitigating the annotation burden. The model first learns intrinsic pairwise similarities in unlabelled cell images, followed by a classification step to map learnt features to cell labels using a small set of annotated references. We acquired four multiplex immunohistochemistry datasets and one imaging mass cytometry dataset, comprising 2825 to 15,258 single-cell images to train and test the model. FINDINGS: With 1% annotations (18–114 cells), SANDI achieved weighted F1-scores ranging from 0.82 to 0.98 across the five datasets, which was comparable to the fully supervised classifier trained on 1828–11,459 annotated cells (−0.002 to −0.053 of averaged weighted F1-score, Wilcoxon rank-sum test, P = 0.31). Leveraging the immune checkpoint markers stained in ovarian cancer slides, SANDI-based cell identification reveals spatial expulsion between PD1-expressing T helper cells and T regulatory cells, suggesting an interplay between PD1 expression and T regulatory cell-mediated immunosuppression. INTERPRETATION: By striking a fine balance between minimal expert guidance and the power of deep learning to learn similarity within abundant data, SANDI presents new opportunities for efficient, large-scale learning for histology multiplex imaging data. FUNDING: This study was funded by the 10.13039/100012139Royal Marsden/ICR National Institute of Health Research Biomedical Research Centre. Elsevier 2023-09-04 /pmc/articles/PMC10493897/ /pubmed/37672979 http://dx.doi.org/10.1016/j.ebiom.2023.104769 Text en © 2023 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Articles Zhang, Hanyun AbdulJabbar, Khalid Grunewald, Tami Akarca, Ayse U. Hagos, Yeman Sobhani, Faranak Lecat, Catherine S.Y. Patel, Dominic Lee, Lydia Rodriguez-Justo, Manuel Yong, Kwee Ledermann, Jonathan A. Le Quesne, John Hwang, E. Shelley Marafioti, Teresa Yuan, Yinyin Self-supervised deep learning for highly efficient spatial immunophenotyping |
title | Self-supervised deep learning for highly efficient spatial immunophenotyping |
title_full | Self-supervised deep learning for highly efficient spatial immunophenotyping |
title_fullStr | Self-supervised deep learning for highly efficient spatial immunophenotyping |
title_full_unstemmed | Self-supervised deep learning for highly efficient spatial immunophenotyping |
title_short | Self-supervised deep learning for highly efficient spatial immunophenotyping |
title_sort | self-supervised deep learning for highly efficient spatial immunophenotyping |
topic | Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10493897/ https://www.ncbi.nlm.nih.gov/pubmed/37672979 http://dx.doi.org/10.1016/j.ebiom.2023.104769 |
work_keys_str_mv | AT zhanghanyun selfsuperviseddeeplearningforhighlyefficientspatialimmunophenotyping AT abduljabbarkhalid selfsuperviseddeeplearningforhighlyefficientspatialimmunophenotyping AT grunewaldtami selfsuperviseddeeplearningforhighlyefficientspatialimmunophenotyping AT akarcaayseu selfsuperviseddeeplearningforhighlyefficientspatialimmunophenotyping AT hagosyeman selfsuperviseddeeplearningforhighlyefficientspatialimmunophenotyping AT sobhanifaranak selfsuperviseddeeplearningforhighlyefficientspatialimmunophenotyping AT lecatcatherinesy selfsuperviseddeeplearningforhighlyefficientspatialimmunophenotyping AT pateldominic selfsuperviseddeeplearningforhighlyefficientspatialimmunophenotyping AT leelydia selfsuperviseddeeplearningforhighlyefficientspatialimmunophenotyping AT rodriguezjustomanuel selfsuperviseddeeplearningforhighlyefficientspatialimmunophenotyping AT yongkwee selfsuperviseddeeplearningforhighlyefficientspatialimmunophenotyping AT ledermannjonathana selfsuperviseddeeplearningforhighlyefficientspatialimmunophenotyping AT lequesnejohn selfsuperviseddeeplearningforhighlyefficientspatialimmunophenotyping AT hwangeshelley selfsuperviseddeeplearningforhighlyefficientspatialimmunophenotyping AT marafiotiteresa selfsuperviseddeeplearningforhighlyefficientspatialimmunophenotyping AT yuanyinyin selfsuperviseddeeplearningforhighlyefficientspatialimmunophenotyping |