Cargando…
Matching single cells across modalities with contrastive learning and optimal transport
Understanding the interactions between the biomolecules that govern cellular behaviors remains an emergent question in biology. Recent advances in single-cell technologies have enabled the simultaneous quantification of multiple biomolecules in the same cell, opening new avenues for understanding ce...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10199774/ https://www.ncbi.nlm.nih.gov/pubmed/37122067 http://dx.doi.org/10.1093/bib/bbad130 |
_version_ | 1785045002446962688 |
---|---|
author | Gossi, Federico Pati, Pushpak Chouvardas, Panagiotis Martinelli, Adriano Luca Kruithof-de Julio, Marianna Rapsomaniki, Maria Anna |
author_facet | Gossi, Federico Pati, Pushpak Chouvardas, Panagiotis Martinelli, Adriano Luca Kruithof-de Julio, Marianna Rapsomaniki, Maria Anna |
author_sort | Gossi, Federico |
collection | PubMed |
description | Understanding the interactions between the biomolecules that govern cellular behaviors remains an emergent question in biology. Recent advances in single-cell technologies have enabled the simultaneous quantification of multiple biomolecules in the same cell, opening new avenues for understanding cellular complexity and heterogeneity. Still, the resulting multimodal single-cell datasets present unique challenges arising from the high dimensionality and multiple sources of acquisition noise. Computational methods able to match cells across different modalities offer an appealing alternative towards this goal. In this work, we propose MatchCLOT, a novel method for modality matching inspired by recent promising developments in contrastive learning and optimal transport. MatchCLOT uses contrastive learning to learn a common representation between two modalities and applies entropic optimal transport as an approximate maximum weight bipartite matching algorithm. Our model obtains state-of-the-art performance on two curated benchmarking datasets and an independent test dataset, improving the top scoring method by 26.1% while preserving the underlying biological structure of the multimodal data. Importantly, MatchCLOT offers high gains in computational time and memory that, in contrast to existing methods, allows it to scale well with the number of cells. As single-cell datasets become increasingly large, MatchCLOT offers an accurate and efficient solution to the problem of modality matching. |
format | Online Article Text |
id | pubmed-10199774 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-101997742023-05-21 Matching single cells across modalities with contrastive learning and optimal transport Gossi, Federico Pati, Pushpak Chouvardas, Panagiotis Martinelli, Adriano Luca Kruithof-de Julio, Marianna Rapsomaniki, Maria Anna Brief Bioinform Problem Solving Protocol Understanding the interactions between the biomolecules that govern cellular behaviors remains an emergent question in biology. Recent advances in single-cell technologies have enabled the simultaneous quantification of multiple biomolecules in the same cell, opening new avenues for understanding cellular complexity and heterogeneity. Still, the resulting multimodal single-cell datasets present unique challenges arising from the high dimensionality and multiple sources of acquisition noise. Computational methods able to match cells across different modalities offer an appealing alternative towards this goal. In this work, we propose MatchCLOT, a novel method for modality matching inspired by recent promising developments in contrastive learning and optimal transport. MatchCLOT uses contrastive learning to learn a common representation between two modalities and applies entropic optimal transport as an approximate maximum weight bipartite matching algorithm. Our model obtains state-of-the-art performance on two curated benchmarking datasets and an independent test dataset, improving the top scoring method by 26.1% while preserving the underlying biological structure of the multimodal data. Importantly, MatchCLOT offers high gains in computational time and memory that, in contrast to existing methods, allows it to scale well with the number of cells. As single-cell datasets become increasingly large, MatchCLOT offers an accurate and efficient solution to the problem of modality matching. Oxford University Press 2023-04-29 /pmc/articles/PMC10199774/ /pubmed/37122067 http://dx.doi.org/10.1093/bib/bbad130 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Problem Solving Protocol Gossi, Federico Pati, Pushpak Chouvardas, Panagiotis Martinelli, Adriano Luca Kruithof-de Julio, Marianna Rapsomaniki, Maria Anna Matching single cells across modalities with contrastive learning and optimal transport |
title | Matching single cells across modalities with contrastive learning and optimal transport |
title_full | Matching single cells across modalities with contrastive learning and optimal transport |
title_fullStr | Matching single cells across modalities with contrastive learning and optimal transport |
title_full_unstemmed | Matching single cells across modalities with contrastive learning and optimal transport |
title_short | Matching single cells across modalities with contrastive learning and optimal transport |
title_sort | matching single cells across modalities with contrastive learning and optimal transport |
topic | Problem Solving Protocol |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10199774/ https://www.ncbi.nlm.nih.gov/pubmed/37122067 http://dx.doi.org/10.1093/bib/bbad130 |
work_keys_str_mv | AT gossifederico matchingsinglecellsacrossmodalitieswithcontrastivelearningandoptimaltransport AT patipushpak matchingsinglecellsacrossmodalitieswithcontrastivelearningandoptimaltransport AT chouvardaspanagiotis matchingsinglecellsacrossmodalitieswithcontrastivelearningandoptimaltransport AT martinelliadrianoluca matchingsinglecellsacrossmodalitieswithcontrastivelearningandoptimaltransport AT kruithofdejuliomarianna matchingsinglecellsacrossmodalitieswithcontrastivelearningandoptimaltransport AT rapsomanikimariaanna matchingsinglecellsacrossmodalitieswithcontrastivelearningandoptimaltransport |