Cargando…

Matching single cells across modalities with contrastive learning and optimal transport

Understanding the interactions between the biomolecules that govern cellular behaviors remains an emergent question in biology. Recent advances in single-cell technologies have enabled the simultaneous quantification of multiple biomolecules in the same cell, opening new avenues for understanding ce...

Descripción completa

Detalles Bibliográficos
Autores principales: Gossi, Federico, Pati, Pushpak, Chouvardas, Panagiotis, Martinelli, Adriano Luca, Kruithof-de Julio, Marianna, Rapsomaniki, Maria Anna
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10199774/
https://www.ncbi.nlm.nih.gov/pubmed/37122067
http://dx.doi.org/10.1093/bib/bbad130
_version_ 1785045002446962688
author Gossi, Federico
Pati, Pushpak
Chouvardas, Panagiotis
Martinelli, Adriano Luca
Kruithof-de Julio, Marianna
Rapsomaniki, Maria Anna
author_facet Gossi, Federico
Pati, Pushpak
Chouvardas, Panagiotis
Martinelli, Adriano Luca
Kruithof-de Julio, Marianna
Rapsomaniki, Maria Anna
author_sort Gossi, Federico
collection PubMed
description Understanding the interactions between the biomolecules that govern cellular behaviors remains an emergent question in biology. Recent advances in single-cell technologies have enabled the simultaneous quantification of multiple biomolecules in the same cell, opening new avenues for understanding cellular complexity and heterogeneity. Still, the resulting multimodal single-cell datasets present unique challenges arising from the high dimensionality and multiple sources of acquisition noise. Computational methods able to match cells across different modalities offer an appealing alternative towards this goal. In this work, we propose MatchCLOT, a novel method for modality matching inspired by recent promising developments in contrastive learning and optimal transport. MatchCLOT uses contrastive learning to learn a common representation between two modalities and applies entropic optimal transport as an approximate maximum weight bipartite matching algorithm. Our model obtains state-of-the-art performance on two curated benchmarking datasets and an independent test dataset, improving the top scoring method by 26.1% while preserving the underlying biological structure of the multimodal data. Importantly, MatchCLOT offers high gains in computational time and memory that, in contrast to existing methods, allows it to scale well with the number of cells. As single-cell datasets become increasingly large, MatchCLOT offers an accurate and efficient solution to the problem of modality matching.
format Online
Article
Text
id pubmed-10199774
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-101997742023-05-21 Matching single cells across modalities with contrastive learning and optimal transport Gossi, Federico Pati, Pushpak Chouvardas, Panagiotis Martinelli, Adriano Luca Kruithof-de Julio, Marianna Rapsomaniki, Maria Anna Brief Bioinform Problem Solving Protocol Understanding the interactions between the biomolecules that govern cellular behaviors remains an emergent question in biology. Recent advances in single-cell technologies have enabled the simultaneous quantification of multiple biomolecules in the same cell, opening new avenues for understanding cellular complexity and heterogeneity. Still, the resulting multimodal single-cell datasets present unique challenges arising from the high dimensionality and multiple sources of acquisition noise. Computational methods able to match cells across different modalities offer an appealing alternative towards this goal. In this work, we propose MatchCLOT, a novel method for modality matching inspired by recent promising developments in contrastive learning and optimal transport. MatchCLOT uses contrastive learning to learn a common representation between two modalities and applies entropic optimal transport as an approximate maximum weight bipartite matching algorithm. Our model obtains state-of-the-art performance on two curated benchmarking datasets and an independent test dataset, improving the top scoring method by 26.1% while preserving the underlying biological structure of the multimodal data. Importantly, MatchCLOT offers high gains in computational time and memory that, in contrast to existing methods, allows it to scale well with the number of cells. As single-cell datasets become increasingly large, MatchCLOT offers an accurate and efficient solution to the problem of modality matching. Oxford University Press 2023-04-29 /pmc/articles/PMC10199774/ /pubmed/37122067 http://dx.doi.org/10.1093/bib/bbad130 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Problem Solving Protocol
Gossi, Federico
Pati, Pushpak
Chouvardas, Panagiotis
Martinelli, Adriano Luca
Kruithof-de Julio, Marianna
Rapsomaniki, Maria Anna
Matching single cells across modalities with contrastive learning and optimal transport
title Matching single cells across modalities with contrastive learning and optimal transport
title_full Matching single cells across modalities with contrastive learning and optimal transport
title_fullStr Matching single cells across modalities with contrastive learning and optimal transport
title_full_unstemmed Matching single cells across modalities with contrastive learning and optimal transport
title_short Matching single cells across modalities with contrastive learning and optimal transport
title_sort matching single cells across modalities with contrastive learning and optimal transport
topic Problem Solving Protocol
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10199774/
https://www.ncbi.nlm.nih.gov/pubmed/37122067
http://dx.doi.org/10.1093/bib/bbad130
work_keys_str_mv AT gossifederico matchingsinglecellsacrossmodalitieswithcontrastivelearningandoptimaltransport
AT patipushpak matchingsinglecellsacrossmodalitieswithcontrastivelearningandoptimaltransport
AT chouvardaspanagiotis matchingsinglecellsacrossmodalitieswithcontrastivelearningandoptimaltransport
AT martinelliadrianoluca matchingsinglecellsacrossmodalitieswithcontrastivelearningandoptimaltransport
AT kruithofdejuliomarianna matchingsinglecellsacrossmodalitieswithcontrastivelearningandoptimaltransport
AT rapsomanikimariaanna matchingsinglecellsacrossmodalitieswithcontrastivelearningandoptimaltransport