Cargando…

GEODESIC SINKHORN FOR FAST AND ACCURATE OPTIMAL TRANSPORT ON MANIFOLDS

Efficient computation of optimal transport distance between distributions is of growing importance in data science. Sinkhorn-based methods are currently the state-of-the-art for such computations, but require [Formula: see text] computations. In addition, Sinkhorn-based methods commonly use an Eucli...

Descripción completa

Detalles Bibliográficos
Autores principales: Huguet, Guillaume, Tong, Alexander, Zapatero, María Ramos, Tape, Christopher J., Wolf, Guy, Krishnaswamy, Smita
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cornell University 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10557786/
https://www.ncbi.nlm.nih.gov/pubmed/37808090
_version_ 1785117149617979392
author Huguet, Guillaume
Tong, Alexander
Zapatero, María Ramos
Tape, Christopher J.
Wolf, Guy
Krishnaswamy, Smita
author_facet Huguet, Guillaume
Tong, Alexander
Zapatero, María Ramos
Tape, Christopher J.
Wolf, Guy
Krishnaswamy, Smita
author_sort Huguet, Guillaume
collection PubMed
description Efficient computation of optimal transport distance between distributions is of growing importance in data science. Sinkhorn-based methods are currently the state-of-the-art for such computations, but require [Formula: see text] computations. In addition, Sinkhorn-based methods commonly use an Euclidean ground distance between datapoints. However, with the prevalence of manifold structured scientific data, it is often desirable to consider geodesic ground distance. Here, we tackle both issues by proposing Geodesic Sinkhorn—based on diffusing a heat kernel on a manifold graph. Notably, Geodesic Sinkhorn requires only [Formula: see text] computation, as we approximate the heat kernel with Chebyshev polynomials based on the sparse graph Laplacian. We apply our method to the computation of barycenters of several distributions of high dimensional single cell data from patient samples undergoing chemotherapy. In particular, we define the barycentric distance as the distance between two such barycenters. Using this definition, we identify an optimal transport distance and path associated with the effect of treatment on cellular data.
format Online
Article
Text
id pubmed-10557786
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cornell University
record_format MEDLINE/PubMed
spelling pubmed-105577862023-10-07 GEODESIC SINKHORN FOR FAST AND ACCURATE OPTIMAL TRANSPORT ON MANIFOLDS Huguet, Guillaume Tong, Alexander Zapatero, María Ramos Tape, Christopher J. Wolf, Guy Krishnaswamy, Smita ArXiv Article Efficient computation of optimal transport distance between distributions is of growing importance in data science. Sinkhorn-based methods are currently the state-of-the-art for such computations, but require [Formula: see text] computations. In addition, Sinkhorn-based methods commonly use an Euclidean ground distance between datapoints. However, with the prevalence of manifold structured scientific data, it is often desirable to consider geodesic ground distance. Here, we tackle both issues by proposing Geodesic Sinkhorn—based on diffusing a heat kernel on a manifold graph. Notably, Geodesic Sinkhorn requires only [Formula: see text] computation, as we approximate the heat kernel with Chebyshev polynomials based on the sparse graph Laplacian. We apply our method to the computation of barycenters of several distributions of high dimensional single cell data from patient samples undergoing chemotherapy. In particular, we define the barycentric distance as the distance between two such barycenters. Using this definition, we identify an optimal transport distance and path associated with the effect of treatment on cellular data. Cornell University 2023-09-26 /pmc/articles/PMC10557786/ /pubmed/37808090 Text en https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use.
spellingShingle Article
Huguet, Guillaume
Tong, Alexander
Zapatero, María Ramos
Tape, Christopher J.
Wolf, Guy
Krishnaswamy, Smita
GEODESIC SINKHORN FOR FAST AND ACCURATE OPTIMAL TRANSPORT ON MANIFOLDS
title GEODESIC SINKHORN FOR FAST AND ACCURATE OPTIMAL TRANSPORT ON MANIFOLDS
title_full GEODESIC SINKHORN FOR FAST AND ACCURATE OPTIMAL TRANSPORT ON MANIFOLDS
title_fullStr GEODESIC SINKHORN FOR FAST AND ACCURATE OPTIMAL TRANSPORT ON MANIFOLDS
title_full_unstemmed GEODESIC SINKHORN FOR FAST AND ACCURATE OPTIMAL TRANSPORT ON MANIFOLDS
title_short GEODESIC SINKHORN FOR FAST AND ACCURATE OPTIMAL TRANSPORT ON MANIFOLDS
title_sort geodesic sinkhorn for fast and accurate optimal transport on manifolds
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10557786/
https://www.ncbi.nlm.nih.gov/pubmed/37808090
work_keys_str_mv AT huguetguillaume geodesicsinkhornforfastandaccurateoptimaltransportonmanifolds
AT tongalexander geodesicsinkhornforfastandaccurateoptimaltransportonmanifolds
AT zapateromariaramos geodesicsinkhornforfastandaccurateoptimaltransportonmanifolds
AT tapechristopherj geodesicsinkhornforfastandaccurateoptimaltransportonmanifolds
AT wolfguy geodesicsinkhornforfastandaccurateoptimaltransportonmanifolds
AT krishnaswamysmita geodesicsinkhornforfastandaccurateoptimaltransportonmanifolds