Cargando…
GEODESIC SINKHORN FOR FAST AND ACCURATE OPTIMAL TRANSPORT ON MANIFOLDS
Efficient computation of optimal transport distance between distributions is of growing importance in data science. Sinkhorn-based methods are currently the state-of-the-art for such computations, but require [Formula: see text] computations. In addition, Sinkhorn-based methods commonly use an Eucli...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cornell University
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10557786/ https://www.ncbi.nlm.nih.gov/pubmed/37808090 |
_version_ | 1785117149617979392 |
---|---|
author | Huguet, Guillaume Tong, Alexander Zapatero, María Ramos Tape, Christopher J. Wolf, Guy Krishnaswamy, Smita |
author_facet | Huguet, Guillaume Tong, Alexander Zapatero, María Ramos Tape, Christopher J. Wolf, Guy Krishnaswamy, Smita |
author_sort | Huguet, Guillaume |
collection | PubMed |
description | Efficient computation of optimal transport distance between distributions is of growing importance in data science. Sinkhorn-based methods are currently the state-of-the-art for such computations, but require [Formula: see text] computations. In addition, Sinkhorn-based methods commonly use an Euclidean ground distance between datapoints. However, with the prevalence of manifold structured scientific data, it is often desirable to consider geodesic ground distance. Here, we tackle both issues by proposing Geodesic Sinkhorn—based on diffusing a heat kernel on a manifold graph. Notably, Geodesic Sinkhorn requires only [Formula: see text] computation, as we approximate the heat kernel with Chebyshev polynomials based on the sparse graph Laplacian. We apply our method to the computation of barycenters of several distributions of high dimensional single cell data from patient samples undergoing chemotherapy. In particular, we define the barycentric distance as the distance between two such barycenters. Using this definition, we identify an optimal transport distance and path associated with the effect of treatment on cellular data. |
format | Online Article Text |
id | pubmed-10557786 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Cornell University |
record_format | MEDLINE/PubMed |
spelling | pubmed-105577862023-10-07 GEODESIC SINKHORN FOR FAST AND ACCURATE OPTIMAL TRANSPORT ON MANIFOLDS Huguet, Guillaume Tong, Alexander Zapatero, María Ramos Tape, Christopher J. Wolf, Guy Krishnaswamy, Smita ArXiv Article Efficient computation of optimal transport distance between distributions is of growing importance in data science. Sinkhorn-based methods are currently the state-of-the-art for such computations, but require [Formula: see text] computations. In addition, Sinkhorn-based methods commonly use an Euclidean ground distance between datapoints. However, with the prevalence of manifold structured scientific data, it is often desirable to consider geodesic ground distance. Here, we tackle both issues by proposing Geodesic Sinkhorn—based on diffusing a heat kernel on a manifold graph. Notably, Geodesic Sinkhorn requires only [Formula: see text] computation, as we approximate the heat kernel with Chebyshev polynomials based on the sparse graph Laplacian. We apply our method to the computation of barycenters of several distributions of high dimensional single cell data from patient samples undergoing chemotherapy. In particular, we define the barycentric distance as the distance between two such barycenters. Using this definition, we identify an optimal transport distance and path associated with the effect of treatment on cellular data. Cornell University 2023-09-26 /pmc/articles/PMC10557786/ /pubmed/37808090 Text en https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use. |
spellingShingle | Article Huguet, Guillaume Tong, Alexander Zapatero, María Ramos Tape, Christopher J. Wolf, Guy Krishnaswamy, Smita GEODESIC SINKHORN FOR FAST AND ACCURATE OPTIMAL TRANSPORT ON MANIFOLDS |
title | GEODESIC SINKHORN FOR FAST AND ACCURATE OPTIMAL TRANSPORT ON MANIFOLDS |
title_full | GEODESIC SINKHORN FOR FAST AND ACCURATE OPTIMAL TRANSPORT ON MANIFOLDS |
title_fullStr | GEODESIC SINKHORN FOR FAST AND ACCURATE OPTIMAL TRANSPORT ON MANIFOLDS |
title_full_unstemmed | GEODESIC SINKHORN FOR FAST AND ACCURATE OPTIMAL TRANSPORT ON MANIFOLDS |
title_short | GEODESIC SINKHORN FOR FAST AND ACCURATE OPTIMAL TRANSPORT ON MANIFOLDS |
title_sort | geodesic sinkhorn for fast and accurate optimal transport on manifolds |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10557786/ https://www.ncbi.nlm.nih.gov/pubmed/37808090 |
work_keys_str_mv | AT huguetguillaume geodesicsinkhornforfastandaccurateoptimaltransportonmanifolds AT tongalexander geodesicsinkhornforfastandaccurateoptimaltransportonmanifolds AT zapateromariaramos geodesicsinkhornforfastandaccurateoptimaltransportonmanifolds AT tapechristopherj geodesicsinkhornforfastandaccurateoptimaltransportonmanifolds AT wolfguy geodesicsinkhornforfastandaccurateoptimaltransportonmanifolds AT krishnaswamysmita geodesicsinkhornforfastandaccurateoptimaltransportonmanifolds |