Cargando…

Deconfounded Dimension Reduction via Partial Embeddings

Dimension reduction tools preserving similarity and graph structure such as t-SNE and UMAP can capture complex biological patterns in high-dimensional data. However, these tools typically are not designed to separate effects of interest from unwanted effects due to confounders. We introduce the part...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Andrew A., Clark, Kelly, Dewey, Blake, DuVal, Anna, Pellegrini, Nicole, Nair, Govind, Jalkh, Youmna, Khalil, Samar, Zurawski, Jon, Calabresi, Peter, Reich, Daniel, Bakshi, Rohit, Shou, Haochang, Shinohara, Russell T.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9882043/
https://www.ncbi.nlm.nih.gov/pubmed/36711940
http://dx.doi.org/10.1101/2023.01.10.523448
_version_ 1784879229861625856
author Chen, Andrew A.
Clark, Kelly
Dewey, Blake
DuVal, Anna
Pellegrini, Nicole
Nair, Govind
Jalkh, Youmna
Khalil, Samar
Zurawski, Jon
Calabresi, Peter
Reich, Daniel
Bakshi, Rohit
Shou, Haochang
Shinohara, Russell T.
author_facet Chen, Andrew A.
Clark, Kelly
Dewey, Blake
DuVal, Anna
Pellegrini, Nicole
Nair, Govind
Jalkh, Youmna
Khalil, Samar
Zurawski, Jon
Calabresi, Peter
Reich, Daniel
Bakshi, Rohit
Shou, Haochang
Shinohara, Russell T.
author_sort Chen, Andrew A.
collection PubMed
description Dimension reduction tools preserving similarity and graph structure such as t-SNE and UMAP can capture complex biological patterns in high-dimensional data. However, these tools typically are not designed to separate effects of interest from unwanted effects due to confounders. We introduce the partial embedding (PARE) framework, which enables removal of confounders from any distance-based dimension reduction method. We then develop partial t-SNE and partial UMAP and apply these methods to genomic and neuroimaging data. Our results show that the PARE framework can remove batch effects in single-cell sequencing data as well as separate clinical and technical variability in neuroimaging measures. We demonstrate that the PARE framework extends dimension reduction methods to highlight biological patterns of interest while effectively removing confounding effects.
format Online
Article
Text
id pubmed-9882043
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-98820432023-01-28 Deconfounded Dimension Reduction via Partial Embeddings Chen, Andrew A. Clark, Kelly Dewey, Blake DuVal, Anna Pellegrini, Nicole Nair, Govind Jalkh, Youmna Khalil, Samar Zurawski, Jon Calabresi, Peter Reich, Daniel Bakshi, Rohit Shou, Haochang Shinohara, Russell T. bioRxiv Article Dimension reduction tools preserving similarity and graph structure such as t-SNE and UMAP can capture complex biological patterns in high-dimensional data. However, these tools typically are not designed to separate effects of interest from unwanted effects due to confounders. We introduce the partial embedding (PARE) framework, which enables removal of confounders from any distance-based dimension reduction method. We then develop partial t-SNE and partial UMAP and apply these methods to genomic and neuroimaging data. Our results show that the PARE framework can remove batch effects in single-cell sequencing data as well as separate clinical and technical variability in neuroimaging measures. We demonstrate that the PARE framework extends dimension reduction methods to highlight biological patterns of interest while effectively removing confounding effects. Cold Spring Harbor Laboratory 2023-01-11 /pmc/articles/PMC9882043/ /pubmed/36711940 http://dx.doi.org/10.1101/2023.01.10.523448 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator.
spellingShingle Article
Chen, Andrew A.
Clark, Kelly
Dewey, Blake
DuVal, Anna
Pellegrini, Nicole
Nair, Govind
Jalkh, Youmna
Khalil, Samar
Zurawski, Jon
Calabresi, Peter
Reich, Daniel
Bakshi, Rohit
Shou, Haochang
Shinohara, Russell T.
Deconfounded Dimension Reduction via Partial Embeddings
title Deconfounded Dimension Reduction via Partial Embeddings
title_full Deconfounded Dimension Reduction via Partial Embeddings
title_fullStr Deconfounded Dimension Reduction via Partial Embeddings
title_full_unstemmed Deconfounded Dimension Reduction via Partial Embeddings
title_short Deconfounded Dimension Reduction via Partial Embeddings
title_sort deconfounded dimension reduction via partial embeddings
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9882043/
https://www.ncbi.nlm.nih.gov/pubmed/36711940
http://dx.doi.org/10.1101/2023.01.10.523448
work_keys_str_mv AT chenandrewa deconfoundeddimensionreductionviapartialembeddings
AT clarkkelly deconfoundeddimensionreductionviapartialembeddings
AT deweyblake deconfoundeddimensionreductionviapartialembeddings
AT duvalanna deconfoundeddimensionreductionviapartialembeddings
AT pellegrininicole deconfoundeddimensionreductionviapartialembeddings
AT nairgovind deconfoundeddimensionreductionviapartialembeddings
AT jalkhyoumna deconfoundeddimensionreductionviapartialembeddings
AT khalilsamar deconfoundeddimensionreductionviapartialembeddings
AT zurawskijon deconfoundeddimensionreductionviapartialembeddings
AT calabresipeter deconfoundeddimensionreductionviapartialembeddings
AT reichdaniel deconfoundeddimensionreductionviapartialembeddings
AT bakshirohit deconfoundeddimensionreductionviapartialembeddings
AT shouhaochang deconfoundeddimensionreductionviapartialembeddings
AT shinohararussellt deconfoundeddimensionreductionviapartialembeddings