Cargando…
Deconfounded Dimension Reduction via Partial Embeddings
Dimension reduction tools preserving similarity and graph structure such as t-SNE and UMAP can capture complex biological patterns in high-dimensional data. However, these tools typically are not designed to separate effects of interest from unwanted effects due to confounders. We introduce the part...
Autores principales: | , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9882043/ https://www.ncbi.nlm.nih.gov/pubmed/36711940 http://dx.doi.org/10.1101/2023.01.10.523448 |
_version_ | 1784879229861625856 |
---|---|
author | Chen, Andrew A. Clark, Kelly Dewey, Blake DuVal, Anna Pellegrini, Nicole Nair, Govind Jalkh, Youmna Khalil, Samar Zurawski, Jon Calabresi, Peter Reich, Daniel Bakshi, Rohit Shou, Haochang Shinohara, Russell T. |
author_facet | Chen, Andrew A. Clark, Kelly Dewey, Blake DuVal, Anna Pellegrini, Nicole Nair, Govind Jalkh, Youmna Khalil, Samar Zurawski, Jon Calabresi, Peter Reich, Daniel Bakshi, Rohit Shou, Haochang Shinohara, Russell T. |
author_sort | Chen, Andrew A. |
collection | PubMed |
description | Dimension reduction tools preserving similarity and graph structure such as t-SNE and UMAP can capture complex biological patterns in high-dimensional data. However, these tools typically are not designed to separate effects of interest from unwanted effects due to confounders. We introduce the partial embedding (PARE) framework, which enables removal of confounders from any distance-based dimension reduction method. We then develop partial t-SNE and partial UMAP and apply these methods to genomic and neuroimaging data. Our results show that the PARE framework can remove batch effects in single-cell sequencing data as well as separate clinical and technical variability in neuroimaging measures. We demonstrate that the PARE framework extends dimension reduction methods to highlight biological patterns of interest while effectively removing confounding effects. |
format | Online Article Text |
id | pubmed-9882043 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Cold Spring Harbor Laboratory |
record_format | MEDLINE/PubMed |
spelling | pubmed-98820432023-01-28 Deconfounded Dimension Reduction via Partial Embeddings Chen, Andrew A. Clark, Kelly Dewey, Blake DuVal, Anna Pellegrini, Nicole Nair, Govind Jalkh, Youmna Khalil, Samar Zurawski, Jon Calabresi, Peter Reich, Daniel Bakshi, Rohit Shou, Haochang Shinohara, Russell T. bioRxiv Article Dimension reduction tools preserving similarity and graph structure such as t-SNE and UMAP can capture complex biological patterns in high-dimensional data. However, these tools typically are not designed to separate effects of interest from unwanted effects due to confounders. We introduce the partial embedding (PARE) framework, which enables removal of confounders from any distance-based dimension reduction method. We then develop partial t-SNE and partial UMAP and apply these methods to genomic and neuroimaging data. Our results show that the PARE framework can remove batch effects in single-cell sequencing data as well as separate clinical and technical variability in neuroimaging measures. We demonstrate that the PARE framework extends dimension reduction methods to highlight biological patterns of interest while effectively removing confounding effects. Cold Spring Harbor Laboratory 2023-01-11 /pmc/articles/PMC9882043/ /pubmed/36711940 http://dx.doi.org/10.1101/2023.01.10.523448 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator. |
spellingShingle | Article Chen, Andrew A. Clark, Kelly Dewey, Blake DuVal, Anna Pellegrini, Nicole Nair, Govind Jalkh, Youmna Khalil, Samar Zurawski, Jon Calabresi, Peter Reich, Daniel Bakshi, Rohit Shou, Haochang Shinohara, Russell T. Deconfounded Dimension Reduction via Partial Embeddings |
title | Deconfounded Dimension Reduction via Partial Embeddings |
title_full | Deconfounded Dimension Reduction via Partial Embeddings |
title_fullStr | Deconfounded Dimension Reduction via Partial Embeddings |
title_full_unstemmed | Deconfounded Dimension Reduction via Partial Embeddings |
title_short | Deconfounded Dimension Reduction via Partial Embeddings |
title_sort | deconfounded dimension reduction via partial embeddings |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9882043/ https://www.ncbi.nlm.nih.gov/pubmed/36711940 http://dx.doi.org/10.1101/2023.01.10.523448 |
work_keys_str_mv | AT chenandrewa deconfoundeddimensionreductionviapartialembeddings AT clarkkelly deconfoundeddimensionreductionviapartialembeddings AT deweyblake deconfoundeddimensionreductionviapartialembeddings AT duvalanna deconfoundeddimensionreductionviapartialembeddings AT pellegrininicole deconfoundeddimensionreductionviapartialembeddings AT nairgovind deconfoundeddimensionreductionviapartialembeddings AT jalkhyoumna deconfoundeddimensionreductionviapartialembeddings AT khalilsamar deconfoundeddimensionreductionviapartialembeddings AT zurawskijon deconfoundeddimensionreductionviapartialembeddings AT calabresipeter deconfoundeddimensionreductionviapartialembeddings AT reichdaniel deconfoundeddimensionreductionviapartialembeddings AT bakshirohit deconfoundeddimensionreductionviapartialembeddings AT shouhaochang deconfoundeddimensionreductionviapartialembeddings AT shinohararussellt deconfoundeddimensionreductionviapartialembeddings |