Cargando…

Towards a comprehensive evaluation of dimension reduction methods for transcriptomic data visualization

Dimension reduction (DR) algorithms project data from high dimensions to lower dimensions to enable visualization of interesting high-dimensional structure. DR algorithms are widely used for analysis of single-cell transcriptomic data. Despite widespread use of DR algorithms such as t-SNE and UMAP,...

Descripción completa

Detalles Bibliográficos
Autores principales: Huang, Haiyang, Wang, Yingfan, Rudin, Cynthia, Browne, Edward P.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9296444/
https://www.ncbi.nlm.nih.gov/pubmed/35853932
http://dx.doi.org/10.1038/s42003-022-03628-x
_version_ 1784750275018358784
author Huang, Haiyang
Wang, Yingfan
Rudin, Cynthia
Browne, Edward P.
author_facet Huang, Haiyang
Wang, Yingfan
Rudin, Cynthia
Browne, Edward P.
author_sort Huang, Haiyang
collection PubMed
description Dimension reduction (DR) algorithms project data from high dimensions to lower dimensions to enable visualization of interesting high-dimensional structure. DR algorithms are widely used for analysis of single-cell transcriptomic data. Despite widespread use of DR algorithms such as t-SNE and UMAP, these algorithms have characteristics that lead to lack of trust: they do not preserve important aspects of high-dimensional structure and are sensitive to arbitrary user choices. Given the importance of gaining insights from DR, DR methods should be evaluated carefully before trusting their results. In this paper, we introduce and perform a systematic evaluation of popular DR methods, including t-SNE, art-SNE, UMAP, PaCMAP, TriMap and ForceAtlas2. Our evaluation considers five components: preservation of local structure, preservation of global structure, sensitivity to parameter choices, sensitivity to preprocessing choices, and computational efficiency. This evaluation can help us to choose DR tools that align with the scientific goals of the user.
format Online
Article
Text
id pubmed-9296444
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-92964442022-07-21 Towards a comprehensive evaluation of dimension reduction methods for transcriptomic data visualization Huang, Haiyang Wang, Yingfan Rudin, Cynthia Browne, Edward P. Commun Biol Article Dimension reduction (DR) algorithms project data from high dimensions to lower dimensions to enable visualization of interesting high-dimensional structure. DR algorithms are widely used for analysis of single-cell transcriptomic data. Despite widespread use of DR algorithms such as t-SNE and UMAP, these algorithms have characteristics that lead to lack of trust: they do not preserve important aspects of high-dimensional structure and are sensitive to arbitrary user choices. Given the importance of gaining insights from DR, DR methods should be evaluated carefully before trusting their results. In this paper, we introduce and perform a systematic evaluation of popular DR methods, including t-SNE, art-SNE, UMAP, PaCMAP, TriMap and ForceAtlas2. Our evaluation considers five components: preservation of local structure, preservation of global structure, sensitivity to parameter choices, sensitivity to preprocessing choices, and computational efficiency. This evaluation can help us to choose DR tools that align with the scientific goals of the user. Nature Publishing Group UK 2022-07-19 /pmc/articles/PMC9296444/ /pubmed/35853932 http://dx.doi.org/10.1038/s42003-022-03628-x Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Huang, Haiyang
Wang, Yingfan
Rudin, Cynthia
Browne, Edward P.
Towards a comprehensive evaluation of dimension reduction methods for transcriptomic data visualization
title Towards a comprehensive evaluation of dimension reduction methods for transcriptomic data visualization
title_full Towards a comprehensive evaluation of dimension reduction methods for transcriptomic data visualization
title_fullStr Towards a comprehensive evaluation of dimension reduction methods for transcriptomic data visualization
title_full_unstemmed Towards a comprehensive evaluation of dimension reduction methods for transcriptomic data visualization
title_short Towards a comprehensive evaluation of dimension reduction methods for transcriptomic data visualization
title_sort towards a comprehensive evaluation of dimension reduction methods for transcriptomic data visualization
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9296444/
https://www.ncbi.nlm.nih.gov/pubmed/35853932
http://dx.doi.org/10.1038/s42003-022-03628-x
work_keys_str_mv AT huanghaiyang towardsacomprehensiveevaluationofdimensionreductionmethodsfortranscriptomicdatavisualization
AT wangyingfan towardsacomprehensiveevaluationofdimensionreductionmethodsfortranscriptomicdatavisualization
AT rudincynthia towardsacomprehensiveevaluationofdimensionreductionmethodsfortranscriptomicdatavisualization
AT browneedwardp towardsacomprehensiveevaluationofdimensionreductionmethodsfortranscriptomicdatavisualization