Cargando…

Uncovering High-dimensional Structures of Projections from Dimensionality Reduction Methods

Projections are conventional methods of dimensionality reduction for information visualization used to transform high-dimensional data into low dimensional space. If the projection method restricts the output space to two dimensions, the result is a scatter plot. The goal of this scatter plot is to...

Descripción completa

Detalles Bibliográficos
Autores principales: Thrun, Michael C., Ultsch, Alfred
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7586139/
https://www.ncbi.nlm.nih.gov/pubmed/33134096
http://dx.doi.org/10.1016/j.mex.2020.101093
_version_ 1783599935798640640
author Thrun, Michael C.
Ultsch, Alfred
author_facet Thrun, Michael C.
Ultsch, Alfred
author_sort Thrun, Michael C.
collection PubMed
description Projections are conventional methods of dimensionality reduction for information visualization used to transform high-dimensional data into low dimensional space. If the projection method restricts the output space to two dimensions, the result is a scatter plot. The goal of this scatter plot is to visualize the relative relationships between high-dimensional data points that build up distance and density-based structures. However, the Johnson–Lindenstrauss lemma states that the two-dimensional similarities in the scatter plot cannot coercively represent high-dimensional structures. Here, a simplified emergent self-organizing map uses the projected points of such a scatter plot in combination with the dataset in order to compute the generalized U-matrix. The generalized U-matrix defines the visualization of a topographic map depicting the misrepresentations of projected points with regards to a given dimensionality reduction method and the dataset. • The topographic map provides accurate information about the high-dimensional distance and density based structures of high-dimensional data if an appropriate dimensionality reduction method is selected. • The topographic map can uncover the absence of distance-based structures. • The topographic map reveals the number of clusters in a dataset as the number of valleys.
format Online
Article
Text
id pubmed-7586139
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-75861392020-10-30 Uncovering High-dimensional Structures of Projections from Dimensionality Reduction Methods Thrun, Michael C. Ultsch, Alfred MethodsX Method Article Projections are conventional methods of dimensionality reduction for information visualization used to transform high-dimensional data into low dimensional space. If the projection method restricts the output space to two dimensions, the result is a scatter plot. The goal of this scatter plot is to visualize the relative relationships between high-dimensional data points that build up distance and density-based structures. However, the Johnson–Lindenstrauss lemma states that the two-dimensional similarities in the scatter plot cannot coercively represent high-dimensional structures. Here, a simplified emergent self-organizing map uses the projected points of such a scatter plot in combination with the dataset in order to compute the generalized U-matrix. The generalized U-matrix defines the visualization of a topographic map depicting the misrepresentations of projected points with regards to a given dimensionality reduction method and the dataset. • The topographic map provides accurate information about the high-dimensional distance and density based structures of high-dimensional data if an appropriate dimensionality reduction method is selected. • The topographic map can uncover the absence of distance-based structures. • The topographic map reveals the number of clusters in a dataset as the number of valleys. Elsevier 2020-10-10 /pmc/articles/PMC7586139/ /pubmed/33134096 http://dx.doi.org/10.1016/j.mex.2020.101093 Text en © 2020 The Author(s) http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Method Article
Thrun, Michael C.
Ultsch, Alfred
Uncovering High-dimensional Structures of Projections from Dimensionality Reduction Methods
title Uncovering High-dimensional Structures of Projections from Dimensionality Reduction Methods
title_full Uncovering High-dimensional Structures of Projections from Dimensionality Reduction Methods
title_fullStr Uncovering High-dimensional Structures of Projections from Dimensionality Reduction Methods
title_full_unstemmed Uncovering High-dimensional Structures of Projections from Dimensionality Reduction Methods
title_short Uncovering High-dimensional Structures of Projections from Dimensionality Reduction Methods
title_sort uncovering high-dimensional structures of projections from dimensionality reduction methods
topic Method Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7586139/
https://www.ncbi.nlm.nih.gov/pubmed/33134096
http://dx.doi.org/10.1016/j.mex.2020.101093
work_keys_str_mv AT thrunmichaelc uncoveringhighdimensionalstructuresofprojectionsfromdimensionalityreductionmethods
AT ultschalfred uncoveringhighdimensionalstructuresofprojectionsfromdimensionalityreductionmethods