Cargando…
Uncovering High-dimensional Structures of Projections from Dimensionality Reduction Methods
Projections are conventional methods of dimensionality reduction for information visualization used to transform high-dimensional data into low dimensional space. If the projection method restricts the output space to two dimensions, the result is a scatter plot. The goal of this scatter plot is to...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7586139/ https://www.ncbi.nlm.nih.gov/pubmed/33134096 http://dx.doi.org/10.1016/j.mex.2020.101093 |
_version_ | 1783599935798640640 |
---|---|
author | Thrun, Michael C. Ultsch, Alfred |
author_facet | Thrun, Michael C. Ultsch, Alfred |
author_sort | Thrun, Michael C. |
collection | PubMed |
description | Projections are conventional methods of dimensionality reduction for information visualization used to transform high-dimensional data into low dimensional space. If the projection method restricts the output space to two dimensions, the result is a scatter plot. The goal of this scatter plot is to visualize the relative relationships between high-dimensional data points that build up distance and density-based structures. However, the Johnson–Lindenstrauss lemma states that the two-dimensional similarities in the scatter plot cannot coercively represent high-dimensional structures. Here, a simplified emergent self-organizing map uses the projected points of such a scatter plot in combination with the dataset in order to compute the generalized U-matrix. The generalized U-matrix defines the visualization of a topographic map depicting the misrepresentations of projected points with regards to a given dimensionality reduction method and the dataset. • The topographic map provides accurate information about the high-dimensional distance and density based structures of high-dimensional data if an appropriate dimensionality reduction method is selected. • The topographic map can uncover the absence of distance-based structures. • The topographic map reveals the number of clusters in a dataset as the number of valleys. |
format | Online Article Text |
id | pubmed-7586139 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-75861392020-10-30 Uncovering High-dimensional Structures of Projections from Dimensionality Reduction Methods Thrun, Michael C. Ultsch, Alfred MethodsX Method Article Projections are conventional methods of dimensionality reduction for information visualization used to transform high-dimensional data into low dimensional space. If the projection method restricts the output space to two dimensions, the result is a scatter plot. The goal of this scatter plot is to visualize the relative relationships between high-dimensional data points that build up distance and density-based structures. However, the Johnson–Lindenstrauss lemma states that the two-dimensional similarities in the scatter plot cannot coercively represent high-dimensional structures. Here, a simplified emergent self-organizing map uses the projected points of such a scatter plot in combination with the dataset in order to compute the generalized U-matrix. The generalized U-matrix defines the visualization of a topographic map depicting the misrepresentations of projected points with regards to a given dimensionality reduction method and the dataset. • The topographic map provides accurate information about the high-dimensional distance and density based structures of high-dimensional data if an appropriate dimensionality reduction method is selected. • The topographic map can uncover the absence of distance-based structures. • The topographic map reveals the number of clusters in a dataset as the number of valleys. Elsevier 2020-10-10 /pmc/articles/PMC7586139/ /pubmed/33134096 http://dx.doi.org/10.1016/j.mex.2020.101093 Text en © 2020 The Author(s) http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Method Article Thrun, Michael C. Ultsch, Alfred Uncovering High-dimensional Structures of Projections from Dimensionality Reduction Methods |
title | Uncovering High-dimensional Structures of Projections from Dimensionality Reduction Methods |
title_full | Uncovering High-dimensional Structures of Projections from Dimensionality Reduction Methods |
title_fullStr | Uncovering High-dimensional Structures of Projections from Dimensionality Reduction Methods |
title_full_unstemmed | Uncovering High-dimensional Structures of Projections from Dimensionality Reduction Methods |
title_short | Uncovering High-dimensional Structures of Projections from Dimensionality Reduction Methods |
title_sort | uncovering high-dimensional structures of projections from dimensionality reduction methods |
topic | Method Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7586139/ https://www.ncbi.nlm.nih.gov/pubmed/33134096 http://dx.doi.org/10.1016/j.mex.2020.101093 |
work_keys_str_mv | AT thrunmichaelc uncoveringhighdimensionalstructuresofprojectionsfromdimensionalityreductionmethods AT ultschalfred uncoveringhighdimensionalstructuresofprojectionsfromdimensionalityreductionmethods |