Cargando…

A spectral method for assessing and combining multiple data visualizations

Dimension reduction is an indispensable part of modern data science, and many algorithms have been developed. However, different algorithms have their own strengths and weaknesses, making it important to evaluate their relative performance, and to leverage and combine their individual strengths. Thi...

Descripción completa

Detalles Bibliográficos
Autores principales: Ma, Rong, Sun, Eric D., Zou, James
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9922271/
https://www.ncbi.nlm.nih.gov/pubmed/36774377
http://dx.doi.org/10.1038/s41467-023-36492-2
_version_ 1784887508047233024
author Ma, Rong
Sun, Eric D.
Zou, James
author_facet Ma, Rong
Sun, Eric D.
Zou, James
author_sort Ma, Rong
collection PubMed
description Dimension reduction is an indispensable part of modern data science, and many algorithms have been developed. However, different algorithms have their own strengths and weaknesses, making it important to evaluate their relative performance, and to leverage and combine their individual strengths. This paper proposes a spectral method for assessing and combining multiple visualizations of a given dataset produced by diverse algorithms. The proposed method provides a quantitative measure – the visualization eigenscore – of the relative performance of the visualizations for preserving the structure around each data point. It also generates a consensus visualization, having improved quality over individual visualizations in capturing the underlying structure. Our approach is flexible and works as a wrapper around any visualizations. We analyze multiple real-world datasets to demonstrate the effectiveness of the method. We also provide theoretical justifications based on a general statistical framework, yielding several fundamental principles along with practical guidance.
format Online
Article
Text
id pubmed-9922271
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-99222712023-02-13 A spectral method for assessing and combining multiple data visualizations Ma, Rong Sun, Eric D. Zou, James Nat Commun Article Dimension reduction is an indispensable part of modern data science, and many algorithms have been developed. However, different algorithms have their own strengths and weaknesses, making it important to evaluate their relative performance, and to leverage and combine their individual strengths. This paper proposes a spectral method for assessing and combining multiple visualizations of a given dataset produced by diverse algorithms. The proposed method provides a quantitative measure – the visualization eigenscore – of the relative performance of the visualizations for preserving the structure around each data point. It also generates a consensus visualization, having improved quality over individual visualizations in capturing the underlying structure. Our approach is flexible and works as a wrapper around any visualizations. We analyze multiple real-world datasets to demonstrate the effectiveness of the method. We also provide theoretical justifications based on a general statistical framework, yielding several fundamental principles along with practical guidance. Nature Publishing Group UK 2023-02-11 /pmc/articles/PMC9922271/ /pubmed/36774377 http://dx.doi.org/10.1038/s41467-023-36492-2 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Ma, Rong
Sun, Eric D.
Zou, James
A spectral method for assessing and combining multiple data visualizations
title A spectral method for assessing and combining multiple data visualizations
title_full A spectral method for assessing and combining multiple data visualizations
title_fullStr A spectral method for assessing and combining multiple data visualizations
title_full_unstemmed A spectral method for assessing and combining multiple data visualizations
title_short A spectral method for assessing and combining multiple data visualizations
title_sort spectral method for assessing and combining multiple data visualizations
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9922271/
https://www.ncbi.nlm.nih.gov/pubmed/36774377
http://dx.doi.org/10.1038/s41467-023-36492-2
work_keys_str_mv AT marong aspectralmethodforassessingandcombiningmultipledatavisualizations
AT sunericd aspectralmethodforassessingandcombiningmultipledatavisualizations
AT zoujames aspectralmethodforassessingandcombiningmultipledatavisualizations
AT marong spectralmethodforassessingandcombiningmultipledatavisualizations
AT sunericd spectralmethodforassessingandcombiningmultipledatavisualizations
AT zoujames spectralmethodforassessingandcombiningmultipledatavisualizations