Cargando…

Correspondence analysis, spectral clustering and graph embedding: applications to ecology and economic complexity

Identifying structure underlying high-dimensional data is a common challenge across scientific disciplines. We revisit correspondence analysis (CA), a classical method revealing such structures, from a network perspective. We present the poorly-known equivalence of CA to spectral clustering and grap...

Descripción completa

Detalles Bibliográficos
Autores principales: van Dam, Alje, Dekker, Mark, Morales-Castilla, Ignacio, Rodríguez, Miguel Á., Wichmann, David, Baudena, Mara
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8076183/
https://www.ncbi.nlm.nih.gov/pubmed/33903623
http://dx.doi.org/10.1038/s41598-021-87971-9
_version_ 1783684642988097536
author van Dam, Alje
Dekker, Mark
Morales-Castilla, Ignacio
Rodríguez, Miguel Á.
Wichmann, David
Baudena, Mara
author_facet van Dam, Alje
Dekker, Mark
Morales-Castilla, Ignacio
Rodríguez, Miguel Á.
Wichmann, David
Baudena, Mara
author_sort van Dam, Alje
collection PubMed
description Identifying structure underlying high-dimensional data is a common challenge across scientific disciplines. We revisit correspondence analysis (CA), a classical method revealing such structures, from a network perspective. We present the poorly-known equivalence of CA to spectral clustering and graph-embedding techniques. We point out a number of complementary interpretations of CA results, other than its traditional interpretation as an ordination technique. These interpretations relate to the structure of the underlying networks. We then discuss an empirical example drawn from ecology, where we apply CA to the global distribution of Carnivora species to show how both the clustering and ordination interpretation can be used to find gradients in clustered data. In the second empirical example, we revisit the economic complexity index as an application of correspondence analysis, and use the different interpretations of the method to shed new light on the empirical results within this literature.
format Online
Article
Text
id pubmed-8076183
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-80761832021-04-27 Correspondence analysis, spectral clustering and graph embedding: applications to ecology and economic complexity van Dam, Alje Dekker, Mark Morales-Castilla, Ignacio Rodríguez, Miguel Á. Wichmann, David Baudena, Mara Sci Rep Article Identifying structure underlying high-dimensional data is a common challenge across scientific disciplines. We revisit correspondence analysis (CA), a classical method revealing such structures, from a network perspective. We present the poorly-known equivalence of CA to spectral clustering and graph-embedding techniques. We point out a number of complementary interpretations of CA results, other than its traditional interpretation as an ordination technique. These interpretations relate to the structure of the underlying networks. We then discuss an empirical example drawn from ecology, where we apply CA to the global distribution of Carnivora species to show how both the clustering and ordination interpretation can be used to find gradients in clustered data. In the second empirical example, we revisit the economic complexity index as an application of correspondence analysis, and use the different interpretations of the method to shed new light on the empirical results within this literature. Nature Publishing Group UK 2021-04-26 /pmc/articles/PMC8076183/ /pubmed/33903623 http://dx.doi.org/10.1038/s41598-021-87971-9 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
van Dam, Alje
Dekker, Mark
Morales-Castilla, Ignacio
Rodríguez, Miguel Á.
Wichmann, David
Baudena, Mara
Correspondence analysis, spectral clustering and graph embedding: applications to ecology and economic complexity
title Correspondence analysis, spectral clustering and graph embedding: applications to ecology and economic complexity
title_full Correspondence analysis, spectral clustering and graph embedding: applications to ecology and economic complexity
title_fullStr Correspondence analysis, spectral clustering and graph embedding: applications to ecology and economic complexity
title_full_unstemmed Correspondence analysis, spectral clustering and graph embedding: applications to ecology and economic complexity
title_short Correspondence analysis, spectral clustering and graph embedding: applications to ecology and economic complexity
title_sort correspondence analysis, spectral clustering and graph embedding: applications to ecology and economic complexity
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8076183/
https://www.ncbi.nlm.nih.gov/pubmed/33903623
http://dx.doi.org/10.1038/s41598-021-87971-9
work_keys_str_mv AT vandamalje correspondenceanalysisspectralclusteringandgraphembeddingapplicationstoecologyandeconomiccomplexity
AT dekkermark correspondenceanalysisspectralclusteringandgraphembeddingapplicationstoecologyandeconomiccomplexity
AT moralescastillaignacio correspondenceanalysisspectralclusteringandgraphembeddingapplicationstoecologyandeconomiccomplexity
AT rodriguezmiguela correspondenceanalysisspectralclusteringandgraphembeddingapplicationstoecologyandeconomiccomplexity
AT wichmanndavid correspondenceanalysisspectralclusteringandgraphembeddingapplicationstoecologyandeconomiccomplexity
AT baudenamara correspondenceanalysisspectralclusteringandgraphembeddingapplicationstoecologyandeconomiccomplexity