Cargando…

A comparison of 71 binary similarity coefficients: The effect of base rates

There are many psychological applications that require collapsing the information in a two-mode (e.g., respondents-by-attributes) binary matrix into a one-mode (e.g., attributes-by-attributes) similarity matrix. This process requires the selection of a measure of similarity between binary attributes...

Descripción completa

Detalles Bibliográficos
Autores principales: Brusco, Michael, Cradit, J. Dennis, Steinley, Douglas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8026075/
https://www.ncbi.nlm.nih.gov/pubmed/33826612
http://dx.doi.org/10.1371/journal.pone.0247751
_version_ 1783675607914119168
author Brusco, Michael
Cradit, J. Dennis
Steinley, Douglas
author_facet Brusco, Michael
Cradit, J. Dennis
Steinley, Douglas
author_sort Brusco, Michael
collection PubMed
description There are many psychological applications that require collapsing the information in a two-mode (e.g., respondents-by-attributes) binary matrix into a one-mode (e.g., attributes-by-attributes) similarity matrix. This process requires the selection of a measure of similarity between binary attributes. A vast number of binary similarity coefficients have been proposed in fields such as biology, geology, and ecology. Although previous studies have reported cluster analyses of binary similarity coefficients, there has been little exploration of how cluster memberships are affected by the base rates (percentage of ones) for the binary attributes. We conducted a simulation experiment that compared two-cluster K-median partitions of 71 binary similarity coefficients based on their pairwise correlations obtained under 15 different base-rate configurations. The results reveal that some subsets of coefficients consistently group together regardless of the base rates. However, there are other subsets of coefficients that group together for some base rates, but not for others.
format Online
Article
Text
id pubmed-8026075
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-80260752021-04-15 A comparison of 71 binary similarity coefficients: The effect of base rates Brusco, Michael Cradit, J. Dennis Steinley, Douglas PLoS One Research Article There are many psychological applications that require collapsing the information in a two-mode (e.g., respondents-by-attributes) binary matrix into a one-mode (e.g., attributes-by-attributes) similarity matrix. This process requires the selection of a measure of similarity between binary attributes. A vast number of binary similarity coefficients have been proposed in fields such as biology, geology, and ecology. Although previous studies have reported cluster analyses of binary similarity coefficients, there has been little exploration of how cluster memberships are affected by the base rates (percentage of ones) for the binary attributes. We conducted a simulation experiment that compared two-cluster K-median partitions of 71 binary similarity coefficients based on their pairwise correlations obtained under 15 different base-rate configurations. The results reveal that some subsets of coefficients consistently group together regardless of the base rates. However, there are other subsets of coefficients that group together for some base rates, but not for others. Public Library of Science 2021-04-07 /pmc/articles/PMC8026075/ /pubmed/33826612 http://dx.doi.org/10.1371/journal.pone.0247751 Text en © 2021 Brusco et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Brusco, Michael
Cradit, J. Dennis
Steinley, Douglas
A comparison of 71 binary similarity coefficients: The effect of base rates
title A comparison of 71 binary similarity coefficients: The effect of base rates
title_full A comparison of 71 binary similarity coefficients: The effect of base rates
title_fullStr A comparison of 71 binary similarity coefficients: The effect of base rates
title_full_unstemmed A comparison of 71 binary similarity coefficients: The effect of base rates
title_short A comparison of 71 binary similarity coefficients: The effect of base rates
title_sort comparison of 71 binary similarity coefficients: the effect of base rates
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8026075/
https://www.ncbi.nlm.nih.gov/pubmed/33826612
http://dx.doi.org/10.1371/journal.pone.0247751
work_keys_str_mv AT bruscomichael acomparisonof71binarysimilaritycoefficientstheeffectofbaserates
AT craditjdennis acomparisonof71binarysimilaritycoefficientstheeffectofbaserates
AT steinleydouglas acomparisonof71binarysimilaritycoefficientstheeffectofbaserates
AT bruscomichael comparisonof71binarysimilaritycoefficientstheeffectofbaserates
AT craditjdennis comparisonof71binarysimilaritycoefficientstheeffectofbaserates
AT steinleydouglas comparisonof71binarysimilaritycoefficientstheeffectofbaserates