Cargando…

Multi-Way Multi-Group Segregation and Diversity Indices

BACKGROUND: How can we compute a segregation or diversity index from a three-way or multi-way contingency table, where each variable can take on an arbitrary finite number of values and where the index takes values between zero and one? Previous methods only exist for two-way contingency tables or d...

Descripción completa

Detalles Bibliográficos
Autores principales: Gorelick, Root, Bertram, Susan M.
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2879365/
https://www.ncbi.nlm.nih.gov/pubmed/20532222
http://dx.doi.org/10.1371/journal.pone.0010912
_version_ 1782181919579963392
author Gorelick, Root
Bertram, Susan M.
author_facet Gorelick, Root
Bertram, Susan M.
author_sort Gorelick, Root
collection PubMed
description BACKGROUND: How can we compute a segregation or diversity index from a three-way or multi-way contingency table, where each variable can take on an arbitrary finite number of values and where the index takes values between zero and one? Previous methods only exist for two-way contingency tables or dichotomous variables. A prototypical three-way case is the segregation index of a set of industries or departments given multiple explanatory variables of both sex and race. This can be further extended to other variables, such as disability, number of years of education, and former military service. METHODOLOGY/PRINCIPAL FINDINGS: We extend existing segregation indices based on Euclidean distance (square of coefficient of variation) and Boltzmann/Shannon/Theil index from two-way to multi-way contingency tables by including multiple summations. We provide several biological applications, such as indices for age polyethism and linkage disequilibrium. We also provide a new heuristic conceptualization of entropy-based indices. Higher order association measures are often independent of lower order ones, hence an overall segregation or diversity index should be the arithmetic mean of the normalized association measures at all orders. These methods are applicable when individuals self-identify as multiple races or even multiple sexes and when individuals work part-time in multiple industries. CONCLUSIONS/SIGNIFICANCE: The policy implications of this work are enormous, allowing people to rigorously test whether employment or biological diversity has changed.
format Text
id pubmed-2879365
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-28793652010-06-07 Multi-Way Multi-Group Segregation and Diversity Indices Gorelick, Root Bertram, Susan M. PLoS One Research Article BACKGROUND: How can we compute a segregation or diversity index from a three-way or multi-way contingency table, where each variable can take on an arbitrary finite number of values and where the index takes values between zero and one? Previous methods only exist for two-way contingency tables or dichotomous variables. A prototypical three-way case is the segregation index of a set of industries or departments given multiple explanatory variables of both sex and race. This can be further extended to other variables, such as disability, number of years of education, and former military service. METHODOLOGY/PRINCIPAL FINDINGS: We extend existing segregation indices based on Euclidean distance (square of coefficient of variation) and Boltzmann/Shannon/Theil index from two-way to multi-way contingency tables by including multiple summations. We provide several biological applications, such as indices for age polyethism and linkage disequilibrium. We also provide a new heuristic conceptualization of entropy-based indices. Higher order association measures are often independent of lower order ones, hence an overall segregation or diversity index should be the arithmetic mean of the normalized association measures at all orders. These methods are applicable when individuals self-identify as multiple races or even multiple sexes and when individuals work part-time in multiple industries. CONCLUSIONS/SIGNIFICANCE: The policy implications of this work are enormous, allowing people to rigorously test whether employment or biological diversity has changed. Public Library of Science 2010-06-01 /pmc/articles/PMC2879365/ /pubmed/20532222 http://dx.doi.org/10.1371/journal.pone.0010912 Text en Gorelick, Bertram. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Gorelick, Root
Bertram, Susan M.
Multi-Way Multi-Group Segregation and Diversity Indices
title Multi-Way Multi-Group Segregation and Diversity Indices
title_full Multi-Way Multi-Group Segregation and Diversity Indices
title_fullStr Multi-Way Multi-Group Segregation and Diversity Indices
title_full_unstemmed Multi-Way Multi-Group Segregation and Diversity Indices
title_short Multi-Way Multi-Group Segregation and Diversity Indices
title_sort multi-way multi-group segregation and diversity indices
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2879365/
https://www.ncbi.nlm.nih.gov/pubmed/20532222
http://dx.doi.org/10.1371/journal.pone.0010912
work_keys_str_mv AT gorelickroot multiwaymultigroupsegregationanddiversityindices
AT bertramsusanm multiwaymultigroupsegregationanddiversityindices