Cargando…
Multi-Way Multi-Group Segregation and Diversity Indices
BACKGROUND: How can we compute a segregation or diversity index from a three-way or multi-way contingency table, where each variable can take on an arbitrary finite number of values and where the index takes values between zero and one? Previous methods only exist for two-way contingency tables or d...
Autores principales: | , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2010
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2879365/ https://www.ncbi.nlm.nih.gov/pubmed/20532222 http://dx.doi.org/10.1371/journal.pone.0010912 |
_version_ | 1782181919579963392 |
---|---|
author | Gorelick, Root Bertram, Susan M. |
author_facet | Gorelick, Root Bertram, Susan M. |
author_sort | Gorelick, Root |
collection | PubMed |
description | BACKGROUND: How can we compute a segregation or diversity index from a three-way or multi-way contingency table, where each variable can take on an arbitrary finite number of values and where the index takes values between zero and one? Previous methods only exist for two-way contingency tables or dichotomous variables. A prototypical three-way case is the segregation index of a set of industries or departments given multiple explanatory variables of both sex and race. This can be further extended to other variables, such as disability, number of years of education, and former military service. METHODOLOGY/PRINCIPAL FINDINGS: We extend existing segregation indices based on Euclidean distance (square of coefficient of variation) and Boltzmann/Shannon/Theil index from two-way to multi-way contingency tables by including multiple summations. We provide several biological applications, such as indices for age polyethism and linkage disequilibrium. We also provide a new heuristic conceptualization of entropy-based indices. Higher order association measures are often independent of lower order ones, hence an overall segregation or diversity index should be the arithmetic mean of the normalized association measures at all orders. These methods are applicable when individuals self-identify as multiple races or even multiple sexes and when individuals work part-time in multiple industries. CONCLUSIONS/SIGNIFICANCE: The policy implications of this work are enormous, allowing people to rigorously test whether employment or biological diversity has changed. |
format | Text |
id | pubmed-2879365 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-28793652010-06-07 Multi-Way Multi-Group Segregation and Diversity Indices Gorelick, Root Bertram, Susan M. PLoS One Research Article BACKGROUND: How can we compute a segregation or diversity index from a three-way or multi-way contingency table, where each variable can take on an arbitrary finite number of values and where the index takes values between zero and one? Previous methods only exist for two-way contingency tables or dichotomous variables. A prototypical three-way case is the segregation index of a set of industries or departments given multiple explanatory variables of both sex and race. This can be further extended to other variables, such as disability, number of years of education, and former military service. METHODOLOGY/PRINCIPAL FINDINGS: We extend existing segregation indices based on Euclidean distance (square of coefficient of variation) and Boltzmann/Shannon/Theil index from two-way to multi-way contingency tables by including multiple summations. We provide several biological applications, such as indices for age polyethism and linkage disequilibrium. We also provide a new heuristic conceptualization of entropy-based indices. Higher order association measures are often independent of lower order ones, hence an overall segregation or diversity index should be the arithmetic mean of the normalized association measures at all orders. These methods are applicable when individuals self-identify as multiple races or even multiple sexes and when individuals work part-time in multiple industries. CONCLUSIONS/SIGNIFICANCE: The policy implications of this work are enormous, allowing people to rigorously test whether employment or biological diversity has changed. Public Library of Science 2010-06-01 /pmc/articles/PMC2879365/ /pubmed/20532222 http://dx.doi.org/10.1371/journal.pone.0010912 Text en Gorelick, Bertram. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Gorelick, Root Bertram, Susan M. Multi-Way Multi-Group Segregation and Diversity Indices |
title | Multi-Way Multi-Group Segregation and Diversity Indices |
title_full | Multi-Way Multi-Group Segregation and Diversity Indices |
title_fullStr | Multi-Way Multi-Group Segregation and Diversity Indices |
title_full_unstemmed | Multi-Way Multi-Group Segregation and Diversity Indices |
title_short | Multi-Way Multi-Group Segregation and Diversity Indices |
title_sort | multi-way multi-group segregation and diversity indices |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2879365/ https://www.ncbi.nlm.nih.gov/pubmed/20532222 http://dx.doi.org/10.1371/journal.pone.0010912 |
work_keys_str_mv | AT gorelickroot multiwaymultigroupsegregationanddiversityindices AT bertramsusanm multiwaymultigroupsegregationanddiversityindices |