Cargando…

Shannon diversity index: a call to replace the original Shannon’s formula with unbiased estimator in the population genetics studies

BACKGROUND: The Shannon diversity index has been widely used in population genetics studies. Recently, it was proposed as a unifying measure of diversity at different levels—from genes and populations to whole species and ecosystems. The index, however, was proven to be negatively biased at small sa...

Descripción completa

Detalles Bibliográficos
Autor principal: Konopiński, Maciej K.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7331625/
https://www.ncbi.nlm.nih.gov/pubmed/32655992
http://dx.doi.org/10.7717/peerj.9391
_version_ 1783553368063475712
author Konopiński, Maciej K.
author_facet Konopiński, Maciej K.
author_sort Konopiński, Maciej K.
collection PubMed
description BACKGROUND: The Shannon diversity index has been widely used in population genetics studies. Recently, it was proposed as a unifying measure of diversity at different levels—from genes and populations to whole species and ecosystems. The index, however, was proven to be negatively biased at small sample sizes. Modifications to the original Shannon’s formula have been proposed to obtain an unbiased estimator. METHODS: In this study, the performance of four different estimators of Shannon index—the original Shannon’s formula and those of Zahl, Chao and Shen and Chao et al.—was tested on simulated microsatellite data. Both the simulation and analysis of the results were performed in the R language environment. A new R function was created for the calculation of all four indices from the genind data format. RESULTS: Sample size dependence was detected in all the estimators analysed; however, the deviation from parametric values was substantially smaller in the derived measures than in the original Shannon’s formula. Error rate was negatively associated with population heterozygosity. Comparisons among loci showed that fast-mutating loci were less affected by the error, except for the original Shannon’s estimator which, in the smallest sample, was more strongly affected by loci with a higher number of alleles. The Zahl and Chao et al. estimators performed notably better than the original Shannon’s formula. CONCLUSION: The results of this study show that the original Shannon index should no longer be used as a measure of genetic diversity and should be replaced by Zahl’s unbiased estimator.
format Online
Article
Text
id pubmed-7331625
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-73316252020-07-09 Shannon diversity index: a call to replace the original Shannon’s formula with unbiased estimator in the population genetics studies Konopiński, Maciej K. PeerJ Biodiversity BACKGROUND: The Shannon diversity index has been widely used in population genetics studies. Recently, it was proposed as a unifying measure of diversity at different levels—from genes and populations to whole species and ecosystems. The index, however, was proven to be negatively biased at small sample sizes. Modifications to the original Shannon’s formula have been proposed to obtain an unbiased estimator. METHODS: In this study, the performance of four different estimators of Shannon index—the original Shannon’s formula and those of Zahl, Chao and Shen and Chao et al.—was tested on simulated microsatellite data. Both the simulation and analysis of the results were performed in the R language environment. A new R function was created for the calculation of all four indices from the genind data format. RESULTS: Sample size dependence was detected in all the estimators analysed; however, the deviation from parametric values was substantially smaller in the derived measures than in the original Shannon’s formula. Error rate was negatively associated with population heterozygosity. Comparisons among loci showed that fast-mutating loci were less affected by the error, except for the original Shannon’s estimator which, in the smallest sample, was more strongly affected by loci with a higher number of alleles. The Zahl and Chao et al. estimators performed notably better than the original Shannon’s formula. CONCLUSION: The results of this study show that the original Shannon index should no longer be used as a measure of genetic diversity and should be replaced by Zahl’s unbiased estimator. PeerJ Inc. 2020-06-29 /pmc/articles/PMC7331625/ /pubmed/32655992 http://dx.doi.org/10.7717/peerj.9391 Text en ©2020 Konopiński https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Biodiversity
Konopiński, Maciej K.
Shannon diversity index: a call to replace the original Shannon’s formula with unbiased estimator in the population genetics studies
title Shannon diversity index: a call to replace the original Shannon’s formula with unbiased estimator in the population genetics studies
title_full Shannon diversity index: a call to replace the original Shannon’s formula with unbiased estimator in the population genetics studies
title_fullStr Shannon diversity index: a call to replace the original Shannon’s formula with unbiased estimator in the population genetics studies
title_full_unstemmed Shannon diversity index: a call to replace the original Shannon’s formula with unbiased estimator in the population genetics studies
title_short Shannon diversity index: a call to replace the original Shannon’s formula with unbiased estimator in the population genetics studies
title_sort shannon diversity index: a call to replace the original shannon’s formula with unbiased estimator in the population genetics studies
topic Biodiversity
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7331625/
https://www.ncbi.nlm.nih.gov/pubmed/32655992
http://dx.doi.org/10.7717/peerj.9391
work_keys_str_mv AT konopinskimaciejk shannondiversityindexacalltoreplacetheoriginalshannonsformulawithunbiasedestimatorinthepopulationgeneticsstudies