Cargando…

Dirichlet composition distribution for compositional data with zero components: An application to fluorescence in situ hybridization (FISH) detection of chromosome

Zeros in compositional data are very common and can be classified into rounded and essential zeros. The rounded zero refers to a small proportion or below detection limit value, while the essential zero refers to the complete absence of the component in the composition. In this article, we propose a...

Descripción completa

Detalles Bibliográficos
Autores principales: Tang, Man‐Lai, Wu, Qin, Yang, Sheng, Tian, Guo‐Liang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9300144/
https://www.ncbi.nlm.nih.gov/pubmed/34914842
http://dx.doi.org/10.1002/bimj.202000334
_version_ 1784751143994261504
author Tang, Man‐Lai
Wu, Qin
Yang, Sheng
Tian, Guo‐Liang
author_facet Tang, Man‐Lai
Wu, Qin
Yang, Sheng
Tian, Guo‐Liang
author_sort Tang, Man‐Lai
collection PubMed
description Zeros in compositional data are very common and can be classified into rounded and essential zeros. The rounded zero refers to a small proportion or below detection limit value, while the essential zero refers to the complete absence of the component in the composition. In this article, we propose a new framework for analyzing compositional data with zero entries by introducing a stochastic representation. In particular, a new distribution, namely the Dirichlet composition distribution, is developed to accommodate the possible essential‐zero feature in compositional data. We derive its distributional properties (e.g., its moments). The calculation of maximum likelihood estimates via the Expectation‐Maximization (EM) algorithm will be proposed. The regression model based on the new Dirichlet composition distribution will be considered. Simulation studies are conducted to evaluate the performance of the proposed methodologies. Finally, our method is employed to analyze a dataset of fluorescence in situ hybridization (FISH) for chromosome detection.
format Online
Article
Text
id pubmed-9300144
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-93001442022-07-21 Dirichlet composition distribution for compositional data with zero components: An application to fluorescence in situ hybridization (FISH) detection of chromosome Tang, Man‐Lai Wu, Qin Yang, Sheng Tian, Guo‐Liang Biom J Statistical Modeling Zeros in compositional data are very common and can be classified into rounded and essential zeros. The rounded zero refers to a small proportion or below detection limit value, while the essential zero refers to the complete absence of the component in the composition. In this article, we propose a new framework for analyzing compositional data with zero entries by introducing a stochastic representation. In particular, a new distribution, namely the Dirichlet composition distribution, is developed to accommodate the possible essential‐zero feature in compositional data. We derive its distributional properties (e.g., its moments). The calculation of maximum likelihood estimates via the Expectation‐Maximization (EM) algorithm will be proposed. The regression model based on the new Dirichlet composition distribution will be considered. Simulation studies are conducted to evaluate the performance of the proposed methodologies. Finally, our method is employed to analyze a dataset of fluorescence in situ hybridization (FISH) for chromosome detection. John Wiley and Sons Inc. 2021-12-16 2022-04 /pmc/articles/PMC9300144/ /pubmed/34914842 http://dx.doi.org/10.1002/bimj.202000334 Text en © 2021 The Authors. Biometrical Journal published by Wiley‐VCH GmbH. https://creativecommons.org/licenses/by/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Statistical Modeling
Tang, Man‐Lai
Wu, Qin
Yang, Sheng
Tian, Guo‐Liang
Dirichlet composition distribution for compositional data with zero components: An application to fluorescence in situ hybridization (FISH) detection of chromosome
title Dirichlet composition distribution for compositional data with zero components: An application to fluorescence in situ hybridization (FISH) detection of chromosome
title_full Dirichlet composition distribution for compositional data with zero components: An application to fluorescence in situ hybridization (FISH) detection of chromosome
title_fullStr Dirichlet composition distribution for compositional data with zero components: An application to fluorescence in situ hybridization (FISH) detection of chromosome
title_full_unstemmed Dirichlet composition distribution for compositional data with zero components: An application to fluorescence in situ hybridization (FISH) detection of chromosome
title_short Dirichlet composition distribution for compositional data with zero components: An application to fluorescence in situ hybridization (FISH) detection of chromosome
title_sort dirichlet composition distribution for compositional data with zero components: an application to fluorescence in situ hybridization (fish) detection of chromosome
topic Statistical Modeling
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9300144/
https://www.ncbi.nlm.nih.gov/pubmed/34914842
http://dx.doi.org/10.1002/bimj.202000334
work_keys_str_mv AT tangmanlai dirichletcompositiondistributionforcompositionaldatawithzerocomponentsanapplicationtofluorescenceinsituhybridizationfishdetectionofchromosome
AT wuqin dirichletcompositiondistributionforcompositionaldatawithzerocomponentsanapplicationtofluorescenceinsituhybridizationfishdetectionofchromosome
AT yangsheng dirichletcompositiondistributionforcompositionaldatawithzerocomponentsanapplicationtofluorescenceinsituhybridizationfishdetectionofchromosome
AT tianguoliang dirichletcompositiondistributionforcompositionaldatawithzerocomponentsanapplicationtofluorescenceinsituhybridizationfishdetectionofchromosome