Cargando…
Mixed cumulative probit: a multivariate generalization of transition analysis that accommodates variation in the shape, spread and structure of data
Biological data are frequently nonlinear, heteroscedastic and conditionally dependent, and often researchers deal with missing data. To account for characteristics common in biological data in one algorithm, we developed the mixed cumulative probit (MCP), a novel latent trait model that is a formal...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
The Royal Society
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9974299/ https://www.ncbi.nlm.nih.gov/pubmed/36866077 http://dx.doi.org/10.1098/rsos.220963 |
_version_ | 1784898699586961408 |
---|---|
author | Stull, Kyra E. Chu, Elaine Y. Corron, Louise K. Price, Michael H. |
author_facet | Stull, Kyra E. Chu, Elaine Y. Corron, Louise K. Price, Michael H. |
author_sort | Stull, Kyra E. |
collection | PubMed |
description | Biological data are frequently nonlinear, heteroscedastic and conditionally dependent, and often researchers deal with missing data. To account for characteristics common in biological data in one algorithm, we developed the mixed cumulative probit (MCP), a novel latent trait model that is a formal generalization of the cumulative probit model usually used in transition analysis. Specifically, the MCP accommodates heteroscedasticity, mixtures of ordinal and continuous variables, missing values, conditional dependence and alternative specifications of the mean response and noise response. Cross-validation selects the best model parameters (mean response and the noise response for simple models, as well as conditional dependence for multivariate models), and the Kullback–Leibler divergence evaluates information gain during posterior inference to quantify mis-specified models (conditionally dependent versus conditionally independent). Two continuous and four ordinal skeletal and dental variables collected from 1296 individuals (aged birth to 22 years) from the Subadult Virtual Anthropology Database are used to introduce and demonstrate the algorithm. In addition to describing the features of the MCP, we provide material to help fit novel datasets using the MCP. The flexible, general formulation with model selection provides a process to robustly identify the modelling assumptions that are best suited for the data at hand. |
format | Online Article Text |
id | pubmed-9974299 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | The Royal Society |
record_format | MEDLINE/PubMed |
spelling | pubmed-99742992023-03-01 Mixed cumulative probit: a multivariate generalization of transition analysis that accommodates variation in the shape, spread and structure of data Stull, Kyra E. Chu, Elaine Y. Corron, Louise K. Price, Michael H. R Soc Open Sci Organismal and Evolutionary Biology Biological data are frequently nonlinear, heteroscedastic and conditionally dependent, and often researchers deal with missing data. To account for characteristics common in biological data in one algorithm, we developed the mixed cumulative probit (MCP), a novel latent trait model that is a formal generalization of the cumulative probit model usually used in transition analysis. Specifically, the MCP accommodates heteroscedasticity, mixtures of ordinal and continuous variables, missing values, conditional dependence and alternative specifications of the mean response and noise response. Cross-validation selects the best model parameters (mean response and the noise response for simple models, as well as conditional dependence for multivariate models), and the Kullback–Leibler divergence evaluates information gain during posterior inference to quantify mis-specified models (conditionally dependent versus conditionally independent). Two continuous and four ordinal skeletal and dental variables collected from 1296 individuals (aged birth to 22 years) from the Subadult Virtual Anthropology Database are used to introduce and demonstrate the algorithm. In addition to describing the features of the MCP, we provide material to help fit novel datasets using the MCP. The flexible, general formulation with model selection provides a process to robustly identify the modelling assumptions that are best suited for the data at hand. The Royal Society 2023-03-01 /pmc/articles/PMC9974299/ /pubmed/36866077 http://dx.doi.org/10.1098/rsos.220963 Text en © 2023 The Authors. https://creativecommons.org/licenses/by/4.0/Published by the Royal Society under the terms of the Creative Commons Attribution License http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, provided the original author and source are credited. |
spellingShingle | Organismal and Evolutionary Biology Stull, Kyra E. Chu, Elaine Y. Corron, Louise K. Price, Michael H. Mixed cumulative probit: a multivariate generalization of transition analysis that accommodates variation in the shape, spread and structure of data |
title | Mixed cumulative probit: a multivariate generalization of transition analysis that accommodates variation in the shape, spread and structure of data |
title_full | Mixed cumulative probit: a multivariate generalization of transition analysis that accommodates variation in the shape, spread and structure of data |
title_fullStr | Mixed cumulative probit: a multivariate generalization of transition analysis that accommodates variation in the shape, spread and structure of data |
title_full_unstemmed | Mixed cumulative probit: a multivariate generalization of transition analysis that accommodates variation in the shape, spread and structure of data |
title_short | Mixed cumulative probit: a multivariate generalization of transition analysis that accommodates variation in the shape, spread and structure of data |
title_sort | mixed cumulative probit: a multivariate generalization of transition analysis that accommodates variation in the shape, spread and structure of data |
topic | Organismal and Evolutionary Biology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9974299/ https://www.ncbi.nlm.nih.gov/pubmed/36866077 http://dx.doi.org/10.1098/rsos.220963 |
work_keys_str_mv | AT stullkyrae mixedcumulativeprobitamultivariategeneralizationoftransitionanalysisthataccommodatesvariationintheshapespreadandstructureofdata AT chuelainey mixedcumulativeprobitamultivariategeneralizationoftransitionanalysisthataccommodatesvariationintheshapespreadandstructureofdata AT corronlouisek mixedcumulativeprobitamultivariategeneralizationoftransitionanalysisthataccommodatesvariationintheshapespreadandstructureofdata AT pricemichaelh mixedcumulativeprobitamultivariategeneralizationoftransitionanalysisthataccommodatesvariationintheshapespreadandstructureofdata |