Cargando…

Mixed cumulative probit: a multivariate generalization of transition analysis that accommodates variation in the shape, spread and structure of data

Biological data are frequently nonlinear, heteroscedastic and conditionally dependent, and often researchers deal with missing data. To account for characteristics common in biological data in one algorithm, we developed the mixed cumulative probit (MCP), a novel latent trait model that is a formal...

Descripción completa

Detalles Bibliográficos
Autores principales: Stull, Kyra E., Chu, Elaine Y., Corron, Louise K., Price, Michael H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Royal Society 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9974299/
https://www.ncbi.nlm.nih.gov/pubmed/36866077
http://dx.doi.org/10.1098/rsos.220963
_version_ 1784898699586961408
author Stull, Kyra E.
Chu, Elaine Y.
Corron, Louise K.
Price, Michael H.
author_facet Stull, Kyra E.
Chu, Elaine Y.
Corron, Louise K.
Price, Michael H.
author_sort Stull, Kyra E.
collection PubMed
description Biological data are frequently nonlinear, heteroscedastic and conditionally dependent, and often researchers deal with missing data. To account for characteristics common in biological data in one algorithm, we developed the mixed cumulative probit (MCP), a novel latent trait model that is a formal generalization of the cumulative probit model usually used in transition analysis. Specifically, the MCP accommodates heteroscedasticity, mixtures of ordinal and continuous variables, missing values, conditional dependence and alternative specifications of the mean response and noise response. Cross-validation selects the best model parameters (mean response and the noise response for simple models, as well as conditional dependence for multivariate models), and the Kullback–Leibler divergence evaluates information gain during posterior inference to quantify mis-specified models (conditionally dependent versus conditionally independent). Two continuous and four ordinal skeletal and dental variables collected from 1296 individuals (aged birth to 22 years) from the Subadult Virtual Anthropology Database are used to introduce and demonstrate the algorithm. In addition to describing the features of the MCP, we provide material to help fit novel datasets using the MCP. The flexible, general formulation with model selection provides a process to robustly identify the modelling assumptions that are best suited for the data at hand.
format Online
Article
Text
id pubmed-9974299
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher The Royal Society
record_format MEDLINE/PubMed
spelling pubmed-99742992023-03-01 Mixed cumulative probit: a multivariate generalization of transition analysis that accommodates variation in the shape, spread and structure of data Stull, Kyra E. Chu, Elaine Y. Corron, Louise K. Price, Michael H. R Soc Open Sci Organismal and Evolutionary Biology Biological data are frequently nonlinear, heteroscedastic and conditionally dependent, and often researchers deal with missing data. To account for characteristics common in biological data in one algorithm, we developed the mixed cumulative probit (MCP), a novel latent trait model that is a formal generalization of the cumulative probit model usually used in transition analysis. Specifically, the MCP accommodates heteroscedasticity, mixtures of ordinal and continuous variables, missing values, conditional dependence and alternative specifications of the mean response and noise response. Cross-validation selects the best model parameters (mean response and the noise response for simple models, as well as conditional dependence for multivariate models), and the Kullback–Leibler divergence evaluates information gain during posterior inference to quantify mis-specified models (conditionally dependent versus conditionally independent). Two continuous and four ordinal skeletal and dental variables collected from 1296 individuals (aged birth to 22 years) from the Subadult Virtual Anthropology Database are used to introduce and demonstrate the algorithm. In addition to describing the features of the MCP, we provide material to help fit novel datasets using the MCP. The flexible, general formulation with model selection provides a process to robustly identify the modelling assumptions that are best suited for the data at hand. The Royal Society 2023-03-01 /pmc/articles/PMC9974299/ /pubmed/36866077 http://dx.doi.org/10.1098/rsos.220963 Text en © 2023 The Authors. https://creativecommons.org/licenses/by/4.0/Published by the Royal Society under the terms of the Creative Commons Attribution License http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, provided the original author and source are credited.
spellingShingle Organismal and Evolutionary Biology
Stull, Kyra E.
Chu, Elaine Y.
Corron, Louise K.
Price, Michael H.
Mixed cumulative probit: a multivariate generalization of transition analysis that accommodates variation in the shape, spread and structure of data
title Mixed cumulative probit: a multivariate generalization of transition analysis that accommodates variation in the shape, spread and structure of data
title_full Mixed cumulative probit: a multivariate generalization of transition analysis that accommodates variation in the shape, spread and structure of data
title_fullStr Mixed cumulative probit: a multivariate generalization of transition analysis that accommodates variation in the shape, spread and structure of data
title_full_unstemmed Mixed cumulative probit: a multivariate generalization of transition analysis that accommodates variation in the shape, spread and structure of data
title_short Mixed cumulative probit: a multivariate generalization of transition analysis that accommodates variation in the shape, spread and structure of data
title_sort mixed cumulative probit: a multivariate generalization of transition analysis that accommodates variation in the shape, spread and structure of data
topic Organismal and Evolutionary Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9974299/
https://www.ncbi.nlm.nih.gov/pubmed/36866077
http://dx.doi.org/10.1098/rsos.220963
work_keys_str_mv AT stullkyrae mixedcumulativeprobitamultivariategeneralizationoftransitionanalysisthataccommodatesvariationintheshapespreadandstructureofdata
AT chuelainey mixedcumulativeprobitamultivariategeneralizationoftransitionanalysisthataccommodatesvariationintheshapespreadandstructureofdata
AT corronlouisek mixedcumulativeprobitamultivariategeneralizationoftransitionanalysisthataccommodatesvariationintheshapespreadandstructureofdata
AT pricemichaelh mixedcumulativeprobitamultivariategeneralizationoftransitionanalysisthataccommodatesvariationintheshapespreadandstructureofdata