Cargando…

Group membership prediction when known groups consist of unknown subgroups: a Monte Carlo comparison of methods

Classification using standard statistical methods such as linear discriminant analysis (LDA) or logistic regression (LR) presume knowledge of group membership prior to the development of an algorithm for prediction. However, in many real world applications members of the same nominal group, might in...

Descripción completa

Detalles Bibliográficos
Autores principales: Finch, W. Holmes, Bolin, Jocelyn H., Kelley, Ken
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4033219/
https://www.ncbi.nlm.nih.gov/pubmed/24904445
http://dx.doi.org/10.3389/fpsyg.2014.00337
_version_ 1782317790115397632
author Finch, W. Holmes
Bolin, Jocelyn H.
Kelley, Ken
author_facet Finch, W. Holmes
Bolin, Jocelyn H.
Kelley, Ken
author_sort Finch, W. Holmes
collection PubMed
description Classification using standard statistical methods such as linear discriminant analysis (LDA) or logistic regression (LR) presume knowledge of group membership prior to the development of an algorithm for prediction. However, in many real world applications members of the same nominal group, might in fact come from different subpopulations on the underlying construct. For example, individuals diagnosed with depression will not all have the same levels of this disorder, though for the purposes of LDA or LR they will be treated in the same manner. The goal of this simulation study was to examine the performance of several methods for group classification in the case where within group membership was not homogeneous. For example, suppose there are 3 known groups but within each group two unknown classes. Several approaches were compared, including LDA, LR, classification and regression trees (CART), generalized additive models (GAM), and mixture discriminant analysis (MIXDA). Results of the study indicated that CART and mixture discriminant analysis were the most effective tools for situations in which known groups were not homogeneous, whereas LDA, LR, and GAM had the highest rates of misclassification. Implications of these results for theory and practice are discussed.
format Online
Article
Text
id pubmed-4033219
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-40332192014-06-05 Group membership prediction when known groups consist of unknown subgroups: a Monte Carlo comparison of methods Finch, W. Holmes Bolin, Jocelyn H. Kelley, Ken Front Psychol Psychology Classification using standard statistical methods such as linear discriminant analysis (LDA) or logistic regression (LR) presume knowledge of group membership prior to the development of an algorithm for prediction. However, in many real world applications members of the same nominal group, might in fact come from different subpopulations on the underlying construct. For example, individuals diagnosed with depression will not all have the same levels of this disorder, though for the purposes of LDA or LR they will be treated in the same manner. The goal of this simulation study was to examine the performance of several methods for group classification in the case where within group membership was not homogeneous. For example, suppose there are 3 known groups but within each group two unknown classes. Several approaches were compared, including LDA, LR, classification and regression trees (CART), generalized additive models (GAM), and mixture discriminant analysis (MIXDA). Results of the study indicated that CART and mixture discriminant analysis were the most effective tools for situations in which known groups were not homogeneous, whereas LDA, LR, and GAM had the highest rates of misclassification. Implications of these results for theory and practice are discussed. Frontiers Media S.A. 2014-05-20 /pmc/articles/PMC4033219/ /pubmed/24904445 http://dx.doi.org/10.3389/fpsyg.2014.00337 Text en Copyright © 2014 Finch, Bolin and Kelley. http://creativecommons.org/licenses/by/3.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Psychology
Finch, W. Holmes
Bolin, Jocelyn H.
Kelley, Ken
Group membership prediction when known groups consist of unknown subgroups: a Monte Carlo comparison of methods
title Group membership prediction when known groups consist of unknown subgroups: a Monte Carlo comparison of methods
title_full Group membership prediction when known groups consist of unknown subgroups: a Monte Carlo comparison of methods
title_fullStr Group membership prediction when known groups consist of unknown subgroups: a Monte Carlo comparison of methods
title_full_unstemmed Group membership prediction when known groups consist of unknown subgroups: a Monte Carlo comparison of methods
title_short Group membership prediction when known groups consist of unknown subgroups: a Monte Carlo comparison of methods
title_sort group membership prediction when known groups consist of unknown subgroups: a monte carlo comparison of methods
topic Psychology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4033219/
https://www.ncbi.nlm.nih.gov/pubmed/24904445
http://dx.doi.org/10.3389/fpsyg.2014.00337
work_keys_str_mv AT finchwholmes groupmembershippredictionwhenknowngroupsconsistofunknownsubgroupsamontecarlocomparisonofmethods
AT bolinjocelynh groupmembershippredictionwhenknowngroupsconsistofunknownsubgroupsamontecarlocomparisonofmethods
AT kelleyken groupmembershippredictionwhenknowngroupsconsistofunknownsubgroupsamontecarlocomparisonofmethods