Cargando…

MISSING VALUE IMPUTATION VIA GRAPH COMPLETION IN QUESTIONNAIRE SCORES FROM PERSONS WITH DEMENTIA

Background: Questionnaires are widely used to evaluate cognitive functions, depression, and loneliness of persons with dementia (PWDs). Successful assessment and treatment of dementia hinge on effective analysis of PWDs’ answers. However, many studies, especially pilot ones, are with small sample si...

Descripción completa

Detalles Bibliográficos
Autores principales: Kan, Chen, Kim, Won Hwa, Xu, Ling, Fields, Noelle L
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6844977/
http://dx.doi.org/10.1093/geroni/igz038.3524
_version_ 1783468556105547776
author Kan, Chen
Kim, Won Hwa
Xu, Ling
Fields, Noelle L
author_facet Kan, Chen
Kim, Won Hwa
Xu, Ling
Fields, Noelle L
author_sort Kan, Chen
collection PubMed
description Background: Questionnaires are widely used to evaluate cognitive functions, depression, and loneliness of persons with dementia (PWDs). Successful assessment and treatment of dementia hinge on effective analysis of PWDs’ answers. However, many studies, especially pilot ones, are with small sample sizes. Further, most of them contain missing data as PWDs skip some study sessions due to their clinical conditions. Conventional imputation strategies are not well-suited as bias will be introduced because of insufficient samples. Method: A novel machine learning framework was developed based on harmonic analysis on graphs to robustly handle missing values. Participants were first embedded as nodes in the graph with edges derived by their similarities based on demographic information, activities of daily living, etc. Then, questionnaire scores with missing values were regarded as a function on the nodes, and they were estimated based on spectral analysis of the graph with a smoothness constraint. The proposed approach was evaluated using data from our pilot study of dementia subjects (N=15) with 15% data missing. Result: A few complete variables (binary or ordinal) were available for all participants. For each variable, we randomly removed 5 scores to mimic missing values. With our approach, we could recover all missing values with 90% accuracy on average. We were also able to impute the actual missing values in the dataset within reasonable ranges. Conclusion: Our proposed approach imputes missing values with high accuracy despite the small sample size. The proposed approach will significantly boost statistical power of various small-scale studies with missing data.
format Online
Article
Text
id pubmed-6844977
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-68449772019-11-18 MISSING VALUE IMPUTATION VIA GRAPH COMPLETION IN QUESTIONNAIRE SCORES FROM PERSONS WITH DEMENTIA Kan, Chen Kim, Won Hwa Xu, Ling Fields, Noelle L Innov Aging Session Lb3620 (Late Breaking Poster) Background: Questionnaires are widely used to evaluate cognitive functions, depression, and loneliness of persons with dementia (PWDs). Successful assessment and treatment of dementia hinge on effective analysis of PWDs’ answers. However, many studies, especially pilot ones, are with small sample sizes. Further, most of them contain missing data as PWDs skip some study sessions due to their clinical conditions. Conventional imputation strategies are not well-suited as bias will be introduced because of insufficient samples. Method: A novel machine learning framework was developed based on harmonic analysis on graphs to robustly handle missing values. Participants were first embedded as nodes in the graph with edges derived by their similarities based on demographic information, activities of daily living, etc. Then, questionnaire scores with missing values were regarded as a function on the nodes, and they were estimated based on spectral analysis of the graph with a smoothness constraint. The proposed approach was evaluated using data from our pilot study of dementia subjects (N=15) with 15% data missing. Result: A few complete variables (binary or ordinal) were available for all participants. For each variable, we randomly removed 5 scores to mimic missing values. With our approach, we could recover all missing values with 90% accuracy on average. We were also able to impute the actual missing values in the dataset within reasonable ranges. Conclusion: Our proposed approach imputes missing values with high accuracy despite the small sample size. The proposed approach will significantly boost statistical power of various small-scale studies with missing data. Oxford University Press 2019-11-08 /pmc/articles/PMC6844977/ http://dx.doi.org/10.1093/geroni/igz038.3524 Text en © The Author(s) 2019. Published by Oxford University Press on behalf of The Gerontological Society of America. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Session Lb3620 (Late Breaking Poster)
Kan, Chen
Kim, Won Hwa
Xu, Ling
Fields, Noelle L
MISSING VALUE IMPUTATION VIA GRAPH COMPLETION IN QUESTIONNAIRE SCORES FROM PERSONS WITH DEMENTIA
title MISSING VALUE IMPUTATION VIA GRAPH COMPLETION IN QUESTIONNAIRE SCORES FROM PERSONS WITH DEMENTIA
title_full MISSING VALUE IMPUTATION VIA GRAPH COMPLETION IN QUESTIONNAIRE SCORES FROM PERSONS WITH DEMENTIA
title_fullStr MISSING VALUE IMPUTATION VIA GRAPH COMPLETION IN QUESTIONNAIRE SCORES FROM PERSONS WITH DEMENTIA
title_full_unstemmed MISSING VALUE IMPUTATION VIA GRAPH COMPLETION IN QUESTIONNAIRE SCORES FROM PERSONS WITH DEMENTIA
title_short MISSING VALUE IMPUTATION VIA GRAPH COMPLETION IN QUESTIONNAIRE SCORES FROM PERSONS WITH DEMENTIA
title_sort missing value imputation via graph completion in questionnaire scores from persons with dementia
topic Session Lb3620 (Late Breaking Poster)
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6844977/
http://dx.doi.org/10.1093/geroni/igz038.3524
work_keys_str_mv AT kanchen missingvalueimputationviagraphcompletioninquestionnairescoresfrompersonswithdementia
AT kimwonhwa missingvalueimputationviagraphcompletioninquestionnairescoresfrompersonswithdementia
AT xuling missingvalueimputationviagraphcompletioninquestionnairescoresfrompersonswithdementia
AT fieldsnoellel missingvalueimputationviagraphcompletioninquestionnairescoresfrompersonswithdementia