Cargando…
MISSING VALUE IMPUTATION VIA GRAPH COMPLETION IN QUESTIONNAIRE SCORES FROM PERSONS WITH DEMENTIA
Background: Questionnaires are widely used to evaluate cognitive functions, depression, and loneliness of persons with dementia (PWDs). Successful assessment and treatment of dementia hinge on effective analysis of PWDs’ answers. However, many studies, especially pilot ones, are with small sample si...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6844977/ http://dx.doi.org/10.1093/geroni/igz038.3524 |
_version_ | 1783468556105547776 |
---|---|
author | Kan, Chen Kim, Won Hwa Xu, Ling Fields, Noelle L |
author_facet | Kan, Chen Kim, Won Hwa Xu, Ling Fields, Noelle L |
author_sort | Kan, Chen |
collection | PubMed |
description | Background: Questionnaires are widely used to evaluate cognitive functions, depression, and loneliness of persons with dementia (PWDs). Successful assessment and treatment of dementia hinge on effective analysis of PWDs’ answers. However, many studies, especially pilot ones, are with small sample sizes. Further, most of them contain missing data as PWDs skip some study sessions due to their clinical conditions. Conventional imputation strategies are not well-suited as bias will be introduced because of insufficient samples. Method: A novel machine learning framework was developed based on harmonic analysis on graphs to robustly handle missing values. Participants were first embedded as nodes in the graph with edges derived by their similarities based on demographic information, activities of daily living, etc. Then, questionnaire scores with missing values were regarded as a function on the nodes, and they were estimated based on spectral analysis of the graph with a smoothness constraint. The proposed approach was evaluated using data from our pilot study of dementia subjects (N=15) with 15% data missing. Result: A few complete variables (binary or ordinal) were available for all participants. For each variable, we randomly removed 5 scores to mimic missing values. With our approach, we could recover all missing values with 90% accuracy on average. We were also able to impute the actual missing values in the dataset within reasonable ranges. Conclusion: Our proposed approach imputes missing values with high accuracy despite the small sample size. The proposed approach will significantly boost statistical power of various small-scale studies with missing data. |
format | Online Article Text |
id | pubmed-6844977 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-68449772019-11-18 MISSING VALUE IMPUTATION VIA GRAPH COMPLETION IN QUESTIONNAIRE SCORES FROM PERSONS WITH DEMENTIA Kan, Chen Kim, Won Hwa Xu, Ling Fields, Noelle L Innov Aging Session Lb3620 (Late Breaking Poster) Background: Questionnaires are widely used to evaluate cognitive functions, depression, and loneliness of persons with dementia (PWDs). Successful assessment and treatment of dementia hinge on effective analysis of PWDs’ answers. However, many studies, especially pilot ones, are with small sample sizes. Further, most of them contain missing data as PWDs skip some study sessions due to their clinical conditions. Conventional imputation strategies are not well-suited as bias will be introduced because of insufficient samples. Method: A novel machine learning framework was developed based on harmonic analysis on graphs to robustly handle missing values. Participants were first embedded as nodes in the graph with edges derived by their similarities based on demographic information, activities of daily living, etc. Then, questionnaire scores with missing values were regarded as a function on the nodes, and they were estimated based on spectral analysis of the graph with a smoothness constraint. The proposed approach was evaluated using data from our pilot study of dementia subjects (N=15) with 15% data missing. Result: A few complete variables (binary or ordinal) were available for all participants. For each variable, we randomly removed 5 scores to mimic missing values. With our approach, we could recover all missing values with 90% accuracy on average. We were also able to impute the actual missing values in the dataset within reasonable ranges. Conclusion: Our proposed approach imputes missing values with high accuracy despite the small sample size. The proposed approach will significantly boost statistical power of various small-scale studies with missing data. Oxford University Press 2019-11-08 /pmc/articles/PMC6844977/ http://dx.doi.org/10.1093/geroni/igz038.3524 Text en © The Author(s) 2019. Published by Oxford University Press on behalf of The Gerontological Society of America. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Session Lb3620 (Late Breaking Poster) Kan, Chen Kim, Won Hwa Xu, Ling Fields, Noelle L MISSING VALUE IMPUTATION VIA GRAPH COMPLETION IN QUESTIONNAIRE SCORES FROM PERSONS WITH DEMENTIA |
title | MISSING VALUE IMPUTATION VIA GRAPH COMPLETION IN QUESTIONNAIRE SCORES FROM PERSONS WITH DEMENTIA |
title_full | MISSING VALUE IMPUTATION VIA GRAPH COMPLETION IN QUESTIONNAIRE SCORES FROM PERSONS WITH DEMENTIA |
title_fullStr | MISSING VALUE IMPUTATION VIA GRAPH COMPLETION IN QUESTIONNAIRE SCORES FROM PERSONS WITH DEMENTIA |
title_full_unstemmed | MISSING VALUE IMPUTATION VIA GRAPH COMPLETION IN QUESTIONNAIRE SCORES FROM PERSONS WITH DEMENTIA |
title_short | MISSING VALUE IMPUTATION VIA GRAPH COMPLETION IN QUESTIONNAIRE SCORES FROM PERSONS WITH DEMENTIA |
title_sort | missing value imputation via graph completion in questionnaire scores from persons with dementia |
topic | Session Lb3620 (Late Breaking Poster) |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6844977/ http://dx.doi.org/10.1093/geroni/igz038.3524 |
work_keys_str_mv | AT kanchen missingvalueimputationviagraphcompletioninquestionnairescoresfrompersonswithdementia AT kimwonhwa missingvalueimputationviagraphcompletioninquestionnairescoresfrompersonswithdementia AT xuling missingvalueimputationviagraphcompletioninquestionnairescoresfrompersonswithdementia AT fieldsnoellel missingvalueimputationviagraphcompletioninquestionnairescoresfrompersonswithdementia |