Cargando…

TSI-GNN: Extending Graph Neural Networks to Handle Missing Data in Temporal Settings

We present a novel approach for imputing missing data that incorporates temporal information into bipartite graphs through an extension of graph representation learning. Missing data is abundant in several domains, particularly when observations are made over time. Most imputation methods make stron...

Descripción completa

Detalles Bibliográficos
Autores principales: Gordon, David, Petousis, Panayiotis, Zheng, Henry, Zamanzadeh, Davina, Bui, Alex A.T.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8480427/
https://www.ncbi.nlm.nih.gov/pubmed/34604740
http://dx.doi.org/10.3389/fdata.2021.693869
_version_ 1784576461612515328
author Gordon, David
Petousis, Panayiotis
Zheng, Henry
Zamanzadeh, Davina
Bui, Alex A.T.
author_facet Gordon, David
Petousis, Panayiotis
Zheng, Henry
Zamanzadeh, Davina
Bui, Alex A.T.
author_sort Gordon, David
collection PubMed
description We present a novel approach for imputing missing data that incorporates temporal information into bipartite graphs through an extension of graph representation learning. Missing data is abundant in several domains, particularly when observations are made over time. Most imputation methods make strong assumptions about the distribution of the data. While novel methods may relax some assumptions, they may not consider temporality. Moreover, when such methods are extended to handle time, they may not generalize without retraining. We propose using a joint bipartite graph approach to incorporate temporal sequence information. Specifically, the observation nodes and edges with temporal information are used in message passing to learn node and edge embeddings and to inform the imputation task. Our proposed method, temporal setting imputation using graph neural networks (TSI-GNN), captures sequence information that can then be used within an aggregation function of a graph neural network. To the best of our knowledge, this is the first effort to use a joint bipartite graph approach that captures sequence information to handle missing data. We use several benchmark datasets to test the performance of our method against a variety of conditions, comparing to both classic and contemporary methods. We further provide insight to manage the size of the generated TSI-GNN model. Through our analysis we show that incorporating temporal information into a bipartite graph improves the representation at the 30% and 60% missing rate, specifically when using a nonlinear model for downstream prediction tasks in regularly sampled datasets and is competitive with existing temporal methods under different scenarios.
format Online
Article
Text
id pubmed-8480427
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-84804272021-09-30 TSI-GNN: Extending Graph Neural Networks to Handle Missing Data in Temporal Settings Gordon, David Petousis, Panayiotis Zheng, Henry Zamanzadeh, Davina Bui, Alex A.T. Front Big Data Big Data We present a novel approach for imputing missing data that incorporates temporal information into bipartite graphs through an extension of graph representation learning. Missing data is abundant in several domains, particularly when observations are made over time. Most imputation methods make strong assumptions about the distribution of the data. While novel methods may relax some assumptions, they may not consider temporality. Moreover, when such methods are extended to handle time, they may not generalize without retraining. We propose using a joint bipartite graph approach to incorporate temporal sequence information. Specifically, the observation nodes and edges with temporal information are used in message passing to learn node and edge embeddings and to inform the imputation task. Our proposed method, temporal setting imputation using graph neural networks (TSI-GNN), captures sequence information that can then be used within an aggregation function of a graph neural network. To the best of our knowledge, this is the first effort to use a joint bipartite graph approach that captures sequence information to handle missing data. We use several benchmark datasets to test the performance of our method against a variety of conditions, comparing to both classic and contemporary methods. We further provide insight to manage the size of the generated TSI-GNN model. Through our analysis we show that incorporating temporal information into a bipartite graph improves the representation at the 30% and 60% missing rate, specifically when using a nonlinear model for downstream prediction tasks in regularly sampled datasets and is competitive with existing temporal methods under different scenarios. Frontiers Media S.A. 2021-09-15 /pmc/articles/PMC8480427/ /pubmed/34604740 http://dx.doi.org/10.3389/fdata.2021.693869 Text en Copyright © 2021 Gordon, Petousis, Zheng, Zamanzadeh and Bui. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Big Data
Gordon, David
Petousis, Panayiotis
Zheng, Henry
Zamanzadeh, Davina
Bui, Alex A.T.
TSI-GNN: Extending Graph Neural Networks to Handle Missing Data in Temporal Settings
title TSI-GNN: Extending Graph Neural Networks to Handle Missing Data in Temporal Settings
title_full TSI-GNN: Extending Graph Neural Networks to Handle Missing Data in Temporal Settings
title_fullStr TSI-GNN: Extending Graph Neural Networks to Handle Missing Data in Temporal Settings
title_full_unstemmed TSI-GNN: Extending Graph Neural Networks to Handle Missing Data in Temporal Settings
title_short TSI-GNN: Extending Graph Neural Networks to Handle Missing Data in Temporal Settings
title_sort tsi-gnn: extending graph neural networks to handle missing data in temporal settings
topic Big Data
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8480427/
https://www.ncbi.nlm.nih.gov/pubmed/34604740
http://dx.doi.org/10.3389/fdata.2021.693869
work_keys_str_mv AT gordondavid tsignnextendinggraphneuralnetworkstohandlemissingdataintemporalsettings
AT petousispanayiotis tsignnextendinggraphneuralnetworkstohandlemissingdataintemporalsettings
AT zhenghenry tsignnextendinggraphneuralnetworkstohandlemissingdataintemporalsettings
AT zamanzadehdavina tsignnextendinggraphneuralnetworkstohandlemissingdataintemporalsettings
AT buialexat tsignnextendinggraphneuralnetworkstohandlemissingdataintemporalsettings