Cargando…

Constructing training set using distance between learnt graphical models of time series data on patient physiology, to predict disease scores

Interventional endeavours in medicine include prediction of a score that parametrises a new subject’s susceptibility to a given disease, at the pre-onset stage. Here, for the first time, we provide reliable learning of such a score in the context of the potentially-terminal disease VOD, that often a...

Descripción completa

Detalles Bibliográficos
Autores principales: Chakrabarty, Dalia, Wang, Kangrui, Roy, Gargi, Bhojgaria, Akash, Zhang, Chuqiao, Pavlu, Jiri, Chakrabartty, Joydeep
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10586698/
https://www.ncbi.nlm.nih.gov/pubmed/37856497
http://dx.doi.org/10.1371/journal.pone.0292404
_version_ 1785123201734410240
author Chakrabarty, Dalia
Wang, Kangrui
Roy, Gargi
Bhojgaria, Akash
Zhang, Chuqiao
Pavlu, Jiri
Chakrabartty, Joydeep
author_facet Chakrabarty, Dalia
Wang, Kangrui
Roy, Gargi
Bhojgaria, Akash
Zhang, Chuqiao
Pavlu, Jiri
Chakrabartty, Joydeep
author_sort Chakrabarty, Dalia
collection PubMed
description Interventional endeavours in medicine include prediction of a score that parametrises a new subject’s susceptibility to a given disease, at the pre-onset stage. Here, for the first time, we provide reliable learning of such a score in the context of the potentially-terminal disease VOD, that often arises after bone marrow transplants. Indeed, the probability of surviving VOD, is correlated with early intervention. In our work, the VOD-score of each patient in a retrospective cohort, is defined as the distance between the (posterior) probability of a random graph variable—given the inter-variable partial correlation matrix of the time series data on variables that represent different aspects of patient physiology—and that given such time series data of an arbitrarily-selected reference patient. Such time series data is recorded from a pre-transplant to a post-transplant time, for each patient in this cohort, though the data available for distinct patients bear differential temporal coverage, owing to differential patient longevities. Each graph is a Soft Random Geometric Graph drawn in a probabilistic metric space, and the computed inter-graph distance is oblivious to the length of the time series data. The VOD-score learnt in this way, and the corresponding pre-transplant parameter vector of each patient in this retrospective cohort, then results in the training data, using which we learn the function that takes VOD-score as its input, and outputs the vector of pre-transplant parameters. We model this function with a vector-variate Gaussian Process, the covariance structure of which is kernel parametrised. Such modelling is easier than if the score variable were the output. Then for any prospective patient, whose pre-transplant variables are known, we learn the VOD-score (and the hyperparameters of the covariance kernel), using Markov Chain Monte Carlo based inference.
format Online
Article
Text
id pubmed-10586698
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-105866982023-10-20 Constructing training set using distance between learnt graphical models of time series data on patient physiology, to predict disease scores Chakrabarty, Dalia Wang, Kangrui Roy, Gargi Bhojgaria, Akash Zhang, Chuqiao Pavlu, Jiri Chakrabartty, Joydeep PLoS One Research Article Interventional endeavours in medicine include prediction of a score that parametrises a new subject’s susceptibility to a given disease, at the pre-onset stage. Here, for the first time, we provide reliable learning of such a score in the context of the potentially-terminal disease VOD, that often arises after bone marrow transplants. Indeed, the probability of surviving VOD, is correlated with early intervention. In our work, the VOD-score of each patient in a retrospective cohort, is defined as the distance between the (posterior) probability of a random graph variable—given the inter-variable partial correlation matrix of the time series data on variables that represent different aspects of patient physiology—and that given such time series data of an arbitrarily-selected reference patient. Such time series data is recorded from a pre-transplant to a post-transplant time, for each patient in this cohort, though the data available for distinct patients bear differential temporal coverage, owing to differential patient longevities. Each graph is a Soft Random Geometric Graph drawn in a probabilistic metric space, and the computed inter-graph distance is oblivious to the length of the time series data. The VOD-score learnt in this way, and the corresponding pre-transplant parameter vector of each patient in this retrospective cohort, then results in the training data, using which we learn the function that takes VOD-score as its input, and outputs the vector of pre-transplant parameters. We model this function with a vector-variate Gaussian Process, the covariance structure of which is kernel parametrised. Such modelling is easier than if the score variable were the output. Then for any prospective patient, whose pre-transplant variables are known, we learn the VOD-score (and the hyperparameters of the covariance kernel), using Markov Chain Monte Carlo based inference. Public Library of Science 2023-10-19 /pmc/articles/PMC10586698/ /pubmed/37856497 http://dx.doi.org/10.1371/journal.pone.0292404 Text en © 2023 Chakrabarty et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Chakrabarty, Dalia
Wang, Kangrui
Roy, Gargi
Bhojgaria, Akash
Zhang, Chuqiao
Pavlu, Jiri
Chakrabartty, Joydeep
Constructing training set using distance between learnt graphical models of time series data on patient physiology, to predict disease scores
title Constructing training set using distance between learnt graphical models of time series data on patient physiology, to predict disease scores
title_full Constructing training set using distance between learnt graphical models of time series data on patient physiology, to predict disease scores
title_fullStr Constructing training set using distance between learnt graphical models of time series data on patient physiology, to predict disease scores
title_full_unstemmed Constructing training set using distance between learnt graphical models of time series data on patient physiology, to predict disease scores
title_short Constructing training set using distance between learnt graphical models of time series data on patient physiology, to predict disease scores
title_sort constructing training set using distance between learnt graphical models of time series data on patient physiology, to predict disease scores
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10586698/
https://www.ncbi.nlm.nih.gov/pubmed/37856497
http://dx.doi.org/10.1371/journal.pone.0292404
work_keys_str_mv AT chakrabartydalia constructingtrainingsetusingdistancebetweenlearntgraphicalmodelsoftimeseriesdataonpatientphysiologytopredictdiseasescores
AT wangkangrui constructingtrainingsetusingdistancebetweenlearntgraphicalmodelsoftimeseriesdataonpatientphysiologytopredictdiseasescores
AT roygargi constructingtrainingsetusingdistancebetweenlearntgraphicalmodelsoftimeseriesdataonpatientphysiologytopredictdiseasescores
AT bhojgariaakash constructingtrainingsetusingdistancebetweenlearntgraphicalmodelsoftimeseriesdataonpatientphysiologytopredictdiseasescores
AT zhangchuqiao constructingtrainingsetusingdistancebetweenlearntgraphicalmodelsoftimeseriesdataonpatientphysiologytopredictdiseasescores
AT pavlujiri constructingtrainingsetusingdistancebetweenlearntgraphicalmodelsoftimeseriesdataonpatientphysiologytopredictdiseasescores
AT chakrabarttyjoydeep constructingtrainingsetusingdistancebetweenlearntgraphicalmodelsoftimeseriesdataonpatientphysiologytopredictdiseasescores