Cargando…
Constructing training set using distance between learnt graphical models of time series data on patient physiology, to predict disease scores
Interventional endeavours in medicine include prediction of a score that parametrises a new subject’s susceptibility to a given disease, at the pre-onset stage. Here, for the first time, we provide reliable learning of such a score in the context of the potentially-terminal disease VOD, that often a...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10586698/ https://www.ncbi.nlm.nih.gov/pubmed/37856497 http://dx.doi.org/10.1371/journal.pone.0292404 |
_version_ | 1785123201734410240 |
---|---|
author | Chakrabarty, Dalia Wang, Kangrui Roy, Gargi Bhojgaria, Akash Zhang, Chuqiao Pavlu, Jiri Chakrabartty, Joydeep |
author_facet | Chakrabarty, Dalia Wang, Kangrui Roy, Gargi Bhojgaria, Akash Zhang, Chuqiao Pavlu, Jiri Chakrabartty, Joydeep |
author_sort | Chakrabarty, Dalia |
collection | PubMed |
description | Interventional endeavours in medicine include prediction of a score that parametrises a new subject’s susceptibility to a given disease, at the pre-onset stage. Here, for the first time, we provide reliable learning of such a score in the context of the potentially-terminal disease VOD, that often arises after bone marrow transplants. Indeed, the probability of surviving VOD, is correlated with early intervention. In our work, the VOD-score of each patient in a retrospective cohort, is defined as the distance between the (posterior) probability of a random graph variable—given the inter-variable partial correlation matrix of the time series data on variables that represent different aspects of patient physiology—and that given such time series data of an arbitrarily-selected reference patient. Such time series data is recorded from a pre-transplant to a post-transplant time, for each patient in this cohort, though the data available for distinct patients bear differential temporal coverage, owing to differential patient longevities. Each graph is a Soft Random Geometric Graph drawn in a probabilistic metric space, and the computed inter-graph distance is oblivious to the length of the time series data. The VOD-score learnt in this way, and the corresponding pre-transplant parameter vector of each patient in this retrospective cohort, then results in the training data, using which we learn the function that takes VOD-score as its input, and outputs the vector of pre-transplant parameters. We model this function with a vector-variate Gaussian Process, the covariance structure of which is kernel parametrised. Such modelling is easier than if the score variable were the output. Then for any prospective patient, whose pre-transplant variables are known, we learn the VOD-score (and the hyperparameters of the covariance kernel), using Markov Chain Monte Carlo based inference. |
format | Online Article Text |
id | pubmed-10586698 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-105866982023-10-20 Constructing training set using distance between learnt graphical models of time series data on patient physiology, to predict disease scores Chakrabarty, Dalia Wang, Kangrui Roy, Gargi Bhojgaria, Akash Zhang, Chuqiao Pavlu, Jiri Chakrabartty, Joydeep PLoS One Research Article Interventional endeavours in medicine include prediction of a score that parametrises a new subject’s susceptibility to a given disease, at the pre-onset stage. Here, for the first time, we provide reliable learning of such a score in the context of the potentially-terminal disease VOD, that often arises after bone marrow transplants. Indeed, the probability of surviving VOD, is correlated with early intervention. In our work, the VOD-score of each patient in a retrospective cohort, is defined as the distance between the (posterior) probability of a random graph variable—given the inter-variable partial correlation matrix of the time series data on variables that represent different aspects of patient physiology—and that given such time series data of an arbitrarily-selected reference patient. Such time series data is recorded from a pre-transplant to a post-transplant time, for each patient in this cohort, though the data available for distinct patients bear differential temporal coverage, owing to differential patient longevities. Each graph is a Soft Random Geometric Graph drawn in a probabilistic metric space, and the computed inter-graph distance is oblivious to the length of the time series data. The VOD-score learnt in this way, and the corresponding pre-transplant parameter vector of each patient in this retrospective cohort, then results in the training data, using which we learn the function that takes VOD-score as its input, and outputs the vector of pre-transplant parameters. We model this function with a vector-variate Gaussian Process, the covariance structure of which is kernel parametrised. Such modelling is easier than if the score variable were the output. Then for any prospective patient, whose pre-transplant variables are known, we learn the VOD-score (and the hyperparameters of the covariance kernel), using Markov Chain Monte Carlo based inference. Public Library of Science 2023-10-19 /pmc/articles/PMC10586698/ /pubmed/37856497 http://dx.doi.org/10.1371/journal.pone.0292404 Text en © 2023 Chakrabarty et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Chakrabarty, Dalia Wang, Kangrui Roy, Gargi Bhojgaria, Akash Zhang, Chuqiao Pavlu, Jiri Chakrabartty, Joydeep Constructing training set using distance between learnt graphical models of time series data on patient physiology, to predict disease scores |
title | Constructing training set using distance between learnt graphical models of time series data on patient physiology, to predict disease scores |
title_full | Constructing training set using distance between learnt graphical models of time series data on patient physiology, to predict disease scores |
title_fullStr | Constructing training set using distance between learnt graphical models of time series data on patient physiology, to predict disease scores |
title_full_unstemmed | Constructing training set using distance between learnt graphical models of time series data on patient physiology, to predict disease scores |
title_short | Constructing training set using distance between learnt graphical models of time series data on patient physiology, to predict disease scores |
title_sort | constructing training set using distance between learnt graphical models of time series data on patient physiology, to predict disease scores |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10586698/ https://www.ncbi.nlm.nih.gov/pubmed/37856497 http://dx.doi.org/10.1371/journal.pone.0292404 |
work_keys_str_mv | AT chakrabartydalia constructingtrainingsetusingdistancebetweenlearntgraphicalmodelsoftimeseriesdataonpatientphysiologytopredictdiseasescores AT wangkangrui constructingtrainingsetusingdistancebetweenlearntgraphicalmodelsoftimeseriesdataonpatientphysiologytopredictdiseasescores AT roygargi constructingtrainingsetusingdistancebetweenlearntgraphicalmodelsoftimeseriesdataonpatientphysiologytopredictdiseasescores AT bhojgariaakash constructingtrainingsetusingdistancebetweenlearntgraphicalmodelsoftimeseriesdataonpatientphysiologytopredictdiseasescores AT zhangchuqiao constructingtrainingsetusingdistancebetweenlearntgraphicalmodelsoftimeseriesdataonpatientphysiologytopredictdiseasescores AT pavlujiri constructingtrainingsetusingdistancebetweenlearntgraphicalmodelsoftimeseriesdataonpatientphysiologytopredictdiseasescores AT chakrabarttyjoydeep constructingtrainingsetusingdistancebetweenlearntgraphicalmodelsoftimeseriesdataonpatientphysiologytopredictdiseasescores |