Cargando…

Hierarchical Cluster-based Partial Least Squares Regression (HC-PLSR) is an efficient tool for metamodelling of nonlinear dynamic models

BACKGROUND: Deterministic dynamic models of complex biological systems contain a large number of parameters and state variables, related through nonlinear differential equations with various types of feedback. A metamodel of such a dynamic model is a statistical approximation model that maps variati...

Descripción completa

Detalles Bibliográficos
Autores principales: Tøndel, Kristin, Indahl, Ulf G, Gjuvsland, Arne B, Vik, Jon Olav, Hunter, Peter, Omholt, Stig W, Martens, Harald
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3127793/
https://www.ncbi.nlm.nih.gov/pubmed/21627852
http://dx.doi.org/10.1186/1752-0509-5-90
_version_ 1782207373209763840
author Tøndel, Kristin
Indahl, Ulf G
Gjuvsland, Arne B
Vik, Jon Olav
Hunter, Peter
Omholt, Stig W
Martens, Harald
author_facet Tøndel, Kristin
Indahl, Ulf G
Gjuvsland, Arne B
Vik, Jon Olav
Hunter, Peter
Omholt, Stig W
Martens, Harald
author_sort Tøndel, Kristin
collection PubMed
description BACKGROUND: Deterministic dynamic models of complex biological systems contain a large number of parameters and state variables, related through nonlinear differential equations with various types of feedback. A metamodel of such a dynamic model is a statistical approximation model that maps variation in parameters and initial conditions (inputs) to variation in features of the trajectories of the state variables (outputs) throughout the entire biologically relevant input space. A sufficiently accurate mapping can be exploited both instrumentally and epistemically. Multivariate regression methodology is a commonly used approach for emulating dynamic models. However, when the input-output relations are highly nonlinear or non-monotone, a standard linear regression approach is prone to give suboptimal results. We therefore hypothesised that a more accurate mapping can be obtained by locally linear or locally polynomial regression. We present here a new method for local regression modelling, Hierarchical Cluster-based PLS regression (HC-PLSR), where fuzzy C-means clustering is used to separate the data set into parts according to the structure of the response surface. We compare the metamodelling performance of HC-PLSR with polynomial partial least squares regression (PLSR) and ordinary least squares (OLS) regression on various systems: six different gene regulatory network models with various types of feedback, a deterministic mathematical model of the mammalian circadian clock and a model of the mouse ventricular myocyte function. RESULTS: Our results indicate that multivariate regression is well suited for emulating dynamic models in systems biology. The hierarchical approach turned out to be superior to both polynomial PLSR and OLS regression in all three test cases. The advantage, in terms of explained variance and prediction accuracy, was largest in systems with highly nonlinear functional relationships and in systems with positive feedback loops. CONCLUSIONS: HC-PLSR is a promising approach for metamodelling in systems biology, especially for highly nonlinear or non-monotone parameter to phenotype maps. The algorithm can be flexibly adjusted to suit the complexity of the dynamic model behaviour, inviting automation in the metamodelling of complex systems.
format Online
Article
Text
id pubmed-3127793
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-31277932011-07-01 Hierarchical Cluster-based Partial Least Squares Regression (HC-PLSR) is an efficient tool for metamodelling of nonlinear dynamic models Tøndel, Kristin Indahl, Ulf G Gjuvsland, Arne B Vik, Jon Olav Hunter, Peter Omholt, Stig W Martens, Harald BMC Syst Biol Research Article BACKGROUND: Deterministic dynamic models of complex biological systems contain a large number of parameters and state variables, related through nonlinear differential equations with various types of feedback. A metamodel of such a dynamic model is a statistical approximation model that maps variation in parameters and initial conditions (inputs) to variation in features of the trajectories of the state variables (outputs) throughout the entire biologically relevant input space. A sufficiently accurate mapping can be exploited both instrumentally and epistemically. Multivariate regression methodology is a commonly used approach for emulating dynamic models. However, when the input-output relations are highly nonlinear or non-monotone, a standard linear regression approach is prone to give suboptimal results. We therefore hypothesised that a more accurate mapping can be obtained by locally linear or locally polynomial regression. We present here a new method for local regression modelling, Hierarchical Cluster-based PLS regression (HC-PLSR), where fuzzy C-means clustering is used to separate the data set into parts according to the structure of the response surface. We compare the metamodelling performance of HC-PLSR with polynomial partial least squares regression (PLSR) and ordinary least squares (OLS) regression on various systems: six different gene regulatory network models with various types of feedback, a deterministic mathematical model of the mammalian circadian clock and a model of the mouse ventricular myocyte function. RESULTS: Our results indicate that multivariate regression is well suited for emulating dynamic models in systems biology. The hierarchical approach turned out to be superior to both polynomial PLSR and OLS regression in all three test cases. The advantage, in terms of explained variance and prediction accuracy, was largest in systems with highly nonlinear functional relationships and in systems with positive feedback loops. CONCLUSIONS: HC-PLSR is a promising approach for metamodelling in systems biology, especially for highly nonlinear or non-monotone parameter to phenotype maps. The algorithm can be flexibly adjusted to suit the complexity of the dynamic model behaviour, inviting automation in the metamodelling of complex systems. BioMed Central 2011-06-01 /pmc/articles/PMC3127793/ /pubmed/21627852 http://dx.doi.org/10.1186/1752-0509-5-90 Text en Copyright ©2011 Tøndel et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Tøndel, Kristin
Indahl, Ulf G
Gjuvsland, Arne B
Vik, Jon Olav
Hunter, Peter
Omholt, Stig W
Martens, Harald
Hierarchical Cluster-based Partial Least Squares Regression (HC-PLSR) is an efficient tool for metamodelling of nonlinear dynamic models
title Hierarchical Cluster-based Partial Least Squares Regression (HC-PLSR) is an efficient tool for metamodelling of nonlinear dynamic models
title_full Hierarchical Cluster-based Partial Least Squares Regression (HC-PLSR) is an efficient tool for metamodelling of nonlinear dynamic models
title_fullStr Hierarchical Cluster-based Partial Least Squares Regression (HC-PLSR) is an efficient tool for metamodelling of nonlinear dynamic models
title_full_unstemmed Hierarchical Cluster-based Partial Least Squares Regression (HC-PLSR) is an efficient tool for metamodelling of nonlinear dynamic models
title_short Hierarchical Cluster-based Partial Least Squares Regression (HC-PLSR) is an efficient tool for metamodelling of nonlinear dynamic models
title_sort hierarchical cluster-based partial least squares regression (hc-plsr) is an efficient tool for metamodelling of nonlinear dynamic models
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3127793/
https://www.ncbi.nlm.nih.gov/pubmed/21627852
http://dx.doi.org/10.1186/1752-0509-5-90
work_keys_str_mv AT tøndelkristin hierarchicalclusterbasedpartialleastsquaresregressionhcplsrisanefficienttoolformetamodellingofnonlineardynamicmodels
AT indahlulfg hierarchicalclusterbasedpartialleastsquaresregressionhcplsrisanefficienttoolformetamodellingofnonlineardynamicmodels
AT gjuvslandarneb hierarchicalclusterbasedpartialleastsquaresregressionhcplsrisanefficienttoolformetamodellingofnonlineardynamicmodels
AT vikjonolav hierarchicalclusterbasedpartialleastsquaresregressionhcplsrisanefficienttoolformetamodellingofnonlineardynamicmodels
AT hunterpeter hierarchicalclusterbasedpartialleastsquaresregressionhcplsrisanefficienttoolformetamodellingofnonlineardynamicmodels
AT omholtstigw hierarchicalclusterbasedpartialleastsquaresregressionhcplsrisanefficienttoolformetamodellingofnonlineardynamicmodels
AT martensharald hierarchicalclusterbasedpartialleastsquaresregressionhcplsrisanefficienttoolformetamodellingofnonlineardynamicmodels