Cargando…
Validation and selection of ODE based systems biology models: how to arrive at more reliable decisions
BACKGROUND: Most ordinary differential equation (ODE) based modeling studies in systems biology involve a hold-out validation step for model validation. In this framework a pre-determined part of the data is used as validation data and, therefore it is not used for estimating the parameters of the m...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4493957/ https://www.ncbi.nlm.nih.gov/pubmed/26152206 http://dx.doi.org/10.1186/s12918-015-0180-0 |
_version_ | 1782380004288495616 |
---|---|
author | Hasdemir, Dicle Hoefsloot, Huub C.J Smilde, Age K. |
author_facet | Hasdemir, Dicle Hoefsloot, Huub C.J Smilde, Age K. |
author_sort | Hasdemir, Dicle |
collection | PubMed |
description | BACKGROUND: Most ordinary differential equation (ODE) based modeling studies in systems biology involve a hold-out validation step for model validation. In this framework a pre-determined part of the data is used as validation data and, therefore it is not used for estimating the parameters of the model. The model is assumed to be validated if the model predictions on the validation dataset show good agreement with the data. Model selection between alternative model structures can also be performed in the same setting, based on the predictive power of the model structures on the validation dataset. However, drawbacks associated with this approach are usually under-estimated. RESULTS: We have carried out simulations by using a recently published High Osmolarity Glycerol (HOG) pathway from S.cerevisiae to demonstrate these drawbacks. We have shown that it is very important how the data is partitioned and which part of the data is used for validation purposes. The hold-out validation strategy leads to biased conclusions, since it can lead to different validation and selection decisions when different partitioning schemes are used. Furthermore, finding sensible partitioning schemes that would lead to reliable decisions are heavily dependent on the biology and unknown model parameters which turns the problem into a paradox. This brings the need for alternative validation approaches that offer flexible partitioning of the data. For this purpose, we have introduced a stratified random cross-validation (SRCV) approach that successfully overcomes these limitations. CONCLUSIONS: SRCV leads to more stable decisions for both validation and selection which are not biased by underlying biological phenomena. Furthermore, it is less dependent on the specific noise realization in the data. Therefore, it proves to be a promising alternative to the standard hold-out validation strategy. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12918-015-0180-0) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4493957 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-44939572015-07-08 Validation and selection of ODE based systems biology models: how to arrive at more reliable decisions Hasdemir, Dicle Hoefsloot, Huub C.J Smilde, Age K. BMC Syst Biol Research Article BACKGROUND: Most ordinary differential equation (ODE) based modeling studies in systems biology involve a hold-out validation step for model validation. In this framework a pre-determined part of the data is used as validation data and, therefore it is not used for estimating the parameters of the model. The model is assumed to be validated if the model predictions on the validation dataset show good agreement with the data. Model selection between alternative model structures can also be performed in the same setting, based on the predictive power of the model structures on the validation dataset. However, drawbacks associated with this approach are usually under-estimated. RESULTS: We have carried out simulations by using a recently published High Osmolarity Glycerol (HOG) pathway from S.cerevisiae to demonstrate these drawbacks. We have shown that it is very important how the data is partitioned and which part of the data is used for validation purposes. The hold-out validation strategy leads to biased conclusions, since it can lead to different validation and selection decisions when different partitioning schemes are used. Furthermore, finding sensible partitioning schemes that would lead to reliable decisions are heavily dependent on the biology and unknown model parameters which turns the problem into a paradox. This brings the need for alternative validation approaches that offer flexible partitioning of the data. For this purpose, we have introduced a stratified random cross-validation (SRCV) approach that successfully overcomes these limitations. CONCLUSIONS: SRCV leads to more stable decisions for both validation and selection which are not biased by underlying biological phenomena. Furthermore, it is less dependent on the specific noise realization in the data. Therefore, it proves to be a promising alternative to the standard hold-out validation strategy. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12918-015-0180-0) contains supplementary material, which is available to authorized users. BioMed Central 2015-07-08 /pmc/articles/PMC4493957/ /pubmed/26152206 http://dx.doi.org/10.1186/s12918-015-0180-0 Text en © Hasdemir et al. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License(http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Hasdemir, Dicle Hoefsloot, Huub C.J Smilde, Age K. Validation and selection of ODE based systems biology models: how to arrive at more reliable decisions |
title | Validation and selection of ODE based systems biology models: how to arrive at more reliable decisions |
title_full | Validation and selection of ODE based systems biology models: how to arrive at more reliable decisions |
title_fullStr | Validation and selection of ODE based systems biology models: how to arrive at more reliable decisions |
title_full_unstemmed | Validation and selection of ODE based systems biology models: how to arrive at more reliable decisions |
title_short | Validation and selection of ODE based systems biology models: how to arrive at more reliable decisions |
title_sort | validation and selection of ode based systems biology models: how to arrive at more reliable decisions |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4493957/ https://www.ncbi.nlm.nih.gov/pubmed/26152206 http://dx.doi.org/10.1186/s12918-015-0180-0 |
work_keys_str_mv | AT hasdemirdicle validationandselectionofodebasedsystemsbiologymodelshowtoarriveatmorereliabledecisions AT hoefsloothuubcj validationandselectionofodebasedsystemsbiologymodelshowtoarriveatmorereliabledecisions AT smildeagek validationandselectionofodebasedsystemsbiologymodelshowtoarriveatmorereliabledecisions |