Cargando…
Designs for the simultaneous inference of concentration–response curves
BACKGROUND: An important problem in toxicology in the context of gene expression data is the simultaneous inference of a large number of concentration–response relationships. The quality of the inference substantially depends on the choice of design of the experiments, in particular, on the set of d...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10588042/ https://www.ncbi.nlm.nih.gov/pubmed/37858091 http://dx.doi.org/10.1186/s12859-023-05526-3 |
_version_ | 1785123493502779392 |
---|---|
author | Schürmeyer, Leonie Schorning, Kirsten Rahnenführer, Jörg |
author_facet | Schürmeyer, Leonie Schorning, Kirsten Rahnenführer, Jörg |
author_sort | Schürmeyer, Leonie |
collection | PubMed |
description | BACKGROUND: An important problem in toxicology in the context of gene expression data is the simultaneous inference of a large number of concentration–response relationships. The quality of the inference substantially depends on the choice of design of the experiments, in particular, on the set of different concentrations, at which observations are taken for the different genes under consideration. As this set has to be the same for all genes, the efficient planning of such experiments is very challenging. We address this problem by determining efficient designs for the simultaneous inference of a large number of concentration–response models. For that purpose, we both construct a D-optimality criterion for simultaneous inference and a K-means procedure which clusters the support points of the locally D-optimal designs of the individual models. RESULTS: We show that a planning of experiments that addresses the simultaneous inference of a large number of concentration–response relationships yields a substantially more accurate statistical analysis. In particular, we compare the performance of the constructed designs to the ones of other commonly used designs in terms of D-efficiencies and in terms of the quality of the resulting model fits using a real data example dealing with valproic acid. For the quality comparison we perform an extensive simulation study. CONCLUSIONS: The design maximizing the D-optimality criterion for simultaneous inference improves the inference of the different concentration–response relationships substantially. The design based on the K-means procedure also performs well, whereas a log-equidistant design, which was also included in the analysis, performs poorly in terms of the quality of the simultaneous inference. Based on our findings, the D-optimal design for simultaneous inference should be used for upcoming analyses dealing with high-dimensional gene expression data. |
format | Online Article Text |
id | pubmed-10588042 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-105880422023-10-21 Designs for the simultaneous inference of concentration–response curves Schürmeyer, Leonie Schorning, Kirsten Rahnenführer, Jörg BMC Bioinformatics Research BACKGROUND: An important problem in toxicology in the context of gene expression data is the simultaneous inference of a large number of concentration–response relationships. The quality of the inference substantially depends on the choice of design of the experiments, in particular, on the set of different concentrations, at which observations are taken for the different genes under consideration. As this set has to be the same for all genes, the efficient planning of such experiments is very challenging. We address this problem by determining efficient designs for the simultaneous inference of a large number of concentration–response models. For that purpose, we both construct a D-optimality criterion for simultaneous inference and a K-means procedure which clusters the support points of the locally D-optimal designs of the individual models. RESULTS: We show that a planning of experiments that addresses the simultaneous inference of a large number of concentration–response relationships yields a substantially more accurate statistical analysis. In particular, we compare the performance of the constructed designs to the ones of other commonly used designs in terms of D-efficiencies and in terms of the quality of the resulting model fits using a real data example dealing with valproic acid. For the quality comparison we perform an extensive simulation study. CONCLUSIONS: The design maximizing the D-optimality criterion for simultaneous inference improves the inference of the different concentration–response relationships substantially. The design based on the K-means procedure also performs well, whereas a log-equidistant design, which was also included in the analysis, performs poorly in terms of the quality of the simultaneous inference. Based on our findings, the D-optimal design for simultaneous inference should be used for upcoming analyses dealing with high-dimensional gene expression data. BioMed Central 2023-10-19 /pmc/articles/PMC10588042/ /pubmed/37858091 http://dx.doi.org/10.1186/s12859-023-05526-3 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Schürmeyer, Leonie Schorning, Kirsten Rahnenführer, Jörg Designs for the simultaneous inference of concentration–response curves |
title | Designs for the simultaneous inference of concentration–response curves |
title_full | Designs for the simultaneous inference of concentration–response curves |
title_fullStr | Designs for the simultaneous inference of concentration–response curves |
title_full_unstemmed | Designs for the simultaneous inference of concentration–response curves |
title_short | Designs for the simultaneous inference of concentration–response curves |
title_sort | designs for the simultaneous inference of concentration–response curves |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10588042/ https://www.ncbi.nlm.nih.gov/pubmed/37858091 http://dx.doi.org/10.1186/s12859-023-05526-3 |
work_keys_str_mv | AT schurmeyerleonie designsforthesimultaneousinferenceofconcentrationresponsecurves AT schorningkirsten designsforthesimultaneousinferenceofconcentrationresponsecurves AT rahnenfuhrerjorg designsforthesimultaneousinferenceofconcentrationresponsecurves |