Cargando…

Automated data analytics workflow for stability experiments based on regression analysis

INTRODUCTION: We define a designated data analytics workflow for the evaluation of stability experiments, which takes all data situations into account. This complements the evaluation described by the CLSI EP25 [1] guideline by including a targeted exception handling algorithm and thus allows one to...

Descripción completa

Detalles Bibliográficos
Autores principales: Geistanger, Andrea, Braese, Kathrin, Laubender, Ruediger
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8844906/
https://www.ncbi.nlm.nih.gov/pubmed/35199095
http://dx.doi.org/10.1016/j.jmsacl.2022.01.001
_version_ 1784651566909751296
author Geistanger, Andrea
Braese, Kathrin
Laubender, Ruediger
author_facet Geistanger, Andrea
Braese, Kathrin
Laubender, Ruediger
author_sort Geistanger, Andrea
collection PubMed
description INTRODUCTION: We define a designated data analytics workflow for the evaluation of stability experiments, which takes all data situations into account. This complements the evaluation described by the CLSI EP25 [1] guideline by including a targeted exception handling algorithm and thus allows one to automatically evaluate stability data based on linear regression analysis. DESCRIPTION: The evaluation of stability experiments based on regression analysis requires the calculation of the confidence interval of the regression line. The stability time is estimated at the intersection of the confidence interval with the acceptance criterion. This approach results in solving a quadratic equation, with factors that depend on the estimated intercept, slope, the measurement variability and the chosen timepoints. When defining an automated data analytics workflow for this problem, the different cases for the solutions of the quadratic equation must be considered. For some data situations there might be no solution at all, other data situations result in a negative and a positive solution and finally there might be even two positive solutions. All these cases have to be considered for the choice of the right solution to become the estimated stability time. The CLSI EP25 [1] guideline on stability evaluation of in vitro diagnostic reagents addresses this problem only superficially and might even lead to incorrect results for some specific data scenarios. RESULTS: We evaluate all possible data scenarios and provide examples for each. Based on the gained theoretical insights, we define a designated data analytics workflow and visualize it with a flowchart. By following this flowchart one can implement an automated analysis workflow, targeting all data scenarios with the appropriate exception handling. DISCUSSION: We deduce that the description for obtaining stability times according to CLSI EP25 is not fully adequate, as it addresses only best-case scenarios. However, for automated data analytics workflows all possible data situations have to be considered. With the here presented workflow one can program automated data analytics pipelines, which ensure that the right stability time is obtained, in case it exists. In addition all exceptions, where no stability times are present, are addressed in the right way and it provides hints as to the failure reason.
format Online
Article
Text
id pubmed-8844906
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-88449062022-02-22 Automated data analytics workflow for stability experiments based on regression analysis Geistanger, Andrea Braese, Kathrin Laubender, Ruediger J Mass Spectrom Adv Clin Lab Research Article INTRODUCTION: We define a designated data analytics workflow for the evaluation of stability experiments, which takes all data situations into account. This complements the evaluation described by the CLSI EP25 [1] guideline by including a targeted exception handling algorithm and thus allows one to automatically evaluate stability data based on linear regression analysis. DESCRIPTION: The evaluation of stability experiments based on regression analysis requires the calculation of the confidence interval of the regression line. The stability time is estimated at the intersection of the confidence interval with the acceptance criterion. This approach results in solving a quadratic equation, with factors that depend on the estimated intercept, slope, the measurement variability and the chosen timepoints. When defining an automated data analytics workflow for this problem, the different cases for the solutions of the quadratic equation must be considered. For some data situations there might be no solution at all, other data situations result in a negative and a positive solution and finally there might be even two positive solutions. All these cases have to be considered for the choice of the right solution to become the estimated stability time. The CLSI EP25 [1] guideline on stability evaluation of in vitro diagnostic reagents addresses this problem only superficially and might even lead to incorrect results for some specific data scenarios. RESULTS: We evaluate all possible data scenarios and provide examples for each. Based on the gained theoretical insights, we define a designated data analytics workflow and visualize it with a flowchart. By following this flowchart one can implement an automated analysis workflow, targeting all data scenarios with the appropriate exception handling. DISCUSSION: We deduce that the description for obtaining stability times according to CLSI EP25 is not fully adequate, as it addresses only best-case scenarios. However, for automated data analytics workflows all possible data situations have to be considered. With the here presented workflow one can program automated data analytics pipelines, which ensure that the right stability time is obtained, in case it exists. In addition all exceptions, where no stability times are present, are addressed in the right way and it provides hints as to the failure reason. Elsevier 2022-02-08 /pmc/articles/PMC8844906/ /pubmed/35199095 http://dx.doi.org/10.1016/j.jmsacl.2022.01.001 Text en © 2022 THE AUTHORS https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Research Article
Geistanger, Andrea
Braese, Kathrin
Laubender, Ruediger
Automated data analytics workflow for stability experiments based on regression analysis
title Automated data analytics workflow for stability experiments based on regression analysis
title_full Automated data analytics workflow for stability experiments based on regression analysis
title_fullStr Automated data analytics workflow for stability experiments based on regression analysis
title_full_unstemmed Automated data analytics workflow for stability experiments based on regression analysis
title_short Automated data analytics workflow for stability experiments based on regression analysis
title_sort automated data analytics workflow for stability experiments based on regression analysis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8844906/
https://www.ncbi.nlm.nih.gov/pubmed/35199095
http://dx.doi.org/10.1016/j.jmsacl.2022.01.001
work_keys_str_mv AT geistangerandrea automateddataanalyticsworkflowforstabilityexperimentsbasedonregressionanalysis
AT braesekathrin automateddataanalyticsworkflowforstabilityexperimentsbasedonregressionanalysis
AT laubenderruediger automateddataanalyticsworkflowforstabilityexperimentsbasedonregressionanalysis