Cargando…

Comparison of data science workflows for root cause analysis of bioprocesses

Root cause analysis (RCA) is one of the most prominent tools used to comprehensively evaluate a biopharmaceutical production process. Despite of its widespread use in industry, the Food and Drug Administration has observed a lot of unsuitable approaches for RCAs within the last years. The reasons fo...

Descripción completa

Detalles Bibliográficos
Autores principales: Borchert, Daniel, Suarez-Zuluaga, Diego A., Sagmeister, Patrick, Thomassen, Yvonne E., Herwig, Christoph
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Berlin Heidelberg 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6514075/
https://www.ncbi.nlm.nih.gov/pubmed/30377782
http://dx.doi.org/10.1007/s00449-018-2029-6
_version_ 1783417819950481408
author Borchert, Daniel
Suarez-Zuluaga, Diego A.
Sagmeister, Patrick
Thomassen, Yvonne E.
Herwig, Christoph
author_facet Borchert, Daniel
Suarez-Zuluaga, Diego A.
Sagmeister, Patrick
Thomassen, Yvonne E.
Herwig, Christoph
author_sort Borchert, Daniel
collection PubMed
description Root cause analysis (RCA) is one of the most prominent tools used to comprehensively evaluate a biopharmaceutical production process. Despite of its widespread use in industry, the Food and Drug Administration has observed a lot of unsuitable approaches for RCAs within the last years. The reasons for those unsuitable approaches are the use of incorrect variables during the analysis and the lack in process understanding, which impede correct model interpretation. Two major approaches to perform RCAs are currently dominating the chemical and pharmaceutical industry: raw data analysis and feature-based approach. Both techniques are shown to be able to identify the significant variables causing the variance of the response. Although they are different in data unfolding, the same tools as principal component analysis and partial least square regression are used in both concepts. Within this article we demonstrate the strength and weaknesses of both approaches. We proved that a fusion of both results in a comprehensive and effective workflow, which not only increases better process understanding. We demonstrate this workflow along with an example. Hence, the presented workflow allows to save analysis time and to reduce the effort of data mining by easy detection of the most important variables within the given dataset. Subsequently, the final obtained process knowledge can be translated into new hypotheses, which can be tested experimentally and thereby lead to effectively improving process robustness.
format Online
Article
Text
id pubmed-6514075
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Springer Berlin Heidelberg
record_format MEDLINE/PubMed
spelling pubmed-65140752019-05-28 Comparison of data science workflows for root cause analysis of bioprocesses Borchert, Daniel Suarez-Zuluaga, Diego A. Sagmeister, Patrick Thomassen, Yvonne E. Herwig, Christoph Bioprocess Biosyst Eng Research Paper Root cause analysis (RCA) is one of the most prominent tools used to comprehensively evaluate a biopharmaceutical production process. Despite of its widespread use in industry, the Food and Drug Administration has observed a lot of unsuitable approaches for RCAs within the last years. The reasons for those unsuitable approaches are the use of incorrect variables during the analysis and the lack in process understanding, which impede correct model interpretation. Two major approaches to perform RCAs are currently dominating the chemical and pharmaceutical industry: raw data analysis and feature-based approach. Both techniques are shown to be able to identify the significant variables causing the variance of the response. Although they are different in data unfolding, the same tools as principal component analysis and partial least square regression are used in both concepts. Within this article we demonstrate the strength and weaknesses of both approaches. We proved that a fusion of both results in a comprehensive and effective workflow, which not only increases better process understanding. We demonstrate this workflow along with an example. Hence, the presented workflow allows to save analysis time and to reduce the effort of data mining by easy detection of the most important variables within the given dataset. Subsequently, the final obtained process knowledge can be translated into new hypotheses, which can be tested experimentally and thereby lead to effectively improving process robustness. Springer Berlin Heidelberg 2018-10-31 2019 /pmc/articles/PMC6514075/ /pubmed/30377782 http://dx.doi.org/10.1007/s00449-018-2029-6 Text en © The Author(s) 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
spellingShingle Research Paper
Borchert, Daniel
Suarez-Zuluaga, Diego A.
Sagmeister, Patrick
Thomassen, Yvonne E.
Herwig, Christoph
Comparison of data science workflows for root cause analysis of bioprocesses
title Comparison of data science workflows for root cause analysis of bioprocesses
title_full Comparison of data science workflows for root cause analysis of bioprocesses
title_fullStr Comparison of data science workflows for root cause analysis of bioprocesses
title_full_unstemmed Comparison of data science workflows for root cause analysis of bioprocesses
title_short Comparison of data science workflows for root cause analysis of bioprocesses
title_sort comparison of data science workflows for root cause analysis of bioprocesses
topic Research Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6514075/
https://www.ncbi.nlm.nih.gov/pubmed/30377782
http://dx.doi.org/10.1007/s00449-018-2029-6
work_keys_str_mv AT borchertdaniel comparisonofdatascienceworkflowsforrootcauseanalysisofbioprocesses
AT suarezzuluagadiegoa comparisonofdatascienceworkflowsforrootcauseanalysisofbioprocesses
AT sagmeisterpatrick comparisonofdatascienceworkflowsforrootcauseanalysisofbioprocesses
AT thomassenyvonnee comparisonofdatascienceworkflowsforrootcauseanalysisofbioprocesses
AT herwigchristoph comparisonofdatascienceworkflowsforrootcauseanalysisofbioprocesses