Cargando…
Comparison of data science workflows for root cause analysis of bioprocesses
Root cause analysis (RCA) is one of the most prominent tools used to comprehensively evaluate a biopharmaceutical production process. Despite of its widespread use in industry, the Food and Drug Administration has observed a lot of unsuitable approaches for RCAs within the last years. The reasons fo...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer Berlin Heidelberg
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6514075/ https://www.ncbi.nlm.nih.gov/pubmed/30377782 http://dx.doi.org/10.1007/s00449-018-2029-6 |
_version_ | 1783417819950481408 |
---|---|
author | Borchert, Daniel Suarez-Zuluaga, Diego A. Sagmeister, Patrick Thomassen, Yvonne E. Herwig, Christoph |
author_facet | Borchert, Daniel Suarez-Zuluaga, Diego A. Sagmeister, Patrick Thomassen, Yvonne E. Herwig, Christoph |
author_sort | Borchert, Daniel |
collection | PubMed |
description | Root cause analysis (RCA) is one of the most prominent tools used to comprehensively evaluate a biopharmaceutical production process. Despite of its widespread use in industry, the Food and Drug Administration has observed a lot of unsuitable approaches for RCAs within the last years. The reasons for those unsuitable approaches are the use of incorrect variables during the analysis and the lack in process understanding, which impede correct model interpretation. Two major approaches to perform RCAs are currently dominating the chemical and pharmaceutical industry: raw data analysis and feature-based approach. Both techniques are shown to be able to identify the significant variables causing the variance of the response. Although they are different in data unfolding, the same tools as principal component analysis and partial least square regression are used in both concepts. Within this article we demonstrate the strength and weaknesses of both approaches. We proved that a fusion of both results in a comprehensive and effective workflow, which not only increases better process understanding. We demonstrate this workflow along with an example. Hence, the presented workflow allows to save analysis time and to reduce the effort of data mining by easy detection of the most important variables within the given dataset. Subsequently, the final obtained process knowledge can be translated into new hypotheses, which can be tested experimentally and thereby lead to effectively improving process robustness. |
format | Online Article Text |
id | pubmed-6514075 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Springer Berlin Heidelberg |
record_format | MEDLINE/PubMed |
spelling | pubmed-65140752019-05-28 Comparison of data science workflows for root cause analysis of bioprocesses Borchert, Daniel Suarez-Zuluaga, Diego A. Sagmeister, Patrick Thomassen, Yvonne E. Herwig, Christoph Bioprocess Biosyst Eng Research Paper Root cause analysis (RCA) is one of the most prominent tools used to comprehensively evaluate a biopharmaceutical production process. Despite of its widespread use in industry, the Food and Drug Administration has observed a lot of unsuitable approaches for RCAs within the last years. The reasons for those unsuitable approaches are the use of incorrect variables during the analysis and the lack in process understanding, which impede correct model interpretation. Two major approaches to perform RCAs are currently dominating the chemical and pharmaceutical industry: raw data analysis and feature-based approach. Both techniques are shown to be able to identify the significant variables causing the variance of the response. Although they are different in data unfolding, the same tools as principal component analysis and partial least square regression are used in both concepts. Within this article we demonstrate the strength and weaknesses of both approaches. We proved that a fusion of both results in a comprehensive and effective workflow, which not only increases better process understanding. We demonstrate this workflow along with an example. Hence, the presented workflow allows to save analysis time and to reduce the effort of data mining by easy detection of the most important variables within the given dataset. Subsequently, the final obtained process knowledge can be translated into new hypotheses, which can be tested experimentally and thereby lead to effectively improving process robustness. Springer Berlin Heidelberg 2018-10-31 2019 /pmc/articles/PMC6514075/ /pubmed/30377782 http://dx.doi.org/10.1007/s00449-018-2029-6 Text en © The Author(s) 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. |
spellingShingle | Research Paper Borchert, Daniel Suarez-Zuluaga, Diego A. Sagmeister, Patrick Thomassen, Yvonne E. Herwig, Christoph Comparison of data science workflows for root cause analysis of bioprocesses |
title | Comparison of data science workflows for root cause analysis of bioprocesses |
title_full | Comparison of data science workflows for root cause analysis of bioprocesses |
title_fullStr | Comparison of data science workflows for root cause analysis of bioprocesses |
title_full_unstemmed | Comparison of data science workflows for root cause analysis of bioprocesses |
title_short | Comparison of data science workflows for root cause analysis of bioprocesses |
title_sort | comparison of data science workflows for root cause analysis of bioprocesses |
topic | Research Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6514075/ https://www.ncbi.nlm.nih.gov/pubmed/30377782 http://dx.doi.org/10.1007/s00449-018-2029-6 |
work_keys_str_mv | AT borchertdaniel comparisonofdatascienceworkflowsforrootcauseanalysisofbioprocesses AT suarezzuluagadiegoa comparisonofdatascienceworkflowsforrootcauseanalysisofbioprocesses AT sagmeisterpatrick comparisonofdatascienceworkflowsforrootcauseanalysisofbioprocesses AT thomassenyvonnee comparisonofdatascienceworkflowsforrootcauseanalysisofbioprocesses AT herwigchristoph comparisonofdatascienceworkflowsforrootcauseanalysisofbioprocesses |