Cargando…

Ensemble learning for prediction of the bioactivity capacity of herbal medicines from chromatographic fingerprints

BACKGROUND: Recent quality control of complex mixtures, including herbal medicines, is not limited to chemical chromatographic definition of one or two selected compounds; multivariate linear regression methods with dimension reduction or regularisation have been used to predict the bioactivity capa...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Hao, Poon, Josiah, Poon, Simon K, Cui, Lizhi, Fan, Kei, Sze, Daniel Man-yuen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4705500/
https://www.ncbi.nlm.nih.gov/pubmed/26329995
http://dx.doi.org/10.1186/1471-2105-16-S12-S4
_version_ 1782409025286045696
author Chen, Hao
Poon, Josiah
Poon, Simon K
Cui, Lizhi
Fan, Kei
Sze, Daniel Man-yuen
author_facet Chen, Hao
Poon, Josiah
Poon, Simon K
Cui, Lizhi
Fan, Kei
Sze, Daniel Man-yuen
author_sort Chen, Hao
collection PubMed
description BACKGROUND: Recent quality control of complex mixtures, including herbal medicines, is not limited to chemical chromatographic definition of one or two selected compounds; multivariate linear regression methods with dimension reduction or regularisation have been used to predict the bioactivity capacity from the chromatographic fingerprints of the herbal extracts. The challenge of this type of analysis requires a multi-dimensional approach at two levels: firstly each herb comprises complex mixtures of active and non-active chemical components; and secondly there are many factors relating to the growth, production, and processing of the herbal products. All these factors result in the significantly diverse concentrations of bioactive compounds in the herbal products. Therefore, it is imminent to have a predictive model with better generalisation that can accurately predict the bioactivity capacity of samples when only the chemical fingerprints data are available. RESULTS: In this study, the algorithm of Stacking Multivariate Linear Regression (SMLR) and a few other commonly used chemometric approaches were evaluated. They were to predict the Cluster of Differentiation 80 (CD80) expression bioactivity of a commonly used herb, Astragali Radix (AR), from the corresponding chemical chromatographic fingerprints. SMLR provides a superior prediction accuracy in comparison with the other multivariate linear regression methods of PCR, PLSR, OPLS and EN in terms of MSE(test )and the goodness of prediction of test samples. CONCLUSIONS: SMLR is a better platform than some multivariate linear regression methods. The first advantage of SMLR is that it has better generalisation to predict the bioactivity capacity of herbal medicines from their chromatographic fingerprints. Future studies should aim to further improve the SMLR algorithm. The second advantage of SMLR is that single chemical compounds can be effectively identified as highly bioactive components which demands further CD80 bioactivity confirmation..
format Online
Article
Text
id pubmed-4705500
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-47055002016-01-20 Ensemble learning for prediction of the bioactivity capacity of herbal medicines from chromatographic fingerprints Chen, Hao Poon, Josiah Poon, Simon K Cui, Lizhi Fan, Kei Sze, Daniel Man-yuen BMC Bioinformatics Research BACKGROUND: Recent quality control of complex mixtures, including herbal medicines, is not limited to chemical chromatographic definition of one or two selected compounds; multivariate linear regression methods with dimension reduction or regularisation have been used to predict the bioactivity capacity from the chromatographic fingerprints of the herbal extracts. The challenge of this type of analysis requires a multi-dimensional approach at two levels: firstly each herb comprises complex mixtures of active and non-active chemical components; and secondly there are many factors relating to the growth, production, and processing of the herbal products. All these factors result in the significantly diverse concentrations of bioactive compounds in the herbal products. Therefore, it is imminent to have a predictive model with better generalisation that can accurately predict the bioactivity capacity of samples when only the chemical fingerprints data are available. RESULTS: In this study, the algorithm of Stacking Multivariate Linear Regression (SMLR) and a few other commonly used chemometric approaches were evaluated. They were to predict the Cluster of Differentiation 80 (CD80) expression bioactivity of a commonly used herb, Astragali Radix (AR), from the corresponding chemical chromatographic fingerprints. SMLR provides a superior prediction accuracy in comparison with the other multivariate linear regression methods of PCR, PLSR, OPLS and EN in terms of MSE(test )and the goodness of prediction of test samples. CONCLUSIONS: SMLR is a better platform than some multivariate linear regression methods. The first advantage of SMLR is that it has better generalisation to predict the bioactivity capacity of herbal medicines from their chromatographic fingerprints. Future studies should aim to further improve the SMLR algorithm. The second advantage of SMLR is that single chemical compounds can be effectively identified as highly bioactive components which demands further CD80 bioactivity confirmation.. BioMed Central 2015-08-25 /pmc/articles/PMC4705500/ /pubmed/26329995 http://dx.doi.org/10.1186/1471-2105-16-S12-S4 Text en Copyright © 2015 Chen et al.; http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Chen, Hao
Poon, Josiah
Poon, Simon K
Cui, Lizhi
Fan, Kei
Sze, Daniel Man-yuen
Ensemble learning for prediction of the bioactivity capacity of herbal medicines from chromatographic fingerprints
title Ensemble learning for prediction of the bioactivity capacity of herbal medicines from chromatographic fingerprints
title_full Ensemble learning for prediction of the bioactivity capacity of herbal medicines from chromatographic fingerprints
title_fullStr Ensemble learning for prediction of the bioactivity capacity of herbal medicines from chromatographic fingerprints
title_full_unstemmed Ensemble learning for prediction of the bioactivity capacity of herbal medicines from chromatographic fingerprints
title_short Ensemble learning for prediction of the bioactivity capacity of herbal medicines from chromatographic fingerprints
title_sort ensemble learning for prediction of the bioactivity capacity of herbal medicines from chromatographic fingerprints
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4705500/
https://www.ncbi.nlm.nih.gov/pubmed/26329995
http://dx.doi.org/10.1186/1471-2105-16-S12-S4
work_keys_str_mv AT chenhao ensemblelearningforpredictionofthebioactivitycapacityofherbalmedicinesfromchromatographicfingerprints
AT poonjosiah ensemblelearningforpredictionofthebioactivitycapacityofherbalmedicinesfromchromatographicfingerprints
AT poonsimonk ensemblelearningforpredictionofthebioactivitycapacityofherbalmedicinesfromchromatographicfingerprints
AT cuilizhi ensemblelearningforpredictionofthebioactivitycapacityofherbalmedicinesfromchromatographicfingerprints
AT fankei ensemblelearningforpredictionofthebioactivitycapacityofherbalmedicinesfromchromatographicfingerprints
AT szedanielmanyuen ensemblelearningforpredictionofthebioactivitycapacityofherbalmedicinesfromchromatographicfingerprints