Cargando…

Impact of dataset diversity on accuracy and sensitivity of parallel factor analysis model of dissolved organic matter fluorescence excitation-emission matrix

Parallel factor (PARAFAC) analysis enables a quantitative analysis of excitation-emission matrix (EEM). The impact of a spectral variability stemmed from a diverse dataset on the representativeness of the PARAFAC model needs to be examined. In this study, samples from a river, effluent of a wastewat...

Descripción completa

Detalles Bibliográficos
Autores principales: Yu, Huarong, Liang, Heng, Qu, Fangshu, Han, Zheng-shuang, Shao, Senlin, Chang, Haiqing, Li, Guibai
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4426691/
https://www.ncbi.nlm.nih.gov/pubmed/25958786
http://dx.doi.org/10.1038/srep10207
_version_ 1782370621140762624
author Yu, Huarong
Liang, Heng
Qu, Fangshu
Han, Zheng-shuang
Shao, Senlin
Chang, Haiqing
Li, Guibai
author_facet Yu, Huarong
Liang, Heng
Qu, Fangshu
Han, Zheng-shuang
Shao, Senlin
Chang, Haiqing
Li, Guibai
author_sort Yu, Huarong
collection PubMed
description Parallel factor (PARAFAC) analysis enables a quantitative analysis of excitation-emission matrix (EEM). The impact of a spectral variability stemmed from a diverse dataset on the representativeness of the PARAFAC model needs to be examined. In this study, samples from a river, effluent of a wastewater treatment plant, and algae secretion were collected and subjected to PARAFAC analysis. PARAFAC models of global dataset and individual datasets were compared. It was found that the peak shift derived from source diversity undermined the accuracy of the global model. The results imply that building a universal PARAFAC model that can be widely available for fitting new EEMs would be quite difficult, but fitting EEMs to existing PARAFAC model that belong to a similar environment would be more realistic. The accuracy of online monitoring strategy that monitors the fluorescence intensities at the peaks of PARAFAC components was examined by correlating the EEM data with the maximum fluorescence (F(max)) modeled by PARAFAC. For the individual datasets, remarkable correlations were obtained around the peak positions. However, an analysis of cocktail datasets implies that the involvement of foreign components that are spectrally similar to local components would undermine the online monitoring strategy.
format Online
Article
Text
id pubmed-4426691
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-44266912015-05-21 Impact of dataset diversity on accuracy and sensitivity of parallel factor analysis model of dissolved organic matter fluorescence excitation-emission matrix Yu, Huarong Liang, Heng Qu, Fangshu Han, Zheng-shuang Shao, Senlin Chang, Haiqing Li, Guibai Sci Rep Article Parallel factor (PARAFAC) analysis enables a quantitative analysis of excitation-emission matrix (EEM). The impact of a spectral variability stemmed from a diverse dataset on the representativeness of the PARAFAC model needs to be examined. In this study, samples from a river, effluent of a wastewater treatment plant, and algae secretion were collected and subjected to PARAFAC analysis. PARAFAC models of global dataset and individual datasets were compared. It was found that the peak shift derived from source diversity undermined the accuracy of the global model. The results imply that building a universal PARAFAC model that can be widely available for fitting new EEMs would be quite difficult, but fitting EEMs to existing PARAFAC model that belong to a similar environment would be more realistic. The accuracy of online monitoring strategy that monitors the fluorescence intensities at the peaks of PARAFAC components was examined by correlating the EEM data with the maximum fluorescence (F(max)) modeled by PARAFAC. For the individual datasets, remarkable correlations were obtained around the peak positions. However, an analysis of cocktail datasets implies that the involvement of foreign components that are spectrally similar to local components would undermine the online monitoring strategy. Nature Publishing Group 2015-05-11 /pmc/articles/PMC4426691/ /pubmed/25958786 http://dx.doi.org/10.1038/srep10207 Text en Copyright © 2015, Macmillan Publishers Limited http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Yu, Huarong
Liang, Heng
Qu, Fangshu
Han, Zheng-shuang
Shao, Senlin
Chang, Haiqing
Li, Guibai
Impact of dataset diversity on accuracy and sensitivity of parallel factor analysis model of dissolved organic matter fluorescence excitation-emission matrix
title Impact of dataset diversity on accuracy and sensitivity of parallel factor analysis model of dissolved organic matter fluorescence excitation-emission matrix
title_full Impact of dataset diversity on accuracy and sensitivity of parallel factor analysis model of dissolved organic matter fluorescence excitation-emission matrix
title_fullStr Impact of dataset diversity on accuracy and sensitivity of parallel factor analysis model of dissolved organic matter fluorescence excitation-emission matrix
title_full_unstemmed Impact of dataset diversity on accuracy and sensitivity of parallel factor analysis model of dissolved organic matter fluorescence excitation-emission matrix
title_short Impact of dataset diversity on accuracy and sensitivity of parallel factor analysis model of dissolved organic matter fluorescence excitation-emission matrix
title_sort impact of dataset diversity on accuracy and sensitivity of parallel factor analysis model of dissolved organic matter fluorescence excitation-emission matrix
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4426691/
https://www.ncbi.nlm.nih.gov/pubmed/25958786
http://dx.doi.org/10.1038/srep10207
work_keys_str_mv AT yuhuarong impactofdatasetdiversityonaccuracyandsensitivityofparallelfactoranalysismodelofdissolvedorganicmatterfluorescenceexcitationemissionmatrix
AT liangheng impactofdatasetdiversityonaccuracyandsensitivityofparallelfactoranalysismodelofdissolvedorganicmatterfluorescenceexcitationemissionmatrix
AT qufangshu impactofdatasetdiversityonaccuracyandsensitivityofparallelfactoranalysismodelofdissolvedorganicmatterfluorescenceexcitationemissionmatrix
AT hanzhengshuang impactofdatasetdiversityonaccuracyandsensitivityofparallelfactoranalysismodelofdissolvedorganicmatterfluorescenceexcitationemissionmatrix
AT shaosenlin impactofdatasetdiversityonaccuracyandsensitivityofparallelfactoranalysismodelofdissolvedorganicmatterfluorescenceexcitationemissionmatrix
AT changhaiqing impactofdatasetdiversityonaccuracyandsensitivityofparallelfactoranalysismodelofdissolvedorganicmatterfluorescenceexcitationemissionmatrix
AT liguibai impactofdatasetdiversityonaccuracyandsensitivityofparallelfactoranalysismodelofdissolvedorganicmatterfluorescenceexcitationemissionmatrix