Cargando…

Improved One-Class Modeling of High-Dimensional Metabolomics Data via Eigenvalue-Shrinkage

One-class modelling is a useful approach in metabolomics for the untargeted detection of abnormal metabolite profiles, when information from a set of reference observations is available to model “normal” or baseline metabolite profiles. Such outlying profiles are typically identified by comparing th...

Descripción completa

Detalles Bibliográficos
Autores principales: Brini, Alberto, Avagyan, Vahe, de Vos, Ric C. H., Vossen, Jack H., van den Heuvel, Edwin R., Engel, Jasper
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8069634/
https://www.ncbi.nlm.nih.gov/pubmed/33924479
http://dx.doi.org/10.3390/metabo11040237
_version_ 1783683283177963520
author Brini, Alberto
Avagyan, Vahe
de Vos, Ric C. H.
Vossen, Jack H.
van den Heuvel, Edwin R.
Engel, Jasper
author_facet Brini, Alberto
Avagyan, Vahe
de Vos, Ric C. H.
Vossen, Jack H.
van den Heuvel, Edwin R.
Engel, Jasper
author_sort Brini, Alberto
collection PubMed
description One-class modelling is a useful approach in metabolomics for the untargeted detection of abnormal metabolite profiles, when information from a set of reference observations is available to model “normal” or baseline metabolite profiles. Such outlying profiles are typically identified by comparing the distance between an observation and the reference class to a critical limit. Often, multivariate distance measures such as the Mahalanobis distance ([Formula: see text]) or principal component-based measures are used. These approaches, however, are either not applicable to untargeted metabolomics data, or their results are unreliable. In this paper, five distance measures for one-class modeling in untargeted metabolites are proposed. They are based on a combination of the [Formula: see text] and five so-called eigenvalue-shrinkage estimators of the covariance matrix of the reference class. A simple cross-validation procedure is proposed to set the critical limit for outlier detection. Simulation studies are used to identify which distance measure provides the best performance for one-class modeling, in terms of type I error and power to identify abnormal metabolite profiles. Empirical evidence demonstrates that this method has better type I error (false positive rate) and improved outlier detection power than the standard (principal component-based) one-class models. The method is illustrated by its application to liquid chromatography coupled to mass spectrometry (LC-MS) and nuclear magnetic response spectroscopy (NMR) untargeted metabolomics data from two studies on food safety assessment and diagnosis of rare diseases, respectively.
format Online
Article
Text
id pubmed-8069634
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-80696342021-04-26 Improved One-Class Modeling of High-Dimensional Metabolomics Data via Eigenvalue-Shrinkage Brini, Alberto Avagyan, Vahe de Vos, Ric C. H. Vossen, Jack H. van den Heuvel, Edwin R. Engel, Jasper Metabolites Article One-class modelling is a useful approach in metabolomics for the untargeted detection of abnormal metabolite profiles, when information from a set of reference observations is available to model “normal” or baseline metabolite profiles. Such outlying profiles are typically identified by comparing the distance between an observation and the reference class to a critical limit. Often, multivariate distance measures such as the Mahalanobis distance ([Formula: see text]) or principal component-based measures are used. These approaches, however, are either not applicable to untargeted metabolomics data, or their results are unreliable. In this paper, five distance measures for one-class modeling in untargeted metabolites are proposed. They are based on a combination of the [Formula: see text] and five so-called eigenvalue-shrinkage estimators of the covariance matrix of the reference class. A simple cross-validation procedure is proposed to set the critical limit for outlier detection. Simulation studies are used to identify which distance measure provides the best performance for one-class modeling, in terms of type I error and power to identify abnormal metabolite profiles. Empirical evidence demonstrates that this method has better type I error (false positive rate) and improved outlier detection power than the standard (principal component-based) one-class models. The method is illustrated by its application to liquid chromatography coupled to mass spectrometry (LC-MS) and nuclear magnetic response spectroscopy (NMR) untargeted metabolomics data from two studies on food safety assessment and diagnosis of rare diseases, respectively. MDPI 2021-04-13 /pmc/articles/PMC8069634/ /pubmed/33924479 http://dx.doi.org/10.3390/metabo11040237 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Brini, Alberto
Avagyan, Vahe
de Vos, Ric C. H.
Vossen, Jack H.
van den Heuvel, Edwin R.
Engel, Jasper
Improved One-Class Modeling of High-Dimensional Metabolomics Data via Eigenvalue-Shrinkage
title Improved One-Class Modeling of High-Dimensional Metabolomics Data via Eigenvalue-Shrinkage
title_full Improved One-Class Modeling of High-Dimensional Metabolomics Data via Eigenvalue-Shrinkage
title_fullStr Improved One-Class Modeling of High-Dimensional Metabolomics Data via Eigenvalue-Shrinkage
title_full_unstemmed Improved One-Class Modeling of High-Dimensional Metabolomics Data via Eigenvalue-Shrinkage
title_short Improved One-Class Modeling of High-Dimensional Metabolomics Data via Eigenvalue-Shrinkage
title_sort improved one-class modeling of high-dimensional metabolomics data via eigenvalue-shrinkage
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8069634/
https://www.ncbi.nlm.nih.gov/pubmed/33924479
http://dx.doi.org/10.3390/metabo11040237
work_keys_str_mv AT brinialberto improvedoneclassmodelingofhighdimensionalmetabolomicsdataviaeigenvalueshrinkage
AT avagyanvahe improvedoneclassmodelingofhighdimensionalmetabolomicsdataviaeigenvalueshrinkage
AT devosricch improvedoneclassmodelingofhighdimensionalmetabolomicsdataviaeigenvalueshrinkage
AT vossenjackh improvedoneclassmodelingofhighdimensionalmetabolomicsdataviaeigenvalueshrinkage
AT vandenheuveledwinr improvedoneclassmodelingofhighdimensionalmetabolomicsdataviaeigenvalueshrinkage
AT engeljasper improvedoneclassmodelingofhighdimensionalmetabolomicsdataviaeigenvalueshrinkage