Cargando…

Cellwise outlier detection and biomarker identification in metabolomics based on pairwise log ratios

Data outliers can carry very valuable information and might be most informative for the interpretation. Nevertheless, they are often neglected. An algorithm called cellwise outlier diagnostics using robust pairwise log ratios (cell‐rPLR) for the identification of outliers in single cell of a data ma...

Descripción completa

Detalles Bibliográficos
Autores principales: Walach, Jan, Filzmoser, Peter, Kouřil, Štěpán, Friedecký, David, Adam, Tomáš
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7063692/
https://www.ncbi.nlm.nih.gov/pubmed/32189829
http://dx.doi.org/10.1002/cem.3182
_version_ 1783504738948481024
author Walach, Jan
Filzmoser, Peter
Kouřil, Štěpán
Friedecký, David
Adam, Tomáš
author_facet Walach, Jan
Filzmoser, Peter
Kouřil, Štěpán
Friedecký, David
Adam, Tomáš
author_sort Walach, Jan
collection PubMed
description Data outliers can carry very valuable information and might be most informative for the interpretation. Nevertheless, they are often neglected. An algorithm called cellwise outlier diagnostics using robust pairwise log ratios (cell‐rPLR) for the identification of outliers in single cell of a data matrix is proposed. The algorithm is designed for metabolomic data, where due to the size effect, the measured values are not directly comparable. Pairwise log ratios between the variable values form the elemental information for the algorithm, and the aggregation of appropriate outlyingness values results in outlyingness information. A further feature of cell‐rPLR is that it is useful for biomarker identification, particularly in the presence of cellwise outliers. Real data examples and simulation studies underline the good performance of this algorithm in comparison with alternative methods.
format Online
Article
Text
id pubmed-7063692
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-70636922020-03-16 Cellwise outlier detection and biomarker identification in metabolomics based on pairwise log ratios Walach, Jan Filzmoser, Peter Kouřil, Štěpán Friedecký, David Adam, Tomáš J Chemom Research Articles Data outliers can carry very valuable information and might be most informative for the interpretation. Nevertheless, they are often neglected. An algorithm called cellwise outlier diagnostics using robust pairwise log ratios (cell‐rPLR) for the identification of outliers in single cell of a data matrix is proposed. The algorithm is designed for metabolomic data, where due to the size effect, the measured values are not directly comparable. Pairwise log ratios between the variable values form the elemental information for the algorithm, and the aggregation of appropriate outlyingness values results in outlyingness information. A further feature of cell‐rPLR is that it is useful for biomarker identification, particularly in the presence of cellwise outliers. Real data examples and simulation studies underline the good performance of this algorithm in comparison with alternative methods. John Wiley and Sons Inc. 2019-12-02 2020-01 /pmc/articles/PMC7063692/ /pubmed/32189829 http://dx.doi.org/10.1002/cem.3182 Text en © 2019 The Authors. Journal of Chemometrics published by John Wiley & Sons Ltd This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Articles
Walach, Jan
Filzmoser, Peter
Kouřil, Štěpán
Friedecký, David
Adam, Tomáš
Cellwise outlier detection and biomarker identification in metabolomics based on pairwise log ratios
title Cellwise outlier detection and biomarker identification in metabolomics based on pairwise log ratios
title_full Cellwise outlier detection and biomarker identification in metabolomics based on pairwise log ratios
title_fullStr Cellwise outlier detection and biomarker identification in metabolomics based on pairwise log ratios
title_full_unstemmed Cellwise outlier detection and biomarker identification in metabolomics based on pairwise log ratios
title_short Cellwise outlier detection and biomarker identification in metabolomics based on pairwise log ratios
title_sort cellwise outlier detection and biomarker identification in metabolomics based on pairwise log ratios
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7063692/
https://www.ncbi.nlm.nih.gov/pubmed/32189829
http://dx.doi.org/10.1002/cem.3182
work_keys_str_mv AT walachjan cellwiseoutlierdetectionandbiomarkeridentificationinmetabolomicsbasedonpairwiselogratios
AT filzmoserpeter cellwiseoutlierdetectionandbiomarkeridentificationinmetabolomicsbasedonpairwiselogratios
AT kourilstepan cellwiseoutlierdetectionandbiomarkeridentificationinmetabolomicsbasedonpairwiselogratios
AT friedeckydavid cellwiseoutlierdetectionandbiomarkeridentificationinmetabolomicsbasedonpairwiselogratios
AT adamtomas cellwiseoutlierdetectionandbiomarkeridentificationinmetabolomicsbasedonpairwiselogratios