Cargando…

Corruption of the Pearson correlation coefficient by measurement error and its estimation, bias, and correction under different error models

Correlation coefficients are abundantly used in the life sciences. Their use can be limited to simple exploratory analysis or to construct association networks for visualization but they are also basic ingredients for sophisticated multivariate data analysis methods. It is therefore important to hav...

Descripción completa

Detalles Bibliográficos
Autores principales: Saccenti, Edoardo, Hendriks, Margriet H. W. B., Smilde, Age K.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6965177/
https://www.ncbi.nlm.nih.gov/pubmed/31949233
http://dx.doi.org/10.1038/s41598-019-57247-4
_version_ 1783488605604282368
author Saccenti, Edoardo
Hendriks, Margriet H. W. B.
Smilde, Age K.
author_facet Saccenti, Edoardo
Hendriks, Margriet H. W. B.
Smilde, Age K.
author_sort Saccenti, Edoardo
collection PubMed
description Correlation coefficients are abundantly used in the life sciences. Their use can be limited to simple exploratory analysis or to construct association networks for visualization but they are also basic ingredients for sophisticated multivariate data analysis methods. It is therefore important to have reliable estimates for correlation coefficients. In modern life sciences, comprehensive measurement techniques are used to measure metabolites, proteins, gene-expressions and other types of data. All these measurement techniques have errors. Whereas in the old days, with simple measurements, the errors were also simple, that is not the case anymore. Errors are heterogeneous, non-constant and not independent. This hampers the quality of the estimated correlation coefficients seriously. We will discuss the different types of errors as present in modern comprehensive life science data and show with theory, simulations and real-life data how these affect the correlation coefficients. We will briefly discuss ways to improve the estimation of such coefficients.
format Online
Article
Text
id pubmed-6965177
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-69651772020-01-23 Corruption of the Pearson correlation coefficient by measurement error and its estimation, bias, and correction under different error models Saccenti, Edoardo Hendriks, Margriet H. W. B. Smilde, Age K. Sci Rep Article Correlation coefficients are abundantly used in the life sciences. Their use can be limited to simple exploratory analysis or to construct association networks for visualization but they are also basic ingredients for sophisticated multivariate data analysis methods. It is therefore important to have reliable estimates for correlation coefficients. In modern life sciences, comprehensive measurement techniques are used to measure metabolites, proteins, gene-expressions and other types of data. All these measurement techniques have errors. Whereas in the old days, with simple measurements, the errors were also simple, that is not the case anymore. Errors are heterogeneous, non-constant and not independent. This hampers the quality of the estimated correlation coefficients seriously. We will discuss the different types of errors as present in modern comprehensive life science data and show with theory, simulations and real-life data how these affect the correlation coefficients. We will briefly discuss ways to improve the estimation of such coefficients. Nature Publishing Group UK 2020-01-16 /pmc/articles/PMC6965177/ /pubmed/31949233 http://dx.doi.org/10.1038/s41598-019-57247-4 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Saccenti, Edoardo
Hendriks, Margriet H. W. B.
Smilde, Age K.
Corruption of the Pearson correlation coefficient by measurement error and its estimation, bias, and correction under different error models
title Corruption of the Pearson correlation coefficient by measurement error and its estimation, bias, and correction under different error models
title_full Corruption of the Pearson correlation coefficient by measurement error and its estimation, bias, and correction under different error models
title_fullStr Corruption of the Pearson correlation coefficient by measurement error and its estimation, bias, and correction under different error models
title_full_unstemmed Corruption of the Pearson correlation coefficient by measurement error and its estimation, bias, and correction under different error models
title_short Corruption of the Pearson correlation coefficient by measurement error and its estimation, bias, and correction under different error models
title_sort corruption of the pearson correlation coefficient by measurement error and its estimation, bias, and correction under different error models
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6965177/
https://www.ncbi.nlm.nih.gov/pubmed/31949233
http://dx.doi.org/10.1038/s41598-019-57247-4
work_keys_str_mv AT saccentiedoardo corruptionofthepearsoncorrelationcoefficientbymeasurementerroranditsestimationbiasandcorrectionunderdifferenterrormodels
AT hendriksmargriethwb corruptionofthepearsoncorrelationcoefficientbymeasurementerroranditsestimationbiasandcorrectionunderdifferenterrormodels
AT smildeagek corruptionofthepearsoncorrelationcoefficientbymeasurementerroranditsestimationbiasandcorrectionunderdifferenterrormodels