Cargando…
Corruption of the Pearson correlation coefficient by measurement error and its estimation, bias, and correction under different error models
Correlation coefficients are abundantly used in the life sciences. Their use can be limited to simple exploratory analysis or to construct association networks for visualization but they are also basic ingredients for sophisticated multivariate data analysis methods. It is therefore important to hav...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6965177/ https://www.ncbi.nlm.nih.gov/pubmed/31949233 http://dx.doi.org/10.1038/s41598-019-57247-4 |
_version_ | 1783488605604282368 |
---|---|
author | Saccenti, Edoardo Hendriks, Margriet H. W. B. Smilde, Age K. |
author_facet | Saccenti, Edoardo Hendriks, Margriet H. W. B. Smilde, Age K. |
author_sort | Saccenti, Edoardo |
collection | PubMed |
description | Correlation coefficients are abundantly used in the life sciences. Their use can be limited to simple exploratory analysis or to construct association networks for visualization but they are also basic ingredients for sophisticated multivariate data analysis methods. It is therefore important to have reliable estimates for correlation coefficients. In modern life sciences, comprehensive measurement techniques are used to measure metabolites, proteins, gene-expressions and other types of data. All these measurement techniques have errors. Whereas in the old days, with simple measurements, the errors were also simple, that is not the case anymore. Errors are heterogeneous, non-constant and not independent. This hampers the quality of the estimated correlation coefficients seriously. We will discuss the different types of errors as present in modern comprehensive life science data and show with theory, simulations and real-life data how these affect the correlation coefficients. We will briefly discuss ways to improve the estimation of such coefficients. |
format | Online Article Text |
id | pubmed-6965177 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-69651772020-01-23 Corruption of the Pearson correlation coefficient by measurement error and its estimation, bias, and correction under different error models Saccenti, Edoardo Hendriks, Margriet H. W. B. Smilde, Age K. Sci Rep Article Correlation coefficients are abundantly used in the life sciences. Their use can be limited to simple exploratory analysis or to construct association networks for visualization but they are also basic ingredients for sophisticated multivariate data analysis methods. It is therefore important to have reliable estimates for correlation coefficients. In modern life sciences, comprehensive measurement techniques are used to measure metabolites, proteins, gene-expressions and other types of data. All these measurement techniques have errors. Whereas in the old days, with simple measurements, the errors were also simple, that is not the case anymore. Errors are heterogeneous, non-constant and not independent. This hampers the quality of the estimated correlation coefficients seriously. We will discuss the different types of errors as present in modern comprehensive life science data and show with theory, simulations and real-life data how these affect the correlation coefficients. We will briefly discuss ways to improve the estimation of such coefficients. Nature Publishing Group UK 2020-01-16 /pmc/articles/PMC6965177/ /pubmed/31949233 http://dx.doi.org/10.1038/s41598-019-57247-4 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Saccenti, Edoardo Hendriks, Margriet H. W. B. Smilde, Age K. Corruption of the Pearson correlation coefficient by measurement error and its estimation, bias, and correction under different error models |
title | Corruption of the Pearson correlation coefficient by measurement error and its estimation, bias, and correction under different error models |
title_full | Corruption of the Pearson correlation coefficient by measurement error and its estimation, bias, and correction under different error models |
title_fullStr | Corruption of the Pearson correlation coefficient by measurement error and its estimation, bias, and correction under different error models |
title_full_unstemmed | Corruption of the Pearson correlation coefficient by measurement error and its estimation, bias, and correction under different error models |
title_short | Corruption of the Pearson correlation coefficient by measurement error and its estimation, bias, and correction under different error models |
title_sort | corruption of the pearson correlation coefficient by measurement error and its estimation, bias, and correction under different error models |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6965177/ https://www.ncbi.nlm.nih.gov/pubmed/31949233 http://dx.doi.org/10.1038/s41598-019-57247-4 |
work_keys_str_mv | AT saccentiedoardo corruptionofthepearsoncorrelationcoefficientbymeasurementerroranditsestimationbiasandcorrectionunderdifferenterrormodels AT hendriksmargriethwb corruptionofthepearsoncorrelationcoefficientbymeasurementerroranditsestimationbiasandcorrectionunderdifferenterrormodels AT smildeagek corruptionofthepearsoncorrelationcoefficientbymeasurementerroranditsestimationbiasandcorrectionunderdifferenterrormodels |