Cargando…
Guidelines for correlation coefficient threshold settings in metabolite correlation networks exemplified on a potato association panel
BACKGROUND: Correlation network analysis has become an integral tool to study metabolite datasets. Networks are constructed by omitting correlations between metabolites based on two thresholds—namely the r and the associated p-values. While p-value threshold settings follow the rules of multiple hyp...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7945624/ https://www.ncbi.nlm.nih.gov/pubmed/33691629 http://dx.doi.org/10.1186/s12859-021-03994-z |
_version_ | 1783662895355133952 |
---|---|
author | Toubiana, David Maruenda, Helena |
author_facet | Toubiana, David Maruenda, Helena |
author_sort | Toubiana, David |
collection | PubMed |
description | BACKGROUND: Correlation network analysis has become an integral tool to study metabolite datasets. Networks are constructed by omitting correlations between metabolites based on two thresholds—namely the r and the associated p-values. While p-value threshold settings follow the rules of multiple hypotheses testing correction, guidelines for r-value threshold settings have not been defined. RESULTS: Here, we introduce a method that allows determining the r-value threshold based on an iterative approach, where different networks are constructed and their network topology is monitored. Once the network topology changes significantly, the threshold is set to the corresponding correlation coefficient value. The approach was exemplified on: (i) a metabolite and morphological trait dataset from a potato association panel, which was grown under normal irrigation and water recovery conditions; and validated (ii) on a metabolite dataset of hearts of fed and fasted mice. For the potato normal irrigation correlation network a threshold of Pearson’s |r|≥ 0.23 was suggested, while for the water recovery correlation network a threshold of Pearson’s |r|≥ 0.41 was estimated. For both mice networks the threshold was calculated with Pearson’s |r|≥ 0.84. CONCLUSIONS: Our analysis corrected the previously stated Pearson’s correlation coefficient threshold from 0.4 to 0.41 in the water recovery network and from 0.4 to 0.23 for the normal irrigation network. Furthermore, the proposed method suggested a correlation threshold of 0.84 for both mice networks rather than a threshold of 0.7 as applied earlier. We demonstrate that the proposed approach is a valuable tool for constructing biological meaningful networks. |
format | Online Article Text |
id | pubmed-7945624 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-79456242021-03-11 Guidelines for correlation coefficient threshold settings in metabolite correlation networks exemplified on a potato association panel Toubiana, David Maruenda, Helena BMC Bioinformatics Methodology Article BACKGROUND: Correlation network analysis has become an integral tool to study metabolite datasets. Networks are constructed by omitting correlations between metabolites based on two thresholds—namely the r and the associated p-values. While p-value threshold settings follow the rules of multiple hypotheses testing correction, guidelines for r-value threshold settings have not been defined. RESULTS: Here, we introduce a method that allows determining the r-value threshold based on an iterative approach, where different networks are constructed and their network topology is monitored. Once the network topology changes significantly, the threshold is set to the corresponding correlation coefficient value. The approach was exemplified on: (i) a metabolite and morphological trait dataset from a potato association panel, which was grown under normal irrigation and water recovery conditions; and validated (ii) on a metabolite dataset of hearts of fed and fasted mice. For the potato normal irrigation correlation network a threshold of Pearson’s |r|≥ 0.23 was suggested, while for the water recovery correlation network a threshold of Pearson’s |r|≥ 0.41 was estimated. For both mice networks the threshold was calculated with Pearson’s |r|≥ 0.84. CONCLUSIONS: Our analysis corrected the previously stated Pearson’s correlation coefficient threshold from 0.4 to 0.41 in the water recovery network and from 0.4 to 0.23 for the normal irrigation network. Furthermore, the proposed method suggested a correlation threshold of 0.84 for both mice networks rather than a threshold of 0.7 as applied earlier. We demonstrate that the proposed approach is a valuable tool for constructing biological meaningful networks. BioMed Central 2021-03-10 /pmc/articles/PMC7945624/ /pubmed/33691629 http://dx.doi.org/10.1186/s12859-021-03994-z Text en © The Author(s) 2021 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Methodology Article Toubiana, David Maruenda, Helena Guidelines for correlation coefficient threshold settings in metabolite correlation networks exemplified on a potato association panel |
title | Guidelines for correlation coefficient threshold settings in metabolite correlation networks exemplified on a potato association panel |
title_full | Guidelines for correlation coefficient threshold settings in metabolite correlation networks exemplified on a potato association panel |
title_fullStr | Guidelines for correlation coefficient threshold settings in metabolite correlation networks exemplified on a potato association panel |
title_full_unstemmed | Guidelines for correlation coefficient threshold settings in metabolite correlation networks exemplified on a potato association panel |
title_short | Guidelines for correlation coefficient threshold settings in metabolite correlation networks exemplified on a potato association panel |
title_sort | guidelines for correlation coefficient threshold settings in metabolite correlation networks exemplified on a potato association panel |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7945624/ https://www.ncbi.nlm.nih.gov/pubmed/33691629 http://dx.doi.org/10.1186/s12859-021-03994-z |
work_keys_str_mv | AT toubianadavid guidelinesforcorrelationcoefficientthresholdsettingsinmetabolitecorrelationnetworksexemplifiedonapotatoassociationpanel AT maruendahelena guidelinesforcorrelationcoefficientthresholdsettingsinmetabolitecorrelationnetworksexemplifiedonapotatoassociationpanel |