Cargando…

Guidelines for correlation coefficient threshold settings in metabolite correlation networks exemplified on a potato association panel

BACKGROUND: Correlation network analysis has become an integral tool to study metabolite datasets. Networks are constructed by omitting correlations between metabolites based on two thresholds—namely the r and the associated p-values. While p-value threshold settings follow the rules of multiple hyp...

Descripción completa

Detalles Bibliográficos
Autores principales: Toubiana, David, Maruenda, Helena
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7945624/
https://www.ncbi.nlm.nih.gov/pubmed/33691629
http://dx.doi.org/10.1186/s12859-021-03994-z
_version_ 1783662895355133952
author Toubiana, David
Maruenda, Helena
author_facet Toubiana, David
Maruenda, Helena
author_sort Toubiana, David
collection PubMed
description BACKGROUND: Correlation network analysis has become an integral tool to study metabolite datasets. Networks are constructed by omitting correlations between metabolites based on two thresholds—namely the r and the associated p-values. While p-value threshold settings follow the rules of multiple hypotheses testing correction, guidelines for r-value threshold settings have not been defined. RESULTS: Here, we introduce a method that allows determining the r-value threshold based on an iterative approach, where different networks are constructed and their network topology is monitored. Once the network topology changes significantly, the threshold is set to the corresponding correlation coefficient value. The approach was exemplified on: (i) a metabolite and morphological trait dataset from a potato association panel, which was grown under normal irrigation and water recovery conditions; and validated (ii) on a metabolite dataset of hearts of fed and fasted mice. For the potato normal irrigation correlation network a threshold of Pearson’s |r|≥ 0.23 was suggested, while for the water recovery correlation network a threshold of Pearson’s |r|≥ 0.41 was estimated. For both mice networks the threshold was calculated with Pearson’s |r|≥ 0.84. CONCLUSIONS: Our analysis corrected the previously stated Pearson’s correlation coefficient threshold from 0.4 to 0.41 in the water recovery network and from 0.4 to 0.23 for the normal irrigation network. Furthermore, the proposed method suggested a correlation threshold of 0.84 for both mice networks rather than a threshold of 0.7 as applied earlier. We demonstrate that the proposed approach is a valuable tool for constructing biological meaningful networks.
format Online
Article
Text
id pubmed-7945624
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-79456242021-03-11 Guidelines for correlation coefficient threshold settings in metabolite correlation networks exemplified on a potato association panel Toubiana, David Maruenda, Helena BMC Bioinformatics Methodology Article BACKGROUND: Correlation network analysis has become an integral tool to study metabolite datasets. Networks are constructed by omitting correlations between metabolites based on two thresholds—namely the r and the associated p-values. While p-value threshold settings follow the rules of multiple hypotheses testing correction, guidelines for r-value threshold settings have not been defined. RESULTS: Here, we introduce a method that allows determining the r-value threshold based on an iterative approach, where different networks are constructed and their network topology is monitored. Once the network topology changes significantly, the threshold is set to the corresponding correlation coefficient value. The approach was exemplified on: (i) a metabolite and morphological trait dataset from a potato association panel, which was grown under normal irrigation and water recovery conditions; and validated (ii) on a metabolite dataset of hearts of fed and fasted mice. For the potato normal irrigation correlation network a threshold of Pearson’s |r|≥ 0.23 was suggested, while for the water recovery correlation network a threshold of Pearson’s |r|≥ 0.41 was estimated. For both mice networks the threshold was calculated with Pearson’s |r|≥ 0.84. CONCLUSIONS: Our analysis corrected the previously stated Pearson’s correlation coefficient threshold from 0.4 to 0.41 in the water recovery network and from 0.4 to 0.23 for the normal irrigation network. Furthermore, the proposed method suggested a correlation threshold of 0.84 for both mice networks rather than a threshold of 0.7 as applied earlier. We demonstrate that the proposed approach is a valuable tool for constructing biological meaningful networks. BioMed Central 2021-03-10 /pmc/articles/PMC7945624/ /pubmed/33691629 http://dx.doi.org/10.1186/s12859-021-03994-z Text en © The Author(s) 2021 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Methodology Article
Toubiana, David
Maruenda, Helena
Guidelines for correlation coefficient threshold settings in metabolite correlation networks exemplified on a potato association panel
title Guidelines for correlation coefficient threshold settings in metabolite correlation networks exemplified on a potato association panel
title_full Guidelines for correlation coefficient threshold settings in metabolite correlation networks exemplified on a potato association panel
title_fullStr Guidelines for correlation coefficient threshold settings in metabolite correlation networks exemplified on a potato association panel
title_full_unstemmed Guidelines for correlation coefficient threshold settings in metabolite correlation networks exemplified on a potato association panel
title_short Guidelines for correlation coefficient threshold settings in metabolite correlation networks exemplified on a potato association panel
title_sort guidelines for correlation coefficient threshold settings in metabolite correlation networks exemplified on a potato association panel
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7945624/
https://www.ncbi.nlm.nih.gov/pubmed/33691629
http://dx.doi.org/10.1186/s12859-021-03994-z
work_keys_str_mv AT toubianadavid guidelinesforcorrelationcoefficientthresholdsettingsinmetabolitecorrelationnetworksexemplifiedonapotatoassociationpanel
AT maruendahelena guidelinesforcorrelationcoefficientthresholdsettingsinmetabolitecorrelationnetworksexemplifiedonapotatoassociationpanel