Cargando…
Quantifying Total Influence between Variables with Information Theoretic and Machine Learning Techniques
The increasingly sophisticated investigations of complex systems require more robust estimates of the correlations between the measured quantities. The traditional Pearson correlation coefficient is easy to calculate but sensitive only to linear correlations. The total influence between quantities i...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7516551/ https://www.ncbi.nlm.nih.gov/pubmed/33285916 http://dx.doi.org/10.3390/e22020141 |
_version_ | 1783587027903578112 |
---|---|
author | Murari, Andrea Rossi, Riccardo Lungaroni, Michele Gaudio, Pasquale Gelfusa, Michela |
author_facet | Murari, Andrea Rossi, Riccardo Lungaroni, Michele Gaudio, Pasquale Gelfusa, Michela |
author_sort | Murari, Andrea |
collection | PubMed |
description | The increasingly sophisticated investigations of complex systems require more robust estimates of the correlations between the measured quantities. The traditional Pearson correlation coefficient is easy to calculate but sensitive only to linear correlations. The total influence between quantities is, therefore, often expressed in terms of the mutual information, which also takes into account the nonlinear effects but is not normalized. To compare data from different experiments, the information quality ratio is, therefore, in many cases, of easier interpretation. On the other hand, both mutual information and information quality ratio are always positive and, therefore, cannot provide information about the sign of the influence between quantities. Moreover, they require an accurate determination of the probability distribution functions of the variables involved. As the quality and amount of data available are not always sufficient to grant an accurate estimation of the probability distribution functions, it has been investigated whether neural computational tools can help and complement the aforementioned indicators. Specific encoders and autoencoders have been developed for the task of determining the total correlation between quantities related by a functional dependence, including information about the sign of their mutual influence. Both their accuracy and computational efficiencies have been addressed in detail, with extensive numerical tests using synthetic data. A careful analysis of the robustness against noise has also been performed. The neural computational tools typically outperform the traditional indicators in practically every respect. |
format | Online Article Text |
id | pubmed-7516551 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-75165512020-11-09 Quantifying Total Influence between Variables with Information Theoretic and Machine Learning Techniques Murari, Andrea Rossi, Riccardo Lungaroni, Michele Gaudio, Pasquale Gelfusa, Michela Entropy (Basel) Article The increasingly sophisticated investigations of complex systems require more robust estimates of the correlations between the measured quantities. The traditional Pearson correlation coefficient is easy to calculate but sensitive only to linear correlations. The total influence between quantities is, therefore, often expressed in terms of the mutual information, which also takes into account the nonlinear effects but is not normalized. To compare data from different experiments, the information quality ratio is, therefore, in many cases, of easier interpretation. On the other hand, both mutual information and information quality ratio are always positive and, therefore, cannot provide information about the sign of the influence between quantities. Moreover, they require an accurate determination of the probability distribution functions of the variables involved. As the quality and amount of data available are not always sufficient to grant an accurate estimation of the probability distribution functions, it has been investigated whether neural computational tools can help and complement the aforementioned indicators. Specific encoders and autoencoders have been developed for the task of determining the total correlation between quantities related by a functional dependence, including information about the sign of their mutual influence. Both their accuracy and computational efficiencies have been addressed in detail, with extensive numerical tests using synthetic data. A careful analysis of the robustness against noise has also been performed. The neural computational tools typically outperform the traditional indicators in practically every respect. MDPI 2020-01-24 /pmc/articles/PMC7516551/ /pubmed/33285916 http://dx.doi.org/10.3390/e22020141 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Murari, Andrea Rossi, Riccardo Lungaroni, Michele Gaudio, Pasquale Gelfusa, Michela Quantifying Total Influence between Variables with Information Theoretic and Machine Learning Techniques |
title | Quantifying Total Influence between Variables with Information Theoretic and Machine Learning Techniques |
title_full | Quantifying Total Influence between Variables with Information Theoretic and Machine Learning Techniques |
title_fullStr | Quantifying Total Influence between Variables with Information Theoretic and Machine Learning Techniques |
title_full_unstemmed | Quantifying Total Influence between Variables with Information Theoretic and Machine Learning Techniques |
title_short | Quantifying Total Influence between Variables with Information Theoretic and Machine Learning Techniques |
title_sort | quantifying total influence between variables with information theoretic and machine learning techniques |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7516551/ https://www.ncbi.nlm.nih.gov/pubmed/33285916 http://dx.doi.org/10.3390/e22020141 |
work_keys_str_mv | AT murariandrea quantifyingtotalinfluencebetweenvariableswithinformationtheoreticandmachinelearningtechniques AT rossiriccardo quantifyingtotalinfluencebetweenvariableswithinformationtheoreticandmachinelearningtechniques AT lungaronimichele quantifyingtotalinfluencebetweenvariableswithinformationtheoreticandmachinelearningtechniques AT gaudiopasquale quantifyingtotalinfluencebetweenvariableswithinformationtheoreticandmachinelearningtechniques AT gelfusamichela quantifyingtotalinfluencebetweenvariableswithinformationtheoreticandmachinelearningtechniques |