Cargando…

Mutual Information between Discrete and Continuous Data Sets

Mutual information (MI) is a powerful method for detecting relationships between data sets. There are accurate methods for estimating MI that avoid problems with “binning” when both data sets are discrete or when both data sets are continuous. We present an accurate, non-binning MI estimator for the...

Descripción completa

Detalles Bibliográficos
Autor principal: Ross, Brian C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3929353/
https://www.ncbi.nlm.nih.gov/pubmed/24586270
http://dx.doi.org/10.1371/journal.pone.0087357
_version_ 1782304384079626240
author Ross, Brian C.
author_facet Ross, Brian C.
author_sort Ross, Brian C.
collection PubMed
description Mutual information (MI) is a powerful method for detecting relationships between data sets. There are accurate methods for estimating MI that avoid problems with “binning” when both data sets are discrete or when both data sets are continuous. We present an accurate, non-binning MI estimator for the case of one discrete data set and one continuous data set. This case applies when measuring, for example, the relationship between base sequence and gene expression level, or the effect of a cancer drug on patient survival time. We also show how our method can be adapted to calculate the Jensen–Shannon divergence of two or more data sets.
format Online
Article
Text
id pubmed-3929353
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-39293532014-02-25 Mutual Information between Discrete and Continuous Data Sets Ross, Brian C. PLoS One Research Article Mutual information (MI) is a powerful method for detecting relationships between data sets. There are accurate methods for estimating MI that avoid problems with “binning” when both data sets are discrete or when both data sets are continuous. We present an accurate, non-binning MI estimator for the case of one discrete data set and one continuous data set. This case applies when measuring, for example, the relationship between base sequence and gene expression level, or the effect of a cancer drug on patient survival time. We also show how our method can be adapted to calculate the Jensen–Shannon divergence of two or more data sets. Public Library of Science 2014-02-19 /pmc/articles/PMC3929353/ /pubmed/24586270 http://dx.doi.org/10.1371/journal.pone.0087357 Text en © 2014 Brian C http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Ross, Brian C.
Mutual Information between Discrete and Continuous Data Sets
title Mutual Information between Discrete and Continuous Data Sets
title_full Mutual Information between Discrete and Continuous Data Sets
title_fullStr Mutual Information between Discrete and Continuous Data Sets
title_full_unstemmed Mutual Information between Discrete and Continuous Data Sets
title_short Mutual Information between Discrete and Continuous Data Sets
title_sort mutual information between discrete and continuous data sets
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3929353/
https://www.ncbi.nlm.nih.gov/pubmed/24586270
http://dx.doi.org/10.1371/journal.pone.0087357
work_keys_str_mv AT rossbrianc mutualinformationbetweendiscreteandcontinuousdatasets