Cargando…

NML Computation Algorithms for Tree-Structured Multinomial Bayesian Networks

Typical problems in bioinformatics involve large discrete datasets. Therefore, in order to apply statistical methods in such domains, it is important to develop efficient algorithms suitable for discrete data. The minimum description length (MDL) principle is a theoretically well-founded, general fr...

Descripción completa

Detalles Bibliográficos
Autores principales: Kontkanen, Petri, Wettig, Hannes, Myllymäki, Petri
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3171356/
https://www.ncbi.nlm.nih.gov/pubmed/18382603
http://dx.doi.org/10.1155/2007/90947
_version_ 1782211744187285504
author Kontkanen, Petri
Wettig, Hannes
Myllymäki, Petri
author_facet Kontkanen, Petri
Wettig, Hannes
Myllymäki, Petri
author_sort Kontkanen, Petri
collection PubMed
description Typical problems in bioinformatics involve large discrete datasets. Therefore, in order to apply statistical methods in such domains, it is important to develop efficient algorithms suitable for discrete data. The minimum description length (MDL) principle is a theoretically well-founded, general framework for performing statistical inference. The mathematical formalization of MDL is based on the normalized maximum likelihood (NML) distribution, which has several desirable theoretical properties. In the case of discrete data, straightforward computation of the NML distribution requires exponential time with respect to the sample size, since the definition involves a sum over all the possible data samples of a fixed size. In this paper, we first review some existing algorithms for efficient NML computation in the case of multinomial and naive Bayes model families. Then we proceed by extending these algorithms to more complex, tree-structured Bayesian networks.
format Online
Article
Text
id pubmed-3171356
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher Springer
record_format MEDLINE/PubMed
spelling pubmed-31713562011-09-13 NML Computation Algorithms for Tree-Structured Multinomial Bayesian Networks Kontkanen, Petri Wettig, Hannes Myllymäki, Petri EURASIP J Bioinform Syst Biol Research Article Typical problems in bioinformatics involve large discrete datasets. Therefore, in order to apply statistical methods in such domains, it is important to develop efficient algorithms suitable for discrete data. The minimum description length (MDL) principle is a theoretically well-founded, general framework for performing statistical inference. The mathematical formalization of MDL is based on the normalized maximum likelihood (NML) distribution, which has several desirable theoretical properties. In the case of discrete data, straightforward computation of the NML distribution requires exponential time with respect to the sample size, since the definition involves a sum over all the possible data samples of a fixed size. In this paper, we first review some existing algorithms for efficient NML computation in the case of multinomial and naive Bayes model families. Then we proceed by extending these algorithms to more complex, tree-structured Bayesian networks. Springer 2008-01-20 /pmc/articles/PMC3171356/ /pubmed/18382603 http://dx.doi.org/10.1155/2007/90947 Text en Copyright © 2007 Petri Kontkanen et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Kontkanen, Petri
Wettig, Hannes
Myllymäki, Petri
NML Computation Algorithms for Tree-Structured Multinomial Bayesian Networks
title NML Computation Algorithms for Tree-Structured Multinomial Bayesian Networks
title_full NML Computation Algorithms for Tree-Structured Multinomial Bayesian Networks
title_fullStr NML Computation Algorithms for Tree-Structured Multinomial Bayesian Networks
title_full_unstemmed NML Computation Algorithms for Tree-Structured Multinomial Bayesian Networks
title_short NML Computation Algorithms for Tree-Structured Multinomial Bayesian Networks
title_sort nml computation algorithms for tree-structured multinomial bayesian networks
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3171356/
https://www.ncbi.nlm.nih.gov/pubmed/18382603
http://dx.doi.org/10.1155/2007/90947
work_keys_str_mv AT kontkanenpetri nmlcomputationalgorithmsfortreestructuredmultinomialbayesiannetworks
AT wettighannes nmlcomputationalgorithmsfortreestructuredmultinomialbayesiannetworks
AT myllymakipetri nmlcomputationalgorithmsfortreestructuredmultinomialbayesiannetworks