NML Computation Algorithms for Tree-Structured Multinomial Bayesian Networks
Typical problems in bioinformatics involve large discrete datasets. Therefore, in order to apply statistical methods in such domains, it is important to develop efficient algorithms suitable for discrete data. The minimum description length (MDL) principle is a theoretically well-founded, general fr...
Main authors: | Kontkanen, Petri; Wettig, Hannes; Myllymäki, Petri |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Springer 2008 |
Subjects: | Research Article |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3171356/ https://www.ncbi.nlm.nih.gov/pubmed/18382603 http://dx.doi.org/10.1155/2007/90947 |
_version_ | 1782211744187285504 |
---|---|
author | Kontkanen, Petri Wettig, Hannes Myllymäki, Petri |
author_facet | Kontkanen, Petri Wettig, Hannes Myllymäki, Petri |
author_sort | Kontkanen, Petri |
collection | PubMed |
description | Typical problems in bioinformatics involve large discrete datasets. Therefore, in order to apply statistical methods in such domains, it is important to develop efficient algorithms suitable for discrete data. The minimum description length (MDL) principle is a theoretically well-founded, general framework for performing statistical inference. The mathematical formalization of MDL is based on the normalized maximum likelihood (NML) distribution, which has several desirable theoretical properties. In the case of discrete data, straightforward computation of the NML distribution requires exponential time with respect to the sample size, since the definition involves a sum over all the possible data samples of a fixed size. In this paper, we first review some existing algorithms for efficient NML computation in the case of multinomial and naive Bayes model families. Then we proceed by extending these algorithms to more complex, tree-structured Bayesian networks. |
format | Online Article Text |
id | pubmed-3171356 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2008 |
publisher | Springer |
record_format | MEDLINE/PubMed |
spelling | pubmed-3171356 2011-09-13 NML Computation Algorithms for Tree-Structured Multinomial Bayesian Networks Kontkanen, Petri Wettig, Hannes Myllymäki, Petri EURASIP J Bioinform Syst Biol Research Article Typical problems in bioinformatics involve large discrete datasets. Therefore, in order to apply statistical methods in such domains, it is important to develop efficient algorithms suitable for discrete data. The minimum description length (MDL) principle is a theoretically well-founded, general framework for performing statistical inference. The mathematical formalization of MDL is based on the normalized maximum likelihood (NML) distribution, which has several desirable theoretical properties. In the case of discrete data, straightforward computation of the NML distribution requires exponential time with respect to the sample size, since the definition involves a sum over all the possible data samples of a fixed size. In this paper, we first review some existing algorithms for efficient NML computation in the case of multinomial and naive Bayes model families. Then we proceed by extending these algorithms to more complex, tree-structured Bayesian networks. Springer 2008-01-20 /pmc/articles/PMC3171356/ /pubmed/18382603 http://dx.doi.org/10.1155/2007/90947 Text en Copyright © 2007 Petri Kontkanen et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Kontkanen, Petri Wettig, Hannes Myllymäki, Petri NML Computation Algorithms for Tree-Structured Multinomial Bayesian Networks |
title | NML Computation Algorithms for Tree-Structured Multinomial Bayesian Networks |
title_full | NML Computation Algorithms for Tree-Structured Multinomial Bayesian Networks |
title_fullStr | NML Computation Algorithms for Tree-Structured Multinomial Bayesian Networks |
title_full_unstemmed | NML Computation Algorithms for Tree-Structured Multinomial Bayesian Networks |
title_short | NML Computation Algorithms for Tree-Structured Multinomial Bayesian Networks |
title_sort | nml computation algorithms for tree-structured multinomial bayesian networks |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3171356/ https://www.ncbi.nlm.nih.gov/pubmed/18382603 http://dx.doi.org/10.1155/2007/90947 |
work_keys_str_mv | AT kontkanenpetri nmlcomputationalgorithmsfortreestructuredmultinomialbayesiannetworks AT wettighannes nmlcomputationalgorithmsfortreestructuredmultinomialbayesiannetworks AT myllymakipetri nmlcomputationalgorithmsfortreestructuredmultinomialbayesiannetworks |
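
The description field above notes that computing the NML distribution directly means summing over every possible data sample of a fixed size, which takes exponential time. As a rough illustration for the simplest case (a single multinomial variable with K values and n observations), the Python sketch below compares that brute-force sum with a linear-time recurrence known from the multinomial NML literature. The function names are hypothetical, and this record does not confirm that the recurrence is one of the algorithms the article reviews; treat it as a sketch under those assumptions, not as the authors' method.

```python
from itertools import product
from math import comb


def normalizer_bruteforce(K, n):
    """Sum the maximized likelihood over all K**n sequences (exponential in n).

    Hypothetical helper for illustration; assumes n >= 1.
    """
    total = 0.0
    for seq in product(range(K), repeat=n):
        lik = 1.0
        for k in range(K):
            h = seq.count(k)  # how often value k occurs in this sequence
            if h > 0:
                lik *= (h / n) ** h  # maximum-likelihood estimate is h / n
        total += lik
    return total


def binomial_normalizer(n):
    """Normalizer for K = 2, summing over counts instead of sequences."""
    return sum(
        comb(n, h) * (h / n) ** h * ((n - h) / n) ** (n - h)
        for h in range(n + 1)
    )


def normalizer_recurrence(K, n):
    """Normalizer via C(K+2, n) = C(K+1, n) + (n / K) * C(K, n).

    Assumed recurrence from the multinomial NML literature; assumes n >= 1.
    """
    if K == 1:
        return 1.0
    prev, curr = 1.0, binomial_normalizer(n)  # C(1, n) and C(2, n)
    for k in range(1, K - 1):
        prev, curr = curr, curr + (n / k) * prev
    return curr


if __name__ == "__main__":
    for K in (2, 3, 4):
        for n in (1, 2, 5):
            print(K, n, normalizer_bruteforce(K, n), normalizer_recurrence(K, n))
```

The NML probability of an observed sequence is its maximized likelihood divided by this normalizer, so the two functions should agree wherever the brute-force version is still feasible to run.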