Cargando…

A Beta-mixture model for dimensionality reduction, sample classification and analysis

BACKGROUND: Patterns of genome-wide methylation vary between tissue types. For example, cancer tissue shows markedly different patterns from those of normal tissue. In this paper we propose a beta-mixture model to describe genome-wide methylation patterns based on probe data from methylation microar...

Descripción completa

Detalles Bibliográficos
Autores principales: Laurila, Kirsti, Oster, Bodil, Andersen, Claus L, Lamy, Philippe, Orntoft, Torben, Yli-Harja, Olli, Wiuf, Carsten
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3126746/
https://www.ncbi.nlm.nih.gov/pubmed/21619656
http://dx.doi.org/10.1186/1471-2105-12-215
_version_ 1782207282165055488
author Laurila, Kirsti
Oster, Bodil
Andersen, Claus L
Lamy, Philippe
Orntoft, Torben
Yli-Harja, Olli
Wiuf, Carsten
author_facet Laurila, Kirsti
Oster, Bodil
Andersen, Claus L
Lamy, Philippe
Orntoft, Torben
Yli-Harja, Olli
Wiuf, Carsten
author_sort Laurila, Kirsti
collection PubMed
description BACKGROUND: Patterns of genome-wide methylation vary between tissue types. For example, cancer tissue shows markedly different patterns from those of normal tissue. In this paper we propose a beta-mixture model to describe genome-wide methylation patterns based on probe data from methylation microarrays. The model takes dependencies between neighbour probe pairs into account and assumes three broad categories of methylation, low, medium and high. The model is described by 37 parameters, which reduces the dimensionality of a typical methylation microarray significantly. We used methylation microarray data from 42 colon cancer samples to assess the model. RESULTS: Based on data from colon cancer samples we show that our model captures genome-wide characteristics of methylation patterns. We estimate the parameters of the model and show that they vary between different tissue types. Further, for each methylation probe the posterior probability of a methylation state (low, medium or high) is calculated and the probability that the state is correctly predicted is assessed. We demonstrate that the model can be applied to classify cancer tissue types accurately and that the model provides accessible and easily interpretable data summaries. CONCLUSIONS: We have developed a beta-mixture model for methylation microarray data. The model substantially reduces the dimensionality of the data. It can be used for further analysis, such as sample classification or to detect changes in methylation status between different samples and tissues.
format Online
Article
Text
id pubmed-3126746
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-31267462011-06-30 A Beta-mixture model for dimensionality reduction, sample classification and analysis Laurila, Kirsti Oster, Bodil Andersen, Claus L Lamy, Philippe Orntoft, Torben Yli-Harja, Olli Wiuf, Carsten BMC Bioinformatics Research Article BACKGROUND: Patterns of genome-wide methylation vary between tissue types. For example, cancer tissue shows markedly different patterns from those of normal tissue. In this paper we propose a beta-mixture model to describe genome-wide methylation patterns based on probe data from methylation microarrays. The model takes dependencies between neighbour probe pairs into account and assumes three broad categories of methylation, low, medium and high. The model is described by 37 parameters, which reduces the dimensionality of a typical methylation microarray significantly. We used methylation microarray data from 42 colon cancer samples to assess the model. RESULTS: Based on data from colon cancer samples we show that our model captures genome-wide characteristics of methylation patterns. We estimate the parameters of the model and show that they vary between different tissue types. Further, for each methylation probe the posterior probability of a methylation state (low, medium or high) is calculated and the probability that the state is correctly predicted is assessed. We demonstrate that the model can be applied to classify cancer tissue types accurately and that the model provides accessible and easily interpretable data summaries. CONCLUSIONS: We have developed a beta-mixture model for methylation microarray data. The model substantially reduces the dimensionality of the data. It can be used for further analysis, such as sample classification or to detect changes in methylation status between different samples and tissues. BioMed Central 2011-05-27 /pmc/articles/PMC3126746/ /pubmed/21619656 http://dx.doi.org/10.1186/1471-2105-12-215 Text en Copyright ©2011 Laurila et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Laurila, Kirsti
Oster, Bodil
Andersen, Claus L
Lamy, Philippe
Orntoft, Torben
Yli-Harja, Olli
Wiuf, Carsten
A Beta-mixture model for dimensionality reduction, sample classification and analysis
title A Beta-mixture model for dimensionality reduction, sample classification and analysis
title_full A Beta-mixture model for dimensionality reduction, sample classification and analysis
title_fullStr A Beta-mixture model for dimensionality reduction, sample classification and analysis
title_full_unstemmed A Beta-mixture model for dimensionality reduction, sample classification and analysis
title_short A Beta-mixture model for dimensionality reduction, sample classification and analysis
title_sort beta-mixture model for dimensionality reduction, sample classification and analysis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3126746/
https://www.ncbi.nlm.nih.gov/pubmed/21619656
http://dx.doi.org/10.1186/1471-2105-12-215
work_keys_str_mv AT laurilakirsti abetamixturemodelfordimensionalityreductionsampleclassificationandanalysis
AT osterbodil abetamixturemodelfordimensionalityreductionsampleclassificationandanalysis
AT andersenclausl abetamixturemodelfordimensionalityreductionsampleclassificationandanalysis
AT lamyphilippe abetamixturemodelfordimensionalityreductionsampleclassificationandanalysis
AT orntofttorben abetamixturemodelfordimensionalityreductionsampleclassificationandanalysis
AT yliharjaolli abetamixturemodelfordimensionalityreductionsampleclassificationandanalysis
AT wiufcarsten abetamixturemodelfordimensionalityreductionsampleclassificationandanalysis
AT laurilakirsti betamixturemodelfordimensionalityreductionsampleclassificationandanalysis
AT osterbodil betamixturemodelfordimensionalityreductionsampleclassificationandanalysis
AT andersenclausl betamixturemodelfordimensionalityreductionsampleclassificationandanalysis
AT lamyphilippe betamixturemodelfordimensionalityreductionsampleclassificationandanalysis
AT orntofttorben betamixturemodelfordimensionalityreductionsampleclassificationandanalysis
AT yliharjaolli betamixturemodelfordimensionalityreductionsampleclassificationandanalysis
AT wiufcarsten betamixturemodelfordimensionalityreductionsampleclassificationandanalysis