Cargando…

BicPAM: Pattern-based biclustering for biomedical data analysis

BACKGROUND: Biclustering, the discovery of sets of objects with a coherent pattern across a subset of conditions, is a critical task to study a wide-set of biomedical problems, where molecular units or patients are meaningfully related with a set of properties. The challenging combinatorial nature o...

Descripción completa

Detalles Bibliográficos
Autores principales: Henriques, Rui, Madeira, Sara C
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4302537/
https://www.ncbi.nlm.nih.gov/pubmed/25649207
http://dx.doi.org/10.1186/s13015-014-0027-z
_version_ 1782353818367819776
author Henriques, Rui
Madeira, Sara C
author_facet Henriques, Rui
Madeira, Sara C
author_sort Henriques, Rui
collection PubMed
description BACKGROUND: Biclustering, the discovery of sets of objects with a coherent pattern across a subset of conditions, is a critical task to study a wide-set of biomedical problems, where molecular units or patients are meaningfully related with a set of properties. The challenging combinatorial nature of this task led to the development of approaches with restrictions on the allowed type, number and quality of biclusters. Contrasting, recent biclustering approaches relying on pattern mining methods can exhaustively discover flexible structures of robust biclusters. However, these approaches are only prepared to discover constant biclusters and their underlying contributions remain dispersed. METHODS: The proposed BicPAM biclustering approach integrates existing principles made available by state-of-the-art pattern-based approaches with two new contributions. First, BicPAM is the first efficient attempt to exhaustively mine non-constant types of biclusters, including additive and multiplicative coherencies in the presence or absence of symmetries. Second, BicPAM provides strategies to effectively compose different biclustering structures and to handle arbitrary levels of noise inherent to data and with discretization procedures. RESULTS: Results show BicPAM’s superiority against its peers and its ability to retrieve unique types of biclusters of interest, to efficiently deliver exhaustive solutions and to successfully recover planted biclusters in datasets with varying levels of missing values and noise. Its application over gene expression data leads to unique solutions with heightened biological relevance. CONCLUSIONS: BicPAM approaches integrate existing disperse efforts towards pattern-based biclustering and provides the first critical strategies to efficiently discover exhaustive solutions of biclusters with shifting, scaling and symmetric assumptions with varying quality and underlying structures. Additionally, BicPAM dynamically adapts its behavior to mine data with different levels of missing values and noise.
format Online
Article
Text
id pubmed-4302537
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-43025372015-02-03 BicPAM: Pattern-based biclustering for biomedical data analysis Henriques, Rui Madeira, Sara C Algorithms Mol Biol Research BACKGROUND: Biclustering, the discovery of sets of objects with a coherent pattern across a subset of conditions, is a critical task to study a wide-set of biomedical problems, where molecular units or patients are meaningfully related with a set of properties. The challenging combinatorial nature of this task led to the development of approaches with restrictions on the allowed type, number and quality of biclusters. Contrasting, recent biclustering approaches relying on pattern mining methods can exhaustively discover flexible structures of robust biclusters. However, these approaches are only prepared to discover constant biclusters and their underlying contributions remain dispersed. METHODS: The proposed BicPAM biclustering approach integrates existing principles made available by state-of-the-art pattern-based approaches with two new contributions. First, BicPAM is the first efficient attempt to exhaustively mine non-constant types of biclusters, including additive and multiplicative coherencies in the presence or absence of symmetries. Second, BicPAM provides strategies to effectively compose different biclustering structures and to handle arbitrary levels of noise inherent to data and with discretization procedures. RESULTS: Results show BicPAM’s superiority against its peers and its ability to retrieve unique types of biclusters of interest, to efficiently deliver exhaustive solutions and to successfully recover planted biclusters in datasets with varying levels of missing values and noise. Its application over gene expression data leads to unique solutions with heightened biological relevance. CONCLUSIONS: BicPAM approaches integrate existing disperse efforts towards pattern-based biclustering and provides the first critical strategies to efficiently discover exhaustive solutions of biclusters with shifting, scaling and symmetric assumptions with varying quality and underlying structures. Additionally, BicPAM dynamically adapts its behavior to mine data with different levels of missing values and noise. BioMed Central 2014-12-16 /pmc/articles/PMC4302537/ /pubmed/25649207 http://dx.doi.org/10.1186/s13015-014-0027-z Text en © Henriques and Madeira; licensee BioMed Central. 2014 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Henriques, Rui
Madeira, Sara C
BicPAM: Pattern-based biclustering for biomedical data analysis
title BicPAM: Pattern-based biclustering for biomedical data analysis
title_full BicPAM: Pattern-based biclustering for biomedical data analysis
title_fullStr BicPAM: Pattern-based biclustering for biomedical data analysis
title_full_unstemmed BicPAM: Pattern-based biclustering for biomedical data analysis
title_short BicPAM: Pattern-based biclustering for biomedical data analysis
title_sort bicpam: pattern-based biclustering for biomedical data analysis
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4302537/
https://www.ncbi.nlm.nih.gov/pubmed/25649207
http://dx.doi.org/10.1186/s13015-014-0027-z
work_keys_str_mv AT henriquesrui bicpampatternbasedbiclusteringforbiomedicaldataanalysis
AT madeirasarac bicpampatternbasedbiclusteringforbiomedicaldataanalysis