Cargando…
BicPAM: Pattern-based biclustering for biomedical data analysis
BACKGROUND: Biclustering, the discovery of sets of objects with a coherent pattern across a subset of conditions, is a critical task to study a wide-set of biomedical problems, where molecular units or patients are meaningfully related with a set of properties. The challenging combinatorial nature o...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4302537/ https://www.ncbi.nlm.nih.gov/pubmed/25649207 http://dx.doi.org/10.1186/s13015-014-0027-z |
_version_ | 1782353818367819776 |
---|---|
author | Henriques, Rui Madeira, Sara C |
author_facet | Henriques, Rui Madeira, Sara C |
author_sort | Henriques, Rui |
collection | PubMed |
description | BACKGROUND: Biclustering, the discovery of sets of objects with a coherent pattern across a subset of conditions, is a critical task to study a wide-set of biomedical problems, where molecular units or patients are meaningfully related with a set of properties. The challenging combinatorial nature of this task led to the development of approaches with restrictions on the allowed type, number and quality of biclusters. Contrasting, recent biclustering approaches relying on pattern mining methods can exhaustively discover flexible structures of robust biclusters. However, these approaches are only prepared to discover constant biclusters and their underlying contributions remain dispersed. METHODS: The proposed BicPAM biclustering approach integrates existing principles made available by state-of-the-art pattern-based approaches with two new contributions. First, BicPAM is the first efficient attempt to exhaustively mine non-constant types of biclusters, including additive and multiplicative coherencies in the presence or absence of symmetries. Second, BicPAM provides strategies to effectively compose different biclustering structures and to handle arbitrary levels of noise inherent to data and with discretization procedures. RESULTS: Results show BicPAM’s superiority against its peers and its ability to retrieve unique types of biclusters of interest, to efficiently deliver exhaustive solutions and to successfully recover planted biclusters in datasets with varying levels of missing values and noise. Its application over gene expression data leads to unique solutions with heightened biological relevance. CONCLUSIONS: BicPAM approaches integrate existing disperse efforts towards pattern-based biclustering and provides the first critical strategies to efficiently discover exhaustive solutions of biclusters with shifting, scaling and symmetric assumptions with varying quality and underlying structures. Additionally, BicPAM dynamically adapts its behavior to mine data with different levels of missing values and noise. |
format | Online Article Text |
id | pubmed-4302537 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-43025372015-02-03 BicPAM: Pattern-based biclustering for biomedical data analysis Henriques, Rui Madeira, Sara C Algorithms Mol Biol Research BACKGROUND: Biclustering, the discovery of sets of objects with a coherent pattern across a subset of conditions, is a critical task to study a wide-set of biomedical problems, where molecular units or patients are meaningfully related with a set of properties. The challenging combinatorial nature of this task led to the development of approaches with restrictions on the allowed type, number and quality of biclusters. Contrasting, recent biclustering approaches relying on pattern mining methods can exhaustively discover flexible structures of robust biclusters. However, these approaches are only prepared to discover constant biclusters and their underlying contributions remain dispersed. METHODS: The proposed BicPAM biclustering approach integrates existing principles made available by state-of-the-art pattern-based approaches with two new contributions. First, BicPAM is the first efficient attempt to exhaustively mine non-constant types of biclusters, including additive and multiplicative coherencies in the presence or absence of symmetries. Second, BicPAM provides strategies to effectively compose different biclustering structures and to handle arbitrary levels of noise inherent to data and with discretization procedures. RESULTS: Results show BicPAM’s superiority against its peers and its ability to retrieve unique types of biclusters of interest, to efficiently deliver exhaustive solutions and to successfully recover planted biclusters in datasets with varying levels of missing values and noise. Its application over gene expression data leads to unique solutions with heightened biological relevance. CONCLUSIONS: BicPAM approaches integrate existing disperse efforts towards pattern-based biclustering and provides the first critical strategies to efficiently discover exhaustive solutions of biclusters with shifting, scaling and symmetric assumptions with varying quality and underlying structures. Additionally, BicPAM dynamically adapts its behavior to mine data with different levels of missing values and noise. BioMed Central 2014-12-16 /pmc/articles/PMC4302537/ /pubmed/25649207 http://dx.doi.org/10.1186/s13015-014-0027-z Text en © Henriques and Madeira; licensee BioMed Central. 2014 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Henriques, Rui Madeira, Sara C BicPAM: Pattern-based biclustering for biomedical data analysis |
title | BicPAM: Pattern-based biclustering for biomedical data analysis |
title_full | BicPAM: Pattern-based biclustering for biomedical data analysis |
title_fullStr | BicPAM: Pattern-based biclustering for biomedical data analysis |
title_full_unstemmed | BicPAM: Pattern-based biclustering for biomedical data analysis |
title_short | BicPAM: Pattern-based biclustering for biomedical data analysis |
title_sort | bicpam: pattern-based biclustering for biomedical data analysis |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4302537/ https://www.ncbi.nlm.nih.gov/pubmed/25649207 http://dx.doi.org/10.1186/s13015-014-0027-z |
work_keys_str_mv | AT henriquesrui bicpampatternbasedbiclusteringforbiomedicaldataanalysis AT madeirasarac bicpampatternbasedbiclusteringforbiomedicaldataanalysis |