Cargando…
Evaluation of Plaid Models in Biclustering of Gene Expression Data
Background. Biclustering algorithms for the analysis of high-dimensional gene expression data were proposed. Among them, the plaid model is arguably one of the most flexible biclustering models up to now. Objective. The main goal of this study is to provide an evaluation of plaid models. To that end...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi Publishing Corporation
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4804094/ https://www.ncbi.nlm.nih.gov/pubmed/27051553 http://dx.doi.org/10.1155/2016/3059767 |
_version_ | 1782422969657589760 |
---|---|
author | Alavi Majd, Hamid Shahsavari, Soodeh Baghestani, Ahmad Reza Tabatabaei, Seyyed Mohammad Khadem Bashi, Naghme Rezaei Tavirani, Mostafa Hamidpour, Mohsen |
author_facet | Alavi Majd, Hamid Shahsavari, Soodeh Baghestani, Ahmad Reza Tabatabaei, Seyyed Mohammad Khadem Bashi, Naghme Rezaei Tavirani, Mostafa Hamidpour, Mohsen |
author_sort | Alavi Majd, Hamid |
collection | PubMed |
description | Background. Biclustering algorithms for the analysis of high-dimensional gene expression data were proposed. Among them, the plaid model is arguably one of the most flexible biclustering models up to now. Objective. The main goal of this study is to provide an evaluation of plaid models. To that end, we will investigate this model on both simulation data and real gene expression datasets. Methods. Two simulated matrices with different degrees of overlap and noise are generated and then the intrinsic structure of these data is compared with biclusters result. Also, we have searched biologically significant discovered biclusters by GO analysis. Results. When there is no noise the algorithm almost discovered all of the biclusters but when there is moderate noise in the dataset, this algorithm cannot perform very well in finding overlapping biclusters and if noise is big, the result of biclustering is not reliable. Conclusion. The plaid model needs to be modified because when there is a moderate or big noise in the data, it cannot find good biclusters. This is a statistical model and is a quite flexible one. In summary, in order to reduce the errors, model can be manipulated and distribution of error can be changed. |
format | Online Article Text |
id | pubmed-4804094 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Hindawi Publishing Corporation |
record_format | MEDLINE/PubMed |
spelling | pubmed-48040942016-04-05 Evaluation of Plaid Models in Biclustering of Gene Expression Data Alavi Majd, Hamid Shahsavari, Soodeh Baghestani, Ahmad Reza Tabatabaei, Seyyed Mohammad Khadem Bashi, Naghme Rezaei Tavirani, Mostafa Hamidpour, Mohsen Scientifica (Cairo) Research Article Background. Biclustering algorithms for the analysis of high-dimensional gene expression data were proposed. Among them, the plaid model is arguably one of the most flexible biclustering models up to now. Objective. The main goal of this study is to provide an evaluation of plaid models. To that end, we will investigate this model on both simulation data and real gene expression datasets. Methods. Two simulated matrices with different degrees of overlap and noise are generated and then the intrinsic structure of these data is compared with biclusters result. Also, we have searched biologically significant discovered biclusters by GO analysis. Results. When there is no noise the algorithm almost discovered all of the biclusters but when there is moderate noise in the dataset, this algorithm cannot perform very well in finding overlapping biclusters and if noise is big, the result of biclustering is not reliable. Conclusion. The plaid model needs to be modified because when there is a moderate or big noise in the data, it cannot find good biclusters. This is a statistical model and is a quite flexible one. In summary, in order to reduce the errors, model can be manipulated and distribution of error can be changed. Hindawi Publishing Corporation 2016 2016-03-09 /pmc/articles/PMC4804094/ /pubmed/27051553 http://dx.doi.org/10.1155/2016/3059767 Text en Copyright © 2016 Hamid Alavi Majd et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Alavi Majd, Hamid Shahsavari, Soodeh Baghestani, Ahmad Reza Tabatabaei, Seyyed Mohammad Khadem Bashi, Naghme Rezaei Tavirani, Mostafa Hamidpour, Mohsen Evaluation of Plaid Models in Biclustering of Gene Expression Data |
title | Evaluation of Plaid Models in Biclustering of Gene Expression Data |
title_full | Evaluation of Plaid Models in Biclustering of Gene Expression Data |
title_fullStr | Evaluation of Plaid Models in Biclustering of Gene Expression Data |
title_full_unstemmed | Evaluation of Plaid Models in Biclustering of Gene Expression Data |
title_short | Evaluation of Plaid Models in Biclustering of Gene Expression Data |
title_sort | evaluation of plaid models in biclustering of gene expression data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4804094/ https://www.ncbi.nlm.nih.gov/pubmed/27051553 http://dx.doi.org/10.1155/2016/3059767 |
work_keys_str_mv | AT alavimajdhamid evaluationofplaidmodelsinbiclusteringofgeneexpressiondata AT shahsavarisoodeh evaluationofplaidmodelsinbiclusteringofgeneexpressiondata AT baghestaniahmadreza evaluationofplaidmodelsinbiclusteringofgeneexpressiondata AT tabatabaeiseyyedmohammad evaluationofplaidmodelsinbiclusteringofgeneexpressiondata AT khadembashinaghme evaluationofplaidmodelsinbiclusteringofgeneexpressiondata AT rezaeitaviranimostafa evaluationofplaidmodelsinbiclusteringofgeneexpressiondata AT hamidpourmohsen evaluationofplaidmodelsinbiclusteringofgeneexpressiondata |