Cargando…

Evaluation of Plaid Models in Biclustering of Gene Expression Data

Background. Biclustering algorithms for the analysis of high-dimensional gene expression data were proposed. Among them, the plaid model is arguably one of the most flexible biclustering models up to now. Objective. The main goal of this study is to provide an evaluation of plaid models. To that end...

Descripción completa

Detalles Bibliográficos
Autores principales: Alavi Majd, Hamid, Shahsavari, Soodeh, Baghestani, Ahmad Reza, Tabatabaei, Seyyed Mohammad, Khadem Bashi, Naghme, Rezaei Tavirani, Mostafa, Hamidpour, Mohsen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4804094/
https://www.ncbi.nlm.nih.gov/pubmed/27051553
http://dx.doi.org/10.1155/2016/3059767
_version_ 1782422969657589760
author Alavi Majd, Hamid
Shahsavari, Soodeh
Baghestani, Ahmad Reza
Tabatabaei, Seyyed Mohammad
Khadem Bashi, Naghme
Rezaei Tavirani, Mostafa
Hamidpour, Mohsen
author_facet Alavi Majd, Hamid
Shahsavari, Soodeh
Baghestani, Ahmad Reza
Tabatabaei, Seyyed Mohammad
Khadem Bashi, Naghme
Rezaei Tavirani, Mostafa
Hamidpour, Mohsen
author_sort Alavi Majd, Hamid
collection PubMed
description Background. Biclustering algorithms for the analysis of high-dimensional gene expression data were proposed. Among them, the plaid model is arguably one of the most flexible biclustering models up to now. Objective. The main goal of this study is to provide an evaluation of plaid models. To that end, we will investigate this model on both simulation data and real gene expression datasets. Methods. Two simulated matrices with different degrees of overlap and noise are generated and then the intrinsic structure of these data is compared with biclusters result. Also, we have searched biologically significant discovered biclusters by GO analysis. Results. When there is no noise the algorithm almost discovered all of the biclusters but when there is moderate noise in the dataset, this algorithm cannot perform very well in finding overlapping biclusters and if noise is big, the result of biclustering is not reliable. Conclusion. The plaid model needs to be modified because when there is a moderate or big noise in the data, it cannot find good biclusters. This is a statistical model and is a quite flexible one. In summary, in order to reduce the errors, model can be manipulated and distribution of error can be changed.
format Online
Article
Text
id pubmed-4804094
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-48040942016-04-05 Evaluation of Plaid Models in Biclustering of Gene Expression Data Alavi Majd, Hamid Shahsavari, Soodeh Baghestani, Ahmad Reza Tabatabaei, Seyyed Mohammad Khadem Bashi, Naghme Rezaei Tavirani, Mostafa Hamidpour, Mohsen Scientifica (Cairo) Research Article Background. Biclustering algorithms for the analysis of high-dimensional gene expression data were proposed. Among them, the plaid model is arguably one of the most flexible biclustering models up to now. Objective. The main goal of this study is to provide an evaluation of plaid models. To that end, we will investigate this model on both simulation data and real gene expression datasets. Methods. Two simulated matrices with different degrees of overlap and noise are generated and then the intrinsic structure of these data is compared with biclusters result. Also, we have searched biologically significant discovered biclusters by GO analysis. Results. When there is no noise the algorithm almost discovered all of the biclusters but when there is moderate noise in the dataset, this algorithm cannot perform very well in finding overlapping biclusters and if noise is big, the result of biclustering is not reliable. Conclusion. The plaid model needs to be modified because when there is a moderate or big noise in the data, it cannot find good biclusters. This is a statistical model and is a quite flexible one. In summary, in order to reduce the errors, model can be manipulated and distribution of error can be changed. Hindawi Publishing Corporation 2016 2016-03-09 /pmc/articles/PMC4804094/ /pubmed/27051553 http://dx.doi.org/10.1155/2016/3059767 Text en Copyright © 2016 Hamid Alavi Majd et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Alavi Majd, Hamid
Shahsavari, Soodeh
Baghestani, Ahmad Reza
Tabatabaei, Seyyed Mohammad
Khadem Bashi, Naghme
Rezaei Tavirani, Mostafa
Hamidpour, Mohsen
Evaluation of Plaid Models in Biclustering of Gene Expression Data
title Evaluation of Plaid Models in Biclustering of Gene Expression Data
title_full Evaluation of Plaid Models in Biclustering of Gene Expression Data
title_fullStr Evaluation of Plaid Models in Biclustering of Gene Expression Data
title_full_unstemmed Evaluation of Plaid Models in Biclustering of Gene Expression Data
title_short Evaluation of Plaid Models in Biclustering of Gene Expression Data
title_sort evaluation of plaid models in biclustering of gene expression data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4804094/
https://www.ncbi.nlm.nih.gov/pubmed/27051553
http://dx.doi.org/10.1155/2016/3059767
work_keys_str_mv AT alavimajdhamid evaluationofplaidmodelsinbiclusteringofgeneexpressiondata
AT shahsavarisoodeh evaluationofplaidmodelsinbiclusteringofgeneexpressiondata
AT baghestaniahmadreza evaluationofplaidmodelsinbiclusteringofgeneexpressiondata
AT tabatabaeiseyyedmohammad evaluationofplaidmodelsinbiclusteringofgeneexpressiondata
AT khadembashinaghme evaluationofplaidmodelsinbiclusteringofgeneexpressiondata
AT rezaeitaviranimostafa evaluationofplaidmodelsinbiclusteringofgeneexpressiondata
AT hamidpourmohsen evaluationofplaidmodelsinbiclusteringofgeneexpressiondata