Cargando…
A binary biclustering algorithm based on the adjacency difference matrix for gene expression data analysis
Biclustering algorithm is an effective tool for processing gene expression datasets. There are two kinds of data matrices, binary data and non-binary data, which are processed by biclustering method. A binary matrix is usually converted from pre-processed gene expression data, which can effectively...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9484244/ https://www.ncbi.nlm.nih.gov/pubmed/36123637 http://dx.doi.org/10.1186/s12859-022-04842-4 |
_version_ | 1784791840553172992 |
---|---|
author | Chu, He-Ming Liu, Jin-Xing Zhang, Ke Zheng, Chun-Hou Wang, Juan Kong, Xiang-Zhen |
author_facet | Chu, He-Ming Liu, Jin-Xing Zhang, Ke Zheng, Chun-Hou Wang, Juan Kong, Xiang-Zhen |
author_sort | Chu, He-Ming |
collection | PubMed |
description | Biclustering algorithm is an effective tool for processing gene expression datasets. There are two kinds of data matrices, binary data and non-binary data, which are processed by biclustering method. A binary matrix is usually converted from pre-processed gene expression data, which can effectively reduce the interference from noise and abnormal data, and is then processed using a biclustering algorithm. However, biclustering algorithms of dealing with binary data have a poor balance between running time and performance. In this paper, we propose a new biclustering algorithm called the Adjacency Difference Matrix Binary Biclustering algorithm (AMBB) for dealing with binary data to address the drawback. The AMBB algorithm constructs the adjacency matrix based on the adjacency difference values, and the submatrix obtained by continuously updating the adjacency difference matrix is called a bicluster. The adjacency matrix allows for clustering of gene that undergo similar reactions under different conditions into clusters, which is important for subsequent genes analysis. Meanwhile, experiments on synthetic and real datasets visually demonstrate that the AMBB algorithm has high practicability. |
format | Online Article Text |
id | pubmed-9484244 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-94842442022-09-20 A binary biclustering algorithm based on the adjacency difference matrix for gene expression data analysis Chu, He-Ming Liu, Jin-Xing Zhang, Ke Zheng, Chun-Hou Wang, Juan Kong, Xiang-Zhen BMC Bioinformatics Research Biclustering algorithm is an effective tool for processing gene expression datasets. There are two kinds of data matrices, binary data and non-binary data, which are processed by biclustering method. A binary matrix is usually converted from pre-processed gene expression data, which can effectively reduce the interference from noise and abnormal data, and is then processed using a biclustering algorithm. However, biclustering algorithms of dealing with binary data have a poor balance between running time and performance. In this paper, we propose a new biclustering algorithm called the Adjacency Difference Matrix Binary Biclustering algorithm (AMBB) for dealing with binary data to address the drawback. The AMBB algorithm constructs the adjacency matrix based on the adjacency difference values, and the submatrix obtained by continuously updating the adjacency difference matrix is called a bicluster. The adjacency matrix allows for clustering of gene that undergo similar reactions under different conditions into clusters, which is important for subsequent genes analysis. Meanwhile, experiments on synthetic and real datasets visually demonstrate that the AMBB algorithm has high practicability. BioMed Central 2022-09-19 /pmc/articles/PMC9484244/ /pubmed/36123637 http://dx.doi.org/10.1186/s12859-022-04842-4 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Chu, He-Ming Liu, Jin-Xing Zhang, Ke Zheng, Chun-Hou Wang, Juan Kong, Xiang-Zhen A binary biclustering algorithm based on the adjacency difference matrix for gene expression data analysis |
title | A binary biclustering algorithm based on the adjacency difference matrix for gene expression data analysis |
title_full | A binary biclustering algorithm based on the adjacency difference matrix for gene expression data analysis |
title_fullStr | A binary biclustering algorithm based on the adjacency difference matrix for gene expression data analysis |
title_full_unstemmed | A binary biclustering algorithm based on the adjacency difference matrix for gene expression data analysis |
title_short | A binary biclustering algorithm based on the adjacency difference matrix for gene expression data analysis |
title_sort | binary biclustering algorithm based on the adjacency difference matrix for gene expression data analysis |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9484244/ https://www.ncbi.nlm.nih.gov/pubmed/36123637 http://dx.doi.org/10.1186/s12859-022-04842-4 |
work_keys_str_mv | AT chuheming abinarybiclusteringalgorithmbasedontheadjacencydifferencematrixforgeneexpressiondataanalysis AT liujinxing abinarybiclusteringalgorithmbasedontheadjacencydifferencematrixforgeneexpressiondataanalysis AT zhangke abinarybiclusteringalgorithmbasedontheadjacencydifferencematrixforgeneexpressiondataanalysis AT zhengchunhou abinarybiclusteringalgorithmbasedontheadjacencydifferencematrixforgeneexpressiondataanalysis AT wangjuan abinarybiclusteringalgorithmbasedontheadjacencydifferencematrixforgeneexpressiondataanalysis AT kongxiangzhen abinarybiclusteringalgorithmbasedontheadjacencydifferencematrixforgeneexpressiondataanalysis AT chuheming binarybiclusteringalgorithmbasedontheadjacencydifferencematrixforgeneexpressiondataanalysis AT liujinxing binarybiclusteringalgorithmbasedontheadjacencydifferencematrixforgeneexpressiondataanalysis AT zhangke binarybiclusteringalgorithmbasedontheadjacencydifferencematrixforgeneexpressiondataanalysis AT zhengchunhou binarybiclusteringalgorithmbasedontheadjacencydifferencematrixforgeneexpressiondataanalysis AT wangjuan binarybiclusteringalgorithmbasedontheadjacencydifferencematrixforgeneexpressiondataanalysis AT kongxiangzhen binarybiclusteringalgorithmbasedontheadjacencydifferencematrixforgeneexpressiondataanalysis |