Cargando…

A binary biclustering algorithm based on the adjacency difference matrix for gene expression data analysis

Biclustering algorithm is an effective tool for processing gene expression datasets. There are two kinds of data matrices, binary data and non-binary data, which are processed by biclustering method. A binary matrix is usually converted from pre-processed gene expression data, which can effectively...

Descripción completa

Detalles Bibliográficos
Autores principales: Chu, He-Ming, Liu, Jin-Xing, Zhang, Ke, Zheng, Chun-Hou, Wang, Juan, Kong, Xiang-Zhen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9484244/
https://www.ncbi.nlm.nih.gov/pubmed/36123637
http://dx.doi.org/10.1186/s12859-022-04842-4
_version_ 1784791840553172992
author Chu, He-Ming
Liu, Jin-Xing
Zhang, Ke
Zheng, Chun-Hou
Wang, Juan
Kong, Xiang-Zhen
author_facet Chu, He-Ming
Liu, Jin-Xing
Zhang, Ke
Zheng, Chun-Hou
Wang, Juan
Kong, Xiang-Zhen
author_sort Chu, He-Ming
collection PubMed
description Biclustering algorithm is an effective tool for processing gene expression datasets. There are two kinds of data matrices, binary data and non-binary data, which are processed by biclustering method. A binary matrix is usually converted from pre-processed gene expression data, which can effectively reduce the interference from noise and abnormal data, and is then processed using a biclustering algorithm. However, biclustering algorithms of dealing with binary data have a poor balance between running time and performance. In this paper, we propose a new biclustering algorithm called the Adjacency Difference Matrix Binary Biclustering algorithm (AMBB) for dealing with binary data to address the drawback. The AMBB algorithm constructs the adjacency matrix based on the adjacency difference values, and the submatrix obtained by continuously updating the adjacency difference matrix is called a bicluster. The adjacency matrix allows for clustering of gene that undergo similar reactions under different conditions into clusters, which is important for subsequent genes analysis. Meanwhile, experiments on synthetic and real datasets visually demonstrate that the AMBB algorithm has high practicability.
format Online
Article
Text
id pubmed-9484244
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-94842442022-09-20 A binary biclustering algorithm based on the adjacency difference matrix for gene expression data analysis Chu, He-Ming Liu, Jin-Xing Zhang, Ke Zheng, Chun-Hou Wang, Juan Kong, Xiang-Zhen BMC Bioinformatics Research Biclustering algorithm is an effective tool for processing gene expression datasets. There are two kinds of data matrices, binary data and non-binary data, which are processed by biclustering method. A binary matrix is usually converted from pre-processed gene expression data, which can effectively reduce the interference from noise and abnormal data, and is then processed using a biclustering algorithm. However, biclustering algorithms of dealing with binary data have a poor balance between running time and performance. In this paper, we propose a new biclustering algorithm called the Adjacency Difference Matrix Binary Biclustering algorithm (AMBB) for dealing with binary data to address the drawback. The AMBB algorithm constructs the adjacency matrix based on the adjacency difference values, and the submatrix obtained by continuously updating the adjacency difference matrix is called a bicluster. The adjacency matrix allows for clustering of gene that undergo similar reactions under different conditions into clusters, which is important for subsequent genes analysis. Meanwhile, experiments on synthetic and real datasets visually demonstrate that the AMBB algorithm has high practicability. BioMed Central 2022-09-19 /pmc/articles/PMC9484244/ /pubmed/36123637 http://dx.doi.org/10.1186/s12859-022-04842-4 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Chu, He-Ming
Liu, Jin-Xing
Zhang, Ke
Zheng, Chun-Hou
Wang, Juan
Kong, Xiang-Zhen
A binary biclustering algorithm based on the adjacency difference matrix for gene expression data analysis
title A binary biclustering algorithm based on the adjacency difference matrix for gene expression data analysis
title_full A binary biclustering algorithm based on the adjacency difference matrix for gene expression data analysis
title_fullStr A binary biclustering algorithm based on the adjacency difference matrix for gene expression data analysis
title_full_unstemmed A binary biclustering algorithm based on the adjacency difference matrix for gene expression data analysis
title_short A binary biclustering algorithm based on the adjacency difference matrix for gene expression data analysis
title_sort binary biclustering algorithm based on the adjacency difference matrix for gene expression data analysis
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9484244/
https://www.ncbi.nlm.nih.gov/pubmed/36123637
http://dx.doi.org/10.1186/s12859-022-04842-4
work_keys_str_mv AT chuheming abinarybiclusteringalgorithmbasedontheadjacencydifferencematrixforgeneexpressiondataanalysis
AT liujinxing abinarybiclusteringalgorithmbasedontheadjacencydifferencematrixforgeneexpressiondataanalysis
AT zhangke abinarybiclusteringalgorithmbasedontheadjacencydifferencematrixforgeneexpressiondataanalysis
AT zhengchunhou abinarybiclusteringalgorithmbasedontheadjacencydifferencematrixforgeneexpressiondataanalysis
AT wangjuan abinarybiclusteringalgorithmbasedontheadjacencydifferencematrixforgeneexpressiondataanalysis
AT kongxiangzhen abinarybiclusteringalgorithmbasedontheadjacencydifferencematrixforgeneexpressiondataanalysis
AT chuheming binarybiclusteringalgorithmbasedontheadjacencydifferencematrixforgeneexpressiondataanalysis
AT liujinxing binarybiclusteringalgorithmbasedontheadjacencydifferencematrixforgeneexpressiondataanalysis
AT zhangke binarybiclusteringalgorithmbasedontheadjacencydifferencematrixforgeneexpressiondataanalysis
AT zhengchunhou binarybiclusteringalgorithmbasedontheadjacencydifferencematrixforgeneexpressiondataanalysis
AT wangjuan binarybiclusteringalgorithmbasedontheadjacencydifferencematrixforgeneexpressiondataanalysis
AT kongxiangzhen binarybiclusteringalgorithmbasedontheadjacencydifferencematrixforgeneexpressiondataanalysis