Cargando…

An Iterative Unsupervised Method for Gene Expression Differentiation

For several decades, intensive research for understanding gene activity and its role in organism’s lives is the research focus of scientists in different areas. A part of these investigations is the analysis of gene expression data for selecting differentially expressed genes. Methods that identify...

Descripción completa

Detalles Bibliográficos
Autor principal: Georgieva, Olga
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9956932/
https://www.ncbi.nlm.nih.gov/pubmed/36833339
http://dx.doi.org/10.3390/genes14020412
_version_ 1784894701664468992
author Georgieva, Olga
author_facet Georgieva, Olga
author_sort Georgieva, Olga
collection PubMed
description For several decades, intensive research for understanding gene activity and its role in organism’s lives is the research focus of scientists in different areas. A part of these investigations is the analysis of gene expression data for selecting differentially expressed genes. Methods that identify the interested genes have been proposed on statistical data analysis. The problem is that there is no good agreement among them, as different results are produced by distinct methods. By taking the advantage of the unsupervised data analysis, an iterative clustering procedure that finds differentially expressed genes shows promising results. In the present paper, a comparative study of the clustering methods applied for gene expression analysis is presented to explicate the choice of the clustering algorithm implemented in the method. An investigation of different distance measures is provided to reveal those that increase the efficiency of the method in finding the real data structure. Further, the method is improved by incorporating an additional aggregation measure based on the standard deviation of the expression levels. Its usage increases the gene distinction as a new amount of differentially expressed genes is found. The method is summarized in a detailed procedure. The significance of the method is proved by an analysis of two mice strain data sets. The differentially expressed genes defined by the proposed method are compared with those selected by the well-known statistical methods applied to the same data set.
format Online
Article
Text
id pubmed-9956932
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-99569322023-02-25 An Iterative Unsupervised Method for Gene Expression Differentiation Georgieva, Olga Genes (Basel) Article For several decades, intensive research for understanding gene activity and its role in organism’s lives is the research focus of scientists in different areas. A part of these investigations is the analysis of gene expression data for selecting differentially expressed genes. Methods that identify the interested genes have been proposed on statistical data analysis. The problem is that there is no good agreement among them, as different results are produced by distinct methods. By taking the advantage of the unsupervised data analysis, an iterative clustering procedure that finds differentially expressed genes shows promising results. In the present paper, a comparative study of the clustering methods applied for gene expression analysis is presented to explicate the choice of the clustering algorithm implemented in the method. An investigation of different distance measures is provided to reveal those that increase the efficiency of the method in finding the real data structure. Further, the method is improved by incorporating an additional aggregation measure based on the standard deviation of the expression levels. Its usage increases the gene distinction as a new amount of differentially expressed genes is found. The method is summarized in a detailed procedure. The significance of the method is proved by an analysis of two mice strain data sets. The differentially expressed genes defined by the proposed method are compared with those selected by the well-known statistical methods applied to the same data set. MDPI 2023-02-04 /pmc/articles/PMC9956932/ /pubmed/36833339 http://dx.doi.org/10.3390/genes14020412 Text en © 2023 by the author. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Georgieva, Olga
An Iterative Unsupervised Method for Gene Expression Differentiation
title An Iterative Unsupervised Method for Gene Expression Differentiation
title_full An Iterative Unsupervised Method for Gene Expression Differentiation
title_fullStr An Iterative Unsupervised Method for Gene Expression Differentiation
title_full_unstemmed An Iterative Unsupervised Method for Gene Expression Differentiation
title_short An Iterative Unsupervised Method for Gene Expression Differentiation
title_sort iterative unsupervised method for gene expression differentiation
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9956932/
https://www.ncbi.nlm.nih.gov/pubmed/36833339
http://dx.doi.org/10.3390/genes14020412
work_keys_str_mv AT georgievaolga aniterativeunsupervisedmethodforgeneexpressiondifferentiation
AT georgievaolga iterativeunsupervisedmethodforgeneexpressiondifferentiation