Cargando…

Discriminative pattern discovery for the characterization of different network populations

MOTIVATION: An interesting problem is to study how gene co-expression varies in two different populations, associated with healthy and unhealthy individuals, respectively. To this aim, two important aspects should be taken into account: (i) in some cases, pairs/groups of genes show collaborative att...

Descripción completa

Detalles Bibliográficos
Autores principales: Fassetti, Fabio, Rombo, Simona E, Serrao, Cristina
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10121338/
https://www.ncbi.nlm.nih.gov/pubmed/37021928
http://dx.doi.org/10.1093/bioinformatics/btad168
_version_ 1785029362785976320
author Fassetti, Fabio
Rombo, Simona E
Serrao, Cristina
author_facet Fassetti, Fabio
Rombo, Simona E
Serrao, Cristina
author_sort Fassetti, Fabio
collection PubMed
description MOTIVATION: An interesting problem is to study how gene co-expression varies in two different populations, associated with healthy and unhealthy individuals, respectively. To this aim, two important aspects should be taken into account: (i) in some cases, pairs/groups of genes show collaborative attitudes, emerging in the study of disorders and diseases; (ii) information coming from each single individual may be crucial to capture specific details, at the basis of complex cellular mechanisms; therefore, it is important avoiding to miss potentially powerful information, associated with the single samples. RESULTS: Here, a novel approach is proposed, such that two different input populations are considered, and represented by two datasets of edge-labeled graphs. Each graph is associated to an individual, and the edge label is the co-expression value between the two genes associated to the nodes. Discriminative patterns among graphs belonging to different sample sets are searched for, based on a statistical notion of ‘relevance’ able to take into account important local similarities, and also collaborative effects, involving the co-expression among multiple genes. Four different gene expression datasets have been analyzed by the proposed approach, each associated to a different disease. An extensive set of experiments show that the extracted patterns significantly characterize important differences between healthy and unhealthy samples, both in the cooperation and in the biological functionality of the involved genes/proteins. Moreover, the provided analysis confirms some results already presented in the literature on genes with a central role for the considered diseases, still allowing to identify novel and useful insights on this aspect. AVAILABILITY AND IMPLEMENTATION: The algorithm has been implemented using the Java programming language. The data underlying this article and the code are available at https://github.com/CriSe92/DiscriminativeSubgraphDiscovery.
format Online
Article
Text
id pubmed-10121338
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-101213382023-04-22 Discriminative pattern discovery for the characterization of different network populations Fassetti, Fabio Rombo, Simona E Serrao, Cristina Bioinformatics Original Paper MOTIVATION: An interesting problem is to study how gene co-expression varies in two different populations, associated with healthy and unhealthy individuals, respectively. To this aim, two important aspects should be taken into account: (i) in some cases, pairs/groups of genes show collaborative attitudes, emerging in the study of disorders and diseases; (ii) information coming from each single individual may be crucial to capture specific details, at the basis of complex cellular mechanisms; therefore, it is important avoiding to miss potentially powerful information, associated with the single samples. RESULTS: Here, a novel approach is proposed, such that two different input populations are considered, and represented by two datasets of edge-labeled graphs. Each graph is associated to an individual, and the edge label is the co-expression value between the two genes associated to the nodes. Discriminative patterns among graphs belonging to different sample sets are searched for, based on a statistical notion of ‘relevance’ able to take into account important local similarities, and also collaborative effects, involving the co-expression among multiple genes. Four different gene expression datasets have been analyzed by the proposed approach, each associated to a different disease. An extensive set of experiments show that the extracted patterns significantly characterize important differences between healthy and unhealthy samples, both in the cooperation and in the biological functionality of the involved genes/proteins. Moreover, the provided analysis confirms some results already presented in the literature on genes with a central role for the considered diseases, still allowing to identify novel and useful insights on this aspect. AVAILABILITY AND IMPLEMENTATION: The algorithm has been implemented using the Java programming language. The data underlying this article and the code are available at https://github.com/CriSe92/DiscriminativeSubgraphDiscovery. Oxford University Press 2023-04-06 /pmc/articles/PMC10121338/ /pubmed/37021928 http://dx.doi.org/10.1093/bioinformatics/btad168 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Fassetti, Fabio
Rombo, Simona E
Serrao, Cristina
Discriminative pattern discovery for the characterization of different network populations
title Discriminative pattern discovery for the characterization of different network populations
title_full Discriminative pattern discovery for the characterization of different network populations
title_fullStr Discriminative pattern discovery for the characterization of different network populations
title_full_unstemmed Discriminative pattern discovery for the characterization of different network populations
title_short Discriminative pattern discovery for the characterization of different network populations
title_sort discriminative pattern discovery for the characterization of different network populations
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10121338/
https://www.ncbi.nlm.nih.gov/pubmed/37021928
http://dx.doi.org/10.1093/bioinformatics/btad168
work_keys_str_mv AT fassettifabio discriminativepatterndiscoveryforthecharacterizationofdifferentnetworkpopulations
AT rombosimonae discriminativepatterndiscoveryforthecharacterizationofdifferentnetworkpopulations
AT serraocristina discriminativepatterndiscoveryforthecharacterizationofdifferentnetworkpopulations