Cargando…

Clique-based data mining for related genes in a biomedical database

BACKGROUND: Progress in the life sciences cannot be made without integrating biomedical knowledge on numerous genes in order to help formulate hypotheses on the genetic mechanisms behind various biological phenomena, including diseases. There is thus a strong need for a way to automatically and comp...

Descripción completa

Detalles Bibliográficos
Autores principales: Matsunaga, Tsutomu, Yonemori, Chikara, Tomita, Etsuji, Muramatsu, Masaaki
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2721841/
https://www.ncbi.nlm.nih.gov/pubmed/19566964
http://dx.doi.org/10.1186/1471-2105-10-205
_version_ 1782170240175570944
author Matsunaga, Tsutomu
Yonemori, Chikara
Tomita, Etsuji
Muramatsu, Masaaki
author_facet Matsunaga, Tsutomu
Yonemori, Chikara
Tomita, Etsuji
Muramatsu, Masaaki
author_sort Matsunaga, Tsutomu
collection PubMed
description BACKGROUND: Progress in the life sciences cannot be made without integrating biomedical knowledge on numerous genes in order to help formulate hypotheses on the genetic mechanisms behind various biological phenomena, including diseases. There is thus a strong need for a way to automatically and comprehensively search from biomedical databases for related genes, such as genes in the same families and genes encoding components of the same pathways. Here we address the extraction of related genes by searching for densely-connected subgraphs, which are modeled as cliques, in a biomedical relational graph. RESULTS: We constructed a graph whose nodes were gene or disease pages, and edges were the hyperlink connections between those pages in the Online Mendelian Inheritance in Man (OMIM) database. We obtained over 20,000 sets of related genes (called 'gene modules') by enumerating cliques computationally. The modules included genes in the same family, genes for proteins that form a complex, and genes for components of the same signaling pathway. The results of experiments using 'metabolic syndrome'-related gene modules show that the gene modules can be used to get a coherent holistic picture helpful for interpreting relations among genes. CONCLUSION: We presented a data mining approach extracting related genes by enumerating cliques. The extracted gene sets provide a holistic picture useful for comprehending complex disease mechanisms.
format Text
id pubmed-2721841
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-27218412009-08-06 Clique-based data mining for related genes in a biomedical database Matsunaga, Tsutomu Yonemori, Chikara Tomita, Etsuji Muramatsu, Masaaki BMC Bioinformatics Research Article BACKGROUND: Progress in the life sciences cannot be made without integrating biomedical knowledge on numerous genes in order to help formulate hypotheses on the genetic mechanisms behind various biological phenomena, including diseases. There is thus a strong need for a way to automatically and comprehensively search from biomedical databases for related genes, such as genes in the same families and genes encoding components of the same pathways. Here we address the extraction of related genes by searching for densely-connected subgraphs, which are modeled as cliques, in a biomedical relational graph. RESULTS: We constructed a graph whose nodes were gene or disease pages, and edges were the hyperlink connections between those pages in the Online Mendelian Inheritance in Man (OMIM) database. We obtained over 20,000 sets of related genes (called 'gene modules') by enumerating cliques computationally. The modules included genes in the same family, genes for proteins that form a complex, and genes for components of the same signaling pathway. The results of experiments using 'metabolic syndrome'-related gene modules show that the gene modules can be used to get a coherent holistic picture helpful for interpreting relations among genes. CONCLUSION: We presented a data mining approach extracting related genes by enumerating cliques. The extracted gene sets provide a holistic picture useful for comprehending complex disease mechanisms. BioMed Central 2009-07-01 /pmc/articles/PMC2721841/ /pubmed/19566964 http://dx.doi.org/10.1186/1471-2105-10-205 Text en Copyright © 2009 Matsunaga et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Matsunaga, Tsutomu
Yonemori, Chikara
Tomita, Etsuji
Muramatsu, Masaaki
Clique-based data mining for related genes in a biomedical database
title Clique-based data mining for related genes in a biomedical database
title_full Clique-based data mining for related genes in a biomedical database
title_fullStr Clique-based data mining for related genes in a biomedical database
title_full_unstemmed Clique-based data mining for related genes in a biomedical database
title_short Clique-based data mining for related genes in a biomedical database
title_sort clique-based data mining for related genes in a biomedical database
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2721841/
https://www.ncbi.nlm.nih.gov/pubmed/19566964
http://dx.doi.org/10.1186/1471-2105-10-205
work_keys_str_mv AT matsunagatsutomu cliquebaseddataminingforrelatedgenesinabiomedicaldatabase
AT yonemorichikara cliquebaseddataminingforrelatedgenesinabiomedicaldatabase
AT tomitaetsuji cliquebaseddataminingforrelatedgenesinabiomedicaldatabase
AT muramatsumasaaki cliquebaseddataminingforrelatedgenesinabiomedicaldatabase