Cargando…
Application of a New Probabilistic Model for Mining Implicit Associated Cancer Genes from OMIM and Medline
An important issue in current medical science research is to find the genes that are strongly related to an inherited disease. A particular focus is placed on cancer-gene relations, since some types of cancers are inherited. As biomedical databases have grown speedily in recent years, an informatics...
Autores principales: | , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Libertas Academica
2007
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2675505/ https://www.ncbi.nlm.nih.gov/pubmed/19458778 |
_version_ | 1782166703253225472 |
---|---|
author | Zhu, Shanfeng Okuno, Yasushi Tsujimoto, Gozoh Mamitsuka, Hiroshi |
author_facet | Zhu, Shanfeng Okuno, Yasushi Tsujimoto, Gozoh Mamitsuka, Hiroshi |
author_sort | Zhu, Shanfeng |
collection | PubMed |
description | An important issue in current medical science research is to find the genes that are strongly related to an inherited disease. A particular focus is placed on cancer-gene relations, since some types of cancers are inherited. As biomedical databases have grown speedily in recent years, an informatics approach to predict such relations from currently available databases should be developed. Our objective is to find implicit associated cancer-genes from biomedical databases including the literature database. Co-occurrence of biological entities has been shown to be a popular and efficient technique in biomedical text mining. We have applied a new probabilistic model, called mixture aspect model (MAM) [48], to combine different types of co-occurrences of genes and cancer derived from Medline and OMIM (Online Mendelian Inheritance in Man). We trained the probability parameters of MAM using a learning method based on an EM (Expectation and Maximization) algorithm. We examined the performance of MAM by predicting associated cancer gene pairs. Through cross-validation, prediction accuracy was shown to be improved by adding gene-gene co-occurrences from Medline to cancer-gene cooccurrences in OMIM. Further experiments showed that MAM found new cancer-gene relations which are unknown in the literature. Supplementary information can be found at http://www.bic.kyotou.ac.jp/pathway/zhusf/CancerInformatics/Supplemental2006.html |
format | Text |
id | pubmed-2675505 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2007 |
publisher | Libertas Academica |
record_format | MEDLINE/PubMed |
spelling | pubmed-26755052009-05-20 Application of a New Probabilistic Model for Mining Implicit Associated Cancer Genes from OMIM and Medline Zhu, Shanfeng Okuno, Yasushi Tsujimoto, Gozoh Mamitsuka, Hiroshi Cancer Inform Original Research An important issue in current medical science research is to find the genes that are strongly related to an inherited disease. A particular focus is placed on cancer-gene relations, since some types of cancers are inherited. As biomedical databases have grown speedily in recent years, an informatics approach to predict such relations from currently available databases should be developed. Our objective is to find implicit associated cancer-genes from biomedical databases including the literature database. Co-occurrence of biological entities has been shown to be a popular and efficient technique in biomedical text mining. We have applied a new probabilistic model, called mixture aspect model (MAM) [48], to combine different types of co-occurrences of genes and cancer derived from Medline and OMIM (Online Mendelian Inheritance in Man). We trained the probability parameters of MAM using a learning method based on an EM (Expectation and Maximization) algorithm. We examined the performance of MAM by predicting associated cancer gene pairs. Through cross-validation, prediction accuracy was shown to be improved by adding gene-gene co-occurrences from Medline to cancer-gene cooccurrences in OMIM. Further experiments showed that MAM found new cancer-gene relations which are unknown in the literature. Supplementary information can be found at http://www.bic.kyotou.ac.jp/pathway/zhusf/CancerInformatics/Supplemental2006.html Libertas Academica 2007-02-25 /pmc/articles/PMC2675505/ /pubmed/19458778 Text en © 2006 The authors. This article is an open-access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/). |
spellingShingle | Original Research Zhu, Shanfeng Okuno, Yasushi Tsujimoto, Gozoh Mamitsuka, Hiroshi Application of a New Probabilistic Model for Mining Implicit Associated Cancer Genes from OMIM and Medline |
title | Application of a New Probabilistic Model for Mining Implicit Associated Cancer Genes from OMIM and Medline |
title_full | Application of a New Probabilistic Model for Mining Implicit Associated Cancer Genes from OMIM and Medline |
title_fullStr | Application of a New Probabilistic Model for Mining Implicit Associated Cancer Genes from OMIM and Medline |
title_full_unstemmed | Application of a New Probabilistic Model for Mining Implicit Associated Cancer Genes from OMIM and Medline |
title_short | Application of a New Probabilistic Model for Mining Implicit Associated Cancer Genes from OMIM and Medline |
title_sort | application of a new probabilistic model for mining implicit associated cancer genes from omim and medline |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2675505/ https://www.ncbi.nlm.nih.gov/pubmed/19458778 |
work_keys_str_mv | AT zhushanfeng applicationofanewprobabilisticmodelforminingimplicitassociatedcancergenesfromomimandmedline AT okunoyasushi applicationofanewprobabilisticmodelforminingimplicitassociatedcancergenesfromomimandmedline AT tsujimotogozoh applicationofanewprobabilisticmodelforminingimplicitassociatedcancergenesfromomimandmedline AT mamitsukahiroshi applicationofanewprobabilisticmodelforminingimplicitassociatedcancergenesfromomimandmedline |