Cargando…

Application of a New Probabilistic Model for Mining Implicit Associated Cancer Genes from OMIM and Medline

An important issue in current medical science research is to find the genes that are strongly related to an inherited disease. A particular focus is placed on cancer-gene relations, since some types of cancers are inherited. As biomedical databases have grown speedily in recent years, an informatics...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhu, Shanfeng, Okuno, Yasushi, Tsujimoto, Gozoh, Mamitsuka, Hiroshi
Formato: Texto
Lenguaje:English
Publicado: Libertas Academica 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2675505/
https://www.ncbi.nlm.nih.gov/pubmed/19458778
_version_ 1782166703253225472
author Zhu, Shanfeng
Okuno, Yasushi
Tsujimoto, Gozoh
Mamitsuka, Hiroshi
author_facet Zhu, Shanfeng
Okuno, Yasushi
Tsujimoto, Gozoh
Mamitsuka, Hiroshi
author_sort Zhu, Shanfeng
collection PubMed
description An important issue in current medical science research is to find the genes that are strongly related to an inherited disease. A particular focus is placed on cancer-gene relations, since some types of cancers are inherited. As biomedical databases have grown speedily in recent years, an informatics approach to predict such relations from currently available databases should be developed. Our objective is to find implicit associated cancer-genes from biomedical databases including the literature database. Co-occurrence of biological entities has been shown to be a popular and efficient technique in biomedical text mining. We have applied a new probabilistic model, called mixture aspect model (MAM) [48], to combine different types of co-occurrences of genes and cancer derived from Medline and OMIM (Online Mendelian Inheritance in Man). We trained the probability parameters of MAM using a learning method based on an EM (Expectation and Maximization) algorithm. We examined the performance of MAM by predicting associated cancer gene pairs. Through cross-validation, prediction accuracy was shown to be improved by adding gene-gene co-occurrences from Medline to cancer-gene cooccurrences in OMIM. Further experiments showed that MAM found new cancer-gene relations which are unknown in the literature. Supplementary information can be found at http://www.bic.kyotou.ac.jp/pathway/zhusf/CancerInformatics/Supplemental2006.html
format Text
id pubmed-2675505
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher Libertas Academica
record_format MEDLINE/PubMed
spelling pubmed-26755052009-05-20 Application of a New Probabilistic Model for Mining Implicit Associated Cancer Genes from OMIM and Medline Zhu, Shanfeng Okuno, Yasushi Tsujimoto, Gozoh Mamitsuka, Hiroshi Cancer Inform Original Research An important issue in current medical science research is to find the genes that are strongly related to an inherited disease. A particular focus is placed on cancer-gene relations, since some types of cancers are inherited. As biomedical databases have grown speedily in recent years, an informatics approach to predict such relations from currently available databases should be developed. Our objective is to find implicit associated cancer-genes from biomedical databases including the literature database. Co-occurrence of biological entities has been shown to be a popular and efficient technique in biomedical text mining. We have applied a new probabilistic model, called mixture aspect model (MAM) [48], to combine different types of co-occurrences of genes and cancer derived from Medline and OMIM (Online Mendelian Inheritance in Man). We trained the probability parameters of MAM using a learning method based on an EM (Expectation and Maximization) algorithm. We examined the performance of MAM by predicting associated cancer gene pairs. Through cross-validation, prediction accuracy was shown to be improved by adding gene-gene co-occurrences from Medline to cancer-gene cooccurrences in OMIM. Further experiments showed that MAM found new cancer-gene relations which are unknown in the literature. Supplementary information can be found at http://www.bic.kyotou.ac.jp/pathway/zhusf/CancerInformatics/Supplemental2006.html Libertas Academica 2007-02-25 /pmc/articles/PMC2675505/ /pubmed/19458778 Text en © 2006 The authors. This article is an open-access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).
spellingShingle Original Research
Zhu, Shanfeng
Okuno, Yasushi
Tsujimoto, Gozoh
Mamitsuka, Hiroshi
Application of a New Probabilistic Model for Mining Implicit Associated Cancer Genes from OMIM and Medline
title Application of a New Probabilistic Model for Mining Implicit Associated Cancer Genes from OMIM and Medline
title_full Application of a New Probabilistic Model for Mining Implicit Associated Cancer Genes from OMIM and Medline
title_fullStr Application of a New Probabilistic Model for Mining Implicit Associated Cancer Genes from OMIM and Medline
title_full_unstemmed Application of a New Probabilistic Model for Mining Implicit Associated Cancer Genes from OMIM and Medline
title_short Application of a New Probabilistic Model for Mining Implicit Associated Cancer Genes from OMIM and Medline
title_sort application of a new probabilistic model for mining implicit associated cancer genes from omim and medline
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2675505/
https://www.ncbi.nlm.nih.gov/pubmed/19458778
work_keys_str_mv AT zhushanfeng applicationofanewprobabilisticmodelforminingimplicitassociatedcancergenesfromomimandmedline
AT okunoyasushi applicationofanewprobabilisticmodelforminingimplicitassociatedcancergenesfromomimandmedline
AT tsujimotogozoh applicationofanewprobabilisticmodelforminingimplicitassociatedcancergenesfromomimandmedline
AT mamitsukahiroshi applicationofanewprobabilisticmodelforminingimplicitassociatedcancergenesfromomimandmedline