Cargando…

A disease-related gene mining method based on weakly supervised learning model

BACKGROUND: Predicting disease-related genes is helpful for understanding the disease pathology and the molecular mechanisms during the disease progression. However, traditional methods are not suitable for screening genes related to the disease development, because there are some samples with weak...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Han, Huo, Xueting, Guo, Xia, Su, Xin, Quan, Xiongwen, Jin, Chen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6886251/
https://www.ncbi.nlm.nih.gov/pubmed/31787083
http://dx.doi.org/10.1186/s12859-019-3078-9
_version_ 1783474845029236736
author Zhang, Han
Huo, Xueting
Guo, Xia
Su, Xin
Quan, Xiongwen
Jin, Chen
author_facet Zhang, Han
Huo, Xueting
Guo, Xia
Su, Xin
Quan, Xiongwen
Jin, Chen
author_sort Zhang, Han
collection PubMed
description BACKGROUND: Predicting disease-related genes is helpful for understanding the disease pathology and the molecular mechanisms during the disease progression. However, traditional methods are not suitable for screening genes related to the disease development, because there are some samples with weak label information in the disease dataset and a small number of genes are known disease-related genes. RESULTS: We designed a disease-related gene mining method based on the weakly supervised learning model in this paper. The method is separated into two steps. Firstly, the differentially expressed genes are screened based on the weakly supervised learning model. In the model, the strong and weak label information at different stages of the disease progression is fully utilized. The obtained differentially expressed gene set is stable and complete after the algorithm converges. Then, we screen disease-related genes in the obtained differentially expressed gene set using transductive support vector machine based on the difference kernel function. The difference kernel function can map the input space of the original Huntington’s disease gene expression dataset to the difference space. The relation between the two genes can be evaluated more accurately in the difference space and the known disease-related gene information can be used effectively. CONCLUSIONS: The experimental results show that the disease-related gene mining method based on the weakly supervised learning model can effectively improve the precision of the disease-related gene prediction compared with other excellent methods.
format Online
Article
Text
id pubmed-6886251
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-68862512019-12-11 A disease-related gene mining method based on weakly supervised learning model Zhang, Han Huo, Xueting Guo, Xia Su, Xin Quan, Xiongwen Jin, Chen BMC Bioinformatics Research BACKGROUND: Predicting disease-related genes is helpful for understanding the disease pathology and the molecular mechanisms during the disease progression. However, traditional methods are not suitable for screening genes related to the disease development, because there are some samples with weak label information in the disease dataset and a small number of genes are known disease-related genes. RESULTS: We designed a disease-related gene mining method based on the weakly supervised learning model in this paper. The method is separated into two steps. Firstly, the differentially expressed genes are screened based on the weakly supervised learning model. In the model, the strong and weak label information at different stages of the disease progression is fully utilized. The obtained differentially expressed gene set is stable and complete after the algorithm converges. Then, we screen disease-related genes in the obtained differentially expressed gene set using transductive support vector machine based on the difference kernel function. The difference kernel function can map the input space of the original Huntington’s disease gene expression dataset to the difference space. The relation between the two genes can be evaluated more accurately in the difference space and the known disease-related gene information can be used effectively. CONCLUSIONS: The experimental results show that the disease-related gene mining method based on the weakly supervised learning model can effectively improve the precision of the disease-related gene prediction compared with other excellent methods. BioMed Central 2019-12-02 /pmc/articles/PMC6886251/ /pubmed/31787083 http://dx.doi.org/10.1186/s12859-019-3078-9 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Zhang, Han
Huo, Xueting
Guo, Xia
Su, Xin
Quan, Xiongwen
Jin, Chen
A disease-related gene mining method based on weakly supervised learning model
title A disease-related gene mining method based on weakly supervised learning model
title_full A disease-related gene mining method based on weakly supervised learning model
title_fullStr A disease-related gene mining method based on weakly supervised learning model
title_full_unstemmed A disease-related gene mining method based on weakly supervised learning model
title_short A disease-related gene mining method based on weakly supervised learning model
title_sort disease-related gene mining method based on weakly supervised learning model
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6886251/
https://www.ncbi.nlm.nih.gov/pubmed/31787083
http://dx.doi.org/10.1186/s12859-019-3078-9
work_keys_str_mv AT zhanghan adiseaserelatedgeneminingmethodbasedonweaklysupervisedlearningmodel
AT huoxueting adiseaserelatedgeneminingmethodbasedonweaklysupervisedlearningmodel
AT guoxia adiseaserelatedgeneminingmethodbasedonweaklysupervisedlearningmodel
AT suxin adiseaserelatedgeneminingmethodbasedonweaklysupervisedlearningmodel
AT quanxiongwen adiseaserelatedgeneminingmethodbasedonweaklysupervisedlearningmodel
AT jinchen adiseaserelatedgeneminingmethodbasedonweaklysupervisedlearningmodel
AT zhanghan diseaserelatedgeneminingmethodbasedonweaklysupervisedlearningmodel
AT huoxueting diseaserelatedgeneminingmethodbasedonweaklysupervisedlearningmodel
AT guoxia diseaserelatedgeneminingmethodbasedonweaklysupervisedlearningmodel
AT suxin diseaserelatedgeneminingmethodbasedonweaklysupervisedlearningmodel
AT quanxiongwen diseaserelatedgeneminingmethodbasedonweaklysupervisedlearningmodel
AT jinchen diseaserelatedgeneminingmethodbasedonweaklysupervisedlearningmodel