Cargando…

A semi-supervised learning approach to predict synthetic genetic interactions by combining functional and topological properties of functional gene network

BACKGROUND: Genetic interaction profiles are highly informative and helpful for understanding the functional linkages between genes, and therefore have been extensively exploited for annotating gene functions and dissecting specific pathway structures. However, our understanding is rather limited to...

Descripción completa

Detalles Bibliográficos
Autores principales: You, Zhu-Hong, Yin, Zheng, Han, Kyungsook, Huang, De-Shuang, Zhou, Xiaobo
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2909217/
https://www.ncbi.nlm.nih.gov/pubmed/20573270
http://dx.doi.org/10.1186/1471-2105-11-343
_version_ 1782184289441415168
author You, Zhu-Hong
Yin, Zheng
Han, Kyungsook
Huang, De-Shuang
Zhou, Xiaobo
author_facet You, Zhu-Hong
Yin, Zheng
Han, Kyungsook
Huang, De-Shuang
Zhou, Xiaobo
author_sort You, Zhu-Hong
collection PubMed
description BACKGROUND: Genetic interaction profiles are highly informative and helpful for understanding the functional linkages between genes, and therefore have been extensively exploited for annotating gene functions and dissecting specific pathway structures. However, our understanding is rather limited to the relationship between double concurrent perturbation and various higher level phenotypic changes, e.g. those in cells, tissues or organs. Modifier screens, such as synthetic genetic arrays (SGA) can help us to understand the phenotype caused by combined gene mutations. Unfortunately, exhaustive tests on all possible combined mutations in any genome are vulnerable to combinatorial explosion and are infeasible either technically or financially. Therefore, an accurate computational approach to predict genetic interaction is highly desirable, and such methods have the potential of alleviating the bottleneck on experiment design. RESULTS: In this work, we introduce a computational systems biology approach for the accurate prediction of pairwise synthetic genetic interactions (SGI). First, a high-coverage and high-precision functional gene network (FGN) is constructed by integrating protein-protein interaction (PPI), protein complex and gene expression data; then, a graph-based semi-supervised learning (SSL) classifier is utilized to identify SGI, where the topological properties of protein pairs in weighted FGN is used as input features of the classifier. We compare the proposed SSL method with the state-of-the-art supervised classifier, the support vector machines (SVM), on a benchmark dataset in S. cerevisiae to validate our method's ability to distinguish synthetic genetic interactions from non-interaction gene pairs. Experimental results show that the proposed method can accurately predict genetic interactions in S. cerevisiae (with a sensitivity of 92% and specificity of 91%). Noticeably, the SSL method is more efficient than SVM, especially for very small training sets and large test sets. CONCLUSIONS: We developed a graph-based SSL classifier for predicting the SGI. The classifier employs topological properties of weighted FGN as input features and simultaneously employs information induced from labelled and unlabelled data. Our analysis indicates that the topological properties of weighted FGN can be employed to accurately predict SGI. Also, the graph-based SSL method outperforms the traditional standard supervised approach, especially when used with small training sets. The proposed method can alleviate experimental burden of exhaustive test and provide a useful guide for the biologist in narrowing down the candidate gene pairs with SGI. The data and source code implementing the method are available from the website: http://home.ustc.edu.cn/~yzh33108/GeneticInterPred.htm
format Text
id pubmed-2909217
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-29092172010-07-24 A semi-supervised learning approach to predict synthetic genetic interactions by combining functional and topological properties of functional gene network You, Zhu-Hong Yin, Zheng Han, Kyungsook Huang, De-Shuang Zhou, Xiaobo BMC Bioinformatics Research Article BACKGROUND: Genetic interaction profiles are highly informative and helpful for understanding the functional linkages between genes, and therefore have been extensively exploited for annotating gene functions and dissecting specific pathway structures. However, our understanding is rather limited to the relationship between double concurrent perturbation and various higher level phenotypic changes, e.g. those in cells, tissues or organs. Modifier screens, such as synthetic genetic arrays (SGA) can help us to understand the phenotype caused by combined gene mutations. Unfortunately, exhaustive tests on all possible combined mutations in any genome are vulnerable to combinatorial explosion and are infeasible either technically or financially. Therefore, an accurate computational approach to predict genetic interaction is highly desirable, and such methods have the potential of alleviating the bottleneck on experiment design. RESULTS: In this work, we introduce a computational systems biology approach for the accurate prediction of pairwise synthetic genetic interactions (SGI). First, a high-coverage and high-precision functional gene network (FGN) is constructed by integrating protein-protein interaction (PPI), protein complex and gene expression data; then, a graph-based semi-supervised learning (SSL) classifier is utilized to identify SGI, where the topological properties of protein pairs in weighted FGN is used as input features of the classifier. We compare the proposed SSL method with the state-of-the-art supervised classifier, the support vector machines (SVM), on a benchmark dataset in S. cerevisiae to validate our method's ability to distinguish synthetic genetic interactions from non-interaction gene pairs. Experimental results show that the proposed method can accurately predict genetic interactions in S. cerevisiae (with a sensitivity of 92% and specificity of 91%). Noticeably, the SSL method is more efficient than SVM, especially for very small training sets and large test sets. CONCLUSIONS: We developed a graph-based SSL classifier for predicting the SGI. The classifier employs topological properties of weighted FGN as input features and simultaneously employs information induced from labelled and unlabelled data. Our analysis indicates that the topological properties of weighted FGN can be employed to accurately predict SGI. Also, the graph-based SSL method outperforms the traditional standard supervised approach, especially when used with small training sets. The proposed method can alleviate experimental burden of exhaustive test and provide a useful guide for the biologist in narrowing down the candidate gene pairs with SGI. The data and source code implementing the method are available from the website: http://home.ustc.edu.cn/~yzh33108/GeneticInterPred.htm BioMed Central 2010-06-24 /pmc/articles/PMC2909217/ /pubmed/20573270 http://dx.doi.org/10.1186/1471-2105-11-343 Text en Copyright ©2010 You et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
You, Zhu-Hong
Yin, Zheng
Han, Kyungsook
Huang, De-Shuang
Zhou, Xiaobo
A semi-supervised learning approach to predict synthetic genetic interactions by combining functional and topological properties of functional gene network
title A semi-supervised learning approach to predict synthetic genetic interactions by combining functional and topological properties of functional gene network
title_full A semi-supervised learning approach to predict synthetic genetic interactions by combining functional and topological properties of functional gene network
title_fullStr A semi-supervised learning approach to predict synthetic genetic interactions by combining functional and topological properties of functional gene network
title_full_unstemmed A semi-supervised learning approach to predict synthetic genetic interactions by combining functional and topological properties of functional gene network
title_short A semi-supervised learning approach to predict synthetic genetic interactions by combining functional and topological properties of functional gene network
title_sort semi-supervised learning approach to predict synthetic genetic interactions by combining functional and topological properties of functional gene network
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2909217/
https://www.ncbi.nlm.nih.gov/pubmed/20573270
http://dx.doi.org/10.1186/1471-2105-11-343
work_keys_str_mv AT youzhuhong asemisupervisedlearningapproachtopredictsyntheticgeneticinteractionsbycombiningfunctionalandtopologicalpropertiesoffunctionalgenenetwork
AT yinzheng asemisupervisedlearningapproachtopredictsyntheticgeneticinteractionsbycombiningfunctionalandtopologicalpropertiesoffunctionalgenenetwork
AT hankyungsook asemisupervisedlearningapproachtopredictsyntheticgeneticinteractionsbycombiningfunctionalandtopologicalpropertiesoffunctionalgenenetwork
AT huangdeshuang asemisupervisedlearningapproachtopredictsyntheticgeneticinteractionsbycombiningfunctionalandtopologicalpropertiesoffunctionalgenenetwork
AT zhouxiaobo asemisupervisedlearningapproachtopredictsyntheticgeneticinteractionsbycombiningfunctionalandtopologicalpropertiesoffunctionalgenenetwork
AT youzhuhong semisupervisedlearningapproachtopredictsyntheticgeneticinteractionsbycombiningfunctionalandtopologicalpropertiesoffunctionalgenenetwork
AT yinzheng semisupervisedlearningapproachtopredictsyntheticgeneticinteractionsbycombiningfunctionalandtopologicalpropertiesoffunctionalgenenetwork
AT hankyungsook semisupervisedlearningapproachtopredictsyntheticgeneticinteractionsbycombiningfunctionalandtopologicalpropertiesoffunctionalgenenetwork
AT huangdeshuang semisupervisedlearningapproachtopredictsyntheticgeneticinteractionsbycombiningfunctionalandtopologicalpropertiesoffunctionalgenenetwork
AT zhouxiaobo semisupervisedlearningapproachtopredictsyntheticgeneticinteractionsbycombiningfunctionalandtopologicalpropertiesoffunctionalgenenetwork