Cargando…
A semi-supervised learning approach to predict synthetic genetic interactions by combining functional and topological properties of functional gene network
BACKGROUND: Genetic interaction profiles are highly informative and helpful for understanding the functional linkages between genes, and therefore have been extensively exploited for annotating gene functions and dissecting specific pathway structures. However, our understanding is rather limited to...
Autores principales: | , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2010
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2909217/ https://www.ncbi.nlm.nih.gov/pubmed/20573270 http://dx.doi.org/10.1186/1471-2105-11-343 |
_version_ | 1782184289441415168 |
---|---|
author | You, Zhu-Hong Yin, Zheng Han, Kyungsook Huang, De-Shuang Zhou, Xiaobo |
author_facet | You, Zhu-Hong Yin, Zheng Han, Kyungsook Huang, De-Shuang Zhou, Xiaobo |
author_sort | You, Zhu-Hong |
collection | PubMed |
description | BACKGROUND: Genetic interaction profiles are highly informative and helpful for understanding the functional linkages between genes, and therefore have been extensively exploited for annotating gene functions and dissecting specific pathway structures. However, our understanding is rather limited to the relationship between double concurrent perturbation and various higher level phenotypic changes, e.g. those in cells, tissues or organs. Modifier screens, such as synthetic genetic arrays (SGA) can help us to understand the phenotype caused by combined gene mutations. Unfortunately, exhaustive tests on all possible combined mutations in any genome are vulnerable to combinatorial explosion and are infeasible either technically or financially. Therefore, an accurate computational approach to predict genetic interaction is highly desirable, and such methods have the potential of alleviating the bottleneck on experiment design. RESULTS: In this work, we introduce a computational systems biology approach for the accurate prediction of pairwise synthetic genetic interactions (SGI). First, a high-coverage and high-precision functional gene network (FGN) is constructed by integrating protein-protein interaction (PPI), protein complex and gene expression data; then, a graph-based semi-supervised learning (SSL) classifier is utilized to identify SGI, where the topological properties of protein pairs in weighted FGN is used as input features of the classifier. We compare the proposed SSL method with the state-of-the-art supervised classifier, the support vector machines (SVM), on a benchmark dataset in S. cerevisiae to validate our method's ability to distinguish synthetic genetic interactions from non-interaction gene pairs. Experimental results show that the proposed method can accurately predict genetic interactions in S. cerevisiae (with a sensitivity of 92% and specificity of 91%). Noticeably, the SSL method is more efficient than SVM, especially for very small training sets and large test sets. CONCLUSIONS: We developed a graph-based SSL classifier for predicting the SGI. The classifier employs topological properties of weighted FGN as input features and simultaneously employs information induced from labelled and unlabelled data. Our analysis indicates that the topological properties of weighted FGN can be employed to accurately predict SGI. Also, the graph-based SSL method outperforms the traditional standard supervised approach, especially when used with small training sets. The proposed method can alleviate experimental burden of exhaustive test and provide a useful guide for the biologist in narrowing down the candidate gene pairs with SGI. The data and source code implementing the method are available from the website: http://home.ustc.edu.cn/~yzh33108/GeneticInterPred.htm |
format | Text |
id | pubmed-2909217 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-29092172010-07-24 A semi-supervised learning approach to predict synthetic genetic interactions by combining functional and topological properties of functional gene network You, Zhu-Hong Yin, Zheng Han, Kyungsook Huang, De-Shuang Zhou, Xiaobo BMC Bioinformatics Research Article BACKGROUND: Genetic interaction profiles are highly informative and helpful for understanding the functional linkages between genes, and therefore have been extensively exploited for annotating gene functions and dissecting specific pathway structures. However, our understanding is rather limited to the relationship between double concurrent perturbation and various higher level phenotypic changes, e.g. those in cells, tissues or organs. Modifier screens, such as synthetic genetic arrays (SGA) can help us to understand the phenotype caused by combined gene mutations. Unfortunately, exhaustive tests on all possible combined mutations in any genome are vulnerable to combinatorial explosion and are infeasible either technically or financially. Therefore, an accurate computational approach to predict genetic interaction is highly desirable, and such methods have the potential of alleviating the bottleneck on experiment design. RESULTS: In this work, we introduce a computational systems biology approach for the accurate prediction of pairwise synthetic genetic interactions (SGI). First, a high-coverage and high-precision functional gene network (FGN) is constructed by integrating protein-protein interaction (PPI), protein complex and gene expression data; then, a graph-based semi-supervised learning (SSL) classifier is utilized to identify SGI, where the topological properties of protein pairs in weighted FGN is used as input features of the classifier. We compare the proposed SSL method with the state-of-the-art supervised classifier, the support vector machines (SVM), on a benchmark dataset in S. cerevisiae to validate our method's ability to distinguish synthetic genetic interactions from non-interaction gene pairs. Experimental results show that the proposed method can accurately predict genetic interactions in S. cerevisiae (with a sensitivity of 92% and specificity of 91%). Noticeably, the SSL method is more efficient than SVM, especially for very small training sets and large test sets. CONCLUSIONS: We developed a graph-based SSL classifier for predicting the SGI. The classifier employs topological properties of weighted FGN as input features and simultaneously employs information induced from labelled and unlabelled data. Our analysis indicates that the topological properties of weighted FGN can be employed to accurately predict SGI. Also, the graph-based SSL method outperforms the traditional standard supervised approach, especially when used with small training sets. The proposed method can alleviate experimental burden of exhaustive test and provide a useful guide for the biologist in narrowing down the candidate gene pairs with SGI. The data and source code implementing the method are available from the website: http://home.ustc.edu.cn/~yzh33108/GeneticInterPred.htm BioMed Central 2010-06-24 /pmc/articles/PMC2909217/ /pubmed/20573270 http://dx.doi.org/10.1186/1471-2105-11-343 Text en Copyright ©2010 You et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article You, Zhu-Hong Yin, Zheng Han, Kyungsook Huang, De-Shuang Zhou, Xiaobo A semi-supervised learning approach to predict synthetic genetic interactions by combining functional and topological properties of functional gene network |
title | A semi-supervised learning approach to predict synthetic genetic interactions by combining functional and topological properties of functional gene network |
title_full | A semi-supervised learning approach to predict synthetic genetic interactions by combining functional and topological properties of functional gene network |
title_fullStr | A semi-supervised learning approach to predict synthetic genetic interactions by combining functional and topological properties of functional gene network |
title_full_unstemmed | A semi-supervised learning approach to predict synthetic genetic interactions by combining functional and topological properties of functional gene network |
title_short | A semi-supervised learning approach to predict synthetic genetic interactions by combining functional and topological properties of functional gene network |
title_sort | semi-supervised learning approach to predict synthetic genetic interactions by combining functional and topological properties of functional gene network |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2909217/ https://www.ncbi.nlm.nih.gov/pubmed/20573270 http://dx.doi.org/10.1186/1471-2105-11-343 |
work_keys_str_mv | AT youzhuhong asemisupervisedlearningapproachtopredictsyntheticgeneticinteractionsbycombiningfunctionalandtopologicalpropertiesoffunctionalgenenetwork AT yinzheng asemisupervisedlearningapproachtopredictsyntheticgeneticinteractionsbycombiningfunctionalandtopologicalpropertiesoffunctionalgenenetwork AT hankyungsook asemisupervisedlearningapproachtopredictsyntheticgeneticinteractionsbycombiningfunctionalandtopologicalpropertiesoffunctionalgenenetwork AT huangdeshuang asemisupervisedlearningapproachtopredictsyntheticgeneticinteractionsbycombiningfunctionalandtopologicalpropertiesoffunctionalgenenetwork AT zhouxiaobo asemisupervisedlearningapproachtopredictsyntheticgeneticinteractionsbycombiningfunctionalandtopologicalpropertiesoffunctionalgenenetwork AT youzhuhong semisupervisedlearningapproachtopredictsyntheticgeneticinteractionsbycombiningfunctionalandtopologicalpropertiesoffunctionalgenenetwork AT yinzheng semisupervisedlearningapproachtopredictsyntheticgeneticinteractionsbycombiningfunctionalandtopologicalpropertiesoffunctionalgenenetwork AT hankyungsook semisupervisedlearningapproachtopredictsyntheticgeneticinteractionsbycombiningfunctionalandtopologicalpropertiesoffunctionalgenenetwork AT huangdeshuang semisupervisedlearningapproachtopredictsyntheticgeneticinteractionsbycombiningfunctionalandtopologicalpropertiesoffunctionalgenenetwork AT zhouxiaobo semisupervisedlearningapproachtopredictsyntheticgeneticinteractionsbycombiningfunctionalandtopologicalpropertiesoffunctionalgenenetwork |