Cargando…
Computational reconstruction of proteome-wide protein interaction networks between HTLV retroviruses and Homo sapiens
BACKGROUND: Human T-cell leukemia viruses (HTLV) tend to induce some fatal human diseases like Adult T-cell Leukemia (ATL) by targeting human T lymphocytes. To indentify the protein-protein interactions (PPI) between HTLV viruses and Homo sapiens is one of the significant approaches to reveal the un...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4133621/ https://www.ncbi.nlm.nih.gov/pubmed/25037487 http://dx.doi.org/10.1186/1471-2105-15-245 |
_version_ | 1782330766266466304 |
---|---|
author | Mei, Suyu Zhu, Hao |
author_facet | Mei, Suyu Zhu, Hao |
author_sort | Mei, Suyu |
collection | PubMed |
description | BACKGROUND: Human T-cell leukemia viruses (HTLV) tend to induce some fatal human diseases like Adult T-cell Leukemia (ATL) by targeting human T lymphocytes. To indentify the protein-protein interactions (PPI) between HTLV viruses and Homo sapiens is one of the significant approaches to reveal the underlying mechanism of HTLV infection and host defence. At present, as biological experiments are labor-intensive and expensive, the identified part of the HTLV-human PPI networks is rather small. Although recent years have witnessed much progress in computational modeling for reconstructing pathogen-host PPI networks, data scarcity and data unavailability are two major challenges to be effectively addressed. To our knowledge, no computational method for proteome-wide HTLV-human PPI networks reconstruction has been reported. RESULTS: In this work we develop Multi-instance Adaboost method to conduct homolog knowledge transfer for computationally reconstructing proteome-wide HTLV-human PPI networks. In this method, the homolog knowledge in the form of gene ontology (GO) is treated as auxiliary homolog instance to address the problems of data scarcity and data unavailability, while the potential negative knowledge transfer is automatically attenuated by AdaBoost instance reweighting. The cross validation experiments show that the homolog knowledge transfer in the form of independent homolog instances can effectively enrich the feature information and substitute for the missing GO information. Moreover, the independent tests show that the method can validate 70.3% of the recently curated interactions, significantly exceeding the 2.1% recognition rate by the HT-Y2H experiment. We have used the method to reconstruct the proteome-wide HTLV-human PPI networks and further conducted gene ontology based clustering of the predicted networks for further biomedical research. The gene ontology based clustering analysis of the predictions provides much biological insight into the pathogenesis of HTLV retroviruses. CONCLUSIONS: The Multi-instance AdaBoost method can effectively address the problems of data scarcity and data unavailability for the proteome-wide HTLV-human PPI interaction networks reconstruction. The gene ontology based clustering analysis of the predictions reveals some important signaling pathways and biological modules that HTLV retroviruses are likely to target. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2105-15-245) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4133621 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-41336212014-08-16 Computational reconstruction of proteome-wide protein interaction networks between HTLV retroviruses and Homo sapiens Mei, Suyu Zhu, Hao BMC Bioinformatics Research Article BACKGROUND: Human T-cell leukemia viruses (HTLV) tend to induce some fatal human diseases like Adult T-cell Leukemia (ATL) by targeting human T lymphocytes. To indentify the protein-protein interactions (PPI) between HTLV viruses and Homo sapiens is one of the significant approaches to reveal the underlying mechanism of HTLV infection and host defence. At present, as biological experiments are labor-intensive and expensive, the identified part of the HTLV-human PPI networks is rather small. Although recent years have witnessed much progress in computational modeling for reconstructing pathogen-host PPI networks, data scarcity and data unavailability are two major challenges to be effectively addressed. To our knowledge, no computational method for proteome-wide HTLV-human PPI networks reconstruction has been reported. RESULTS: In this work we develop Multi-instance Adaboost method to conduct homolog knowledge transfer for computationally reconstructing proteome-wide HTLV-human PPI networks. In this method, the homolog knowledge in the form of gene ontology (GO) is treated as auxiliary homolog instance to address the problems of data scarcity and data unavailability, while the potential negative knowledge transfer is automatically attenuated by AdaBoost instance reweighting. The cross validation experiments show that the homolog knowledge transfer in the form of independent homolog instances can effectively enrich the feature information and substitute for the missing GO information. Moreover, the independent tests show that the method can validate 70.3% of the recently curated interactions, significantly exceeding the 2.1% recognition rate by the HT-Y2H experiment. We have used the method to reconstruct the proteome-wide HTLV-human PPI networks and further conducted gene ontology based clustering of the predicted networks for further biomedical research. The gene ontology based clustering analysis of the predictions provides much biological insight into the pathogenesis of HTLV retroviruses. CONCLUSIONS: The Multi-instance AdaBoost method can effectively address the problems of data scarcity and data unavailability for the proteome-wide HTLV-human PPI interaction networks reconstruction. The gene ontology based clustering analysis of the predictions reveals some important signaling pathways and biological modules that HTLV retroviruses are likely to target. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2105-15-245) contains supplementary material, which is available to authorized users. BioMed Central 2014-07-18 /pmc/articles/PMC4133621/ /pubmed/25037487 http://dx.doi.org/10.1186/1471-2105-15-245 Text en © Mei and Zhu; licensee BioMed Central Ltd. 2014 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Mei, Suyu Zhu, Hao Computational reconstruction of proteome-wide protein interaction networks between HTLV retroviruses and Homo sapiens |
title | Computational reconstruction of proteome-wide protein interaction networks between HTLV retroviruses and Homo sapiens |
title_full | Computational reconstruction of proteome-wide protein interaction networks between HTLV retroviruses and Homo sapiens |
title_fullStr | Computational reconstruction of proteome-wide protein interaction networks between HTLV retroviruses and Homo sapiens |
title_full_unstemmed | Computational reconstruction of proteome-wide protein interaction networks between HTLV retroviruses and Homo sapiens |
title_short | Computational reconstruction of proteome-wide protein interaction networks between HTLV retroviruses and Homo sapiens |
title_sort | computational reconstruction of proteome-wide protein interaction networks between htlv retroviruses and homo sapiens |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4133621/ https://www.ncbi.nlm.nih.gov/pubmed/25037487 http://dx.doi.org/10.1186/1471-2105-15-245 |
work_keys_str_mv | AT meisuyu computationalreconstructionofproteomewideproteininteractionnetworksbetweenhtlvretrovirusesandhomosapiens AT zhuhao computationalreconstructionofproteomewideproteininteractionnetworksbetweenhtlvretrovirusesandhomosapiens |