Cargando…

Predicting virus-host association by Kernelized logistic matrix factorization and similarity network fusion

BACKGROUND: Viruses are closely related to bacteria and human diseases. It is of great significance to predict associations between viruses and hosts for understanding the dynamics and complex functional networks in microbial community. With the rapid development of the metagenomics sequencing, some...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Dan, Ma, Yingjun, Jiang, Xingpeng, He, Tingting
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6886165/
https://www.ncbi.nlm.nih.gov/pubmed/31787095
http://dx.doi.org/10.1186/s12859-019-3082-0
_version_ 1783474829215662080
author Liu, Dan
Ma, Yingjun
Jiang, Xingpeng
He, Tingting
author_facet Liu, Dan
Ma, Yingjun
Jiang, Xingpeng
He, Tingting
author_sort Liu, Dan
collection PubMed
description BACKGROUND: Viruses are closely related to bacteria and human diseases. It is of great significance to predict associations between viruses and hosts for understanding the dynamics and complex functional networks in microbial community. With the rapid development of the metagenomics sequencing, some methods based on sequence similarity and genomic homology have been used to predict associations between viruses and hosts. However, the known virus-host association network was ignored in these methods. RESULTS: We proposed a kernelized logistic matrix factorization with integrating different information to predict potential virus-host associations on the heterogeneous network (ILMF-VH) which is constructed by connecting a virus network with a host network based on known virus-host associations. The virus network is constructed based on oligonucleotide frequency measurement, and the host network is constructed by integrating oligonucleotide frequency similarity and Gaussian interaction profile kernel similarity through similarity network fusion. The host prediction accuracy of our method is better than other methods. In addition, case studies show that the host of crAssphage predicted by ILMF-VH is consistent with presumed host in previous studies, and another potential host Escherichia coli is also predicted. CONCLUSIONS: The proposed model is an effective computational tool for predicting interactions between viruses and hosts effectively, and it has great potential for discovering novel hosts of viruses.
format Online
Article
Text
id pubmed-6886165
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-68861652019-12-11 Predicting virus-host association by Kernelized logistic matrix factorization and similarity network fusion Liu, Dan Ma, Yingjun Jiang, Xingpeng He, Tingting BMC Bioinformatics Research BACKGROUND: Viruses are closely related to bacteria and human diseases. It is of great significance to predict associations between viruses and hosts for understanding the dynamics and complex functional networks in microbial community. With the rapid development of the metagenomics sequencing, some methods based on sequence similarity and genomic homology have been used to predict associations between viruses and hosts. However, the known virus-host association network was ignored in these methods. RESULTS: We proposed a kernelized logistic matrix factorization with integrating different information to predict potential virus-host associations on the heterogeneous network (ILMF-VH) which is constructed by connecting a virus network with a host network based on known virus-host associations. The virus network is constructed based on oligonucleotide frequency measurement, and the host network is constructed by integrating oligonucleotide frequency similarity and Gaussian interaction profile kernel similarity through similarity network fusion. The host prediction accuracy of our method is better than other methods. In addition, case studies show that the host of crAssphage predicted by ILMF-VH is consistent with presumed host in previous studies, and another potential host Escherichia coli is also predicted. CONCLUSIONS: The proposed model is an effective computational tool for predicting interactions between viruses and hosts effectively, and it has great potential for discovering novel hosts of viruses. BioMed Central 2019-12-02 /pmc/articles/PMC6886165/ /pubmed/31787095 http://dx.doi.org/10.1186/s12859-019-3082-0 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Liu, Dan
Ma, Yingjun
Jiang, Xingpeng
He, Tingting
Predicting virus-host association by Kernelized logistic matrix factorization and similarity network fusion
title Predicting virus-host association by Kernelized logistic matrix factorization and similarity network fusion
title_full Predicting virus-host association by Kernelized logistic matrix factorization and similarity network fusion
title_fullStr Predicting virus-host association by Kernelized logistic matrix factorization and similarity network fusion
title_full_unstemmed Predicting virus-host association by Kernelized logistic matrix factorization and similarity network fusion
title_short Predicting virus-host association by Kernelized logistic matrix factorization and similarity network fusion
title_sort predicting virus-host association by kernelized logistic matrix factorization and similarity network fusion
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6886165/
https://www.ncbi.nlm.nih.gov/pubmed/31787095
http://dx.doi.org/10.1186/s12859-019-3082-0
work_keys_str_mv AT liudan predictingvirushostassociationbykernelizedlogisticmatrixfactorizationandsimilaritynetworkfusion
AT mayingjun predictingvirushostassociationbykernelizedlogisticmatrixfactorizationandsimilaritynetworkfusion
AT jiangxingpeng predictingvirushostassociationbykernelizedlogisticmatrixfactorizationandsimilaritynetworkfusion
AT hetingting predictingvirushostassociationbykernelizedlogisticmatrixfactorizationandsimilaritynetworkfusion