Cargando…

DeepViral: prediction of novel virus–host interactions from protein sequences and infectious disease phenotypes

MOTIVATION: Infectious diseases caused by novel viruses have become a major public health concern. Rapid identification of virus–host interactions can reveal mechanistic insights into infectious diseases and shed light on potential treatments. Current computational prediction methods for novel virus...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu-Wei, Wang, Kafkas, Şenay, Chen, Jun, Dimonaco, Nicholas J., Tegnér, Jesper, Hoehndorf, Robert
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8428617/
https://www.ncbi.nlm.nih.gov/pubmed/33682875
http://dx.doi.org/10.1093/bioinformatics/btab147
_version_ 1783750411214127104
author Liu-Wei, Wang
Kafkas, Şenay
Chen, Jun
Dimonaco, Nicholas J.
Tegnér, Jesper
Hoehndorf, Robert
author_facet Liu-Wei, Wang
Kafkas, Şenay
Chen, Jun
Dimonaco, Nicholas J.
Tegnér, Jesper
Hoehndorf, Robert
author_sort Liu-Wei, Wang
collection PubMed
description MOTIVATION: Infectious diseases caused by novel viruses have become a major public health concern. Rapid identification of virus–host interactions can reveal mechanistic insights into infectious diseases and shed light on potential treatments. Current computational prediction methods for novel viruses are based mainly on protein sequences. However, it is not clear to what extent other important features, such as the symptoms caused by the viruses, could contribute to a predictor. Disease phenotypes (i.e. signs and symptoms) are readily accessible from clinical diagnosis and we hypothesize that they may act as a potential proxy and an additional source of information for the underlying molecular interactions between the pathogens and hosts. RESULTS: We developed DeepViral, a deep learning based method that predicts protein–protein interactions (PPI) between humans and viruses. Motivated by the potential utility of infectious disease phenotypes, we first embedded human proteins and viruses in a shared space using their associated phenotypes and functions, supported by formalized background knowledge from biomedical ontologies. By jointly learning from protein sequences and phenotype features, DeepViral significantly improves over existing sequence-based methods for intra- and inter-species PPI prediction. AVAILABILITY AND IMPLEMENTATION: Code and datasets for reproduction and customization are available at https://github.com/bio-ontology-research-group/DeepViral. Prediction results for 14 virus families are available at https://doi.org/10.5281/zenodo.4429824. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-8428617
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-84286172021-09-10 DeepViral: prediction of novel virus–host interactions from protein sequences and infectious disease phenotypes Liu-Wei, Wang Kafkas, Şenay Chen, Jun Dimonaco, Nicholas J. Tegnér, Jesper Hoehndorf, Robert Bioinformatics Original Papers MOTIVATION: Infectious diseases caused by novel viruses have become a major public health concern. Rapid identification of virus–host interactions can reveal mechanistic insights into infectious diseases and shed light on potential treatments. Current computational prediction methods for novel viruses are based mainly on protein sequences. However, it is not clear to what extent other important features, such as the symptoms caused by the viruses, could contribute to a predictor. Disease phenotypes (i.e. signs and symptoms) are readily accessible from clinical diagnosis and we hypothesize that they may act as a potential proxy and an additional source of information for the underlying molecular interactions between the pathogens and hosts. RESULTS: We developed DeepViral, a deep learning based method that predicts protein–protein interactions (PPI) between humans and viruses. Motivated by the potential utility of infectious disease phenotypes, we first embedded human proteins and viruses in a shared space using their associated phenotypes and functions, supported by formalized background knowledge from biomedical ontologies. By jointly learning from protein sequences and phenotype features, DeepViral significantly improves over existing sequence-based methods for intra- and inter-species PPI prediction. AVAILABILITY AND IMPLEMENTATION: Code and datasets for reproduction and customization are available at https://github.com/bio-ontology-research-group/DeepViral. Prediction results for 14 virus families are available at https://doi.org/10.5281/zenodo.4429824. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2021-03-03 /pmc/articles/PMC8428617/ /pubmed/33682875 http://dx.doi.org/10.1093/bioinformatics/btab147 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Liu-Wei, Wang
Kafkas, Şenay
Chen, Jun
Dimonaco, Nicholas J.
Tegnér, Jesper
Hoehndorf, Robert
DeepViral: prediction of novel virus–host interactions from protein sequences and infectious disease phenotypes
title DeepViral: prediction of novel virus–host interactions from protein sequences and infectious disease phenotypes
title_full DeepViral: prediction of novel virus–host interactions from protein sequences and infectious disease phenotypes
title_fullStr DeepViral: prediction of novel virus–host interactions from protein sequences and infectious disease phenotypes
title_full_unstemmed DeepViral: prediction of novel virus–host interactions from protein sequences and infectious disease phenotypes
title_short DeepViral: prediction of novel virus–host interactions from protein sequences and infectious disease phenotypes
title_sort deepviral: prediction of novel virus–host interactions from protein sequences and infectious disease phenotypes
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8428617/
https://www.ncbi.nlm.nih.gov/pubmed/33682875
http://dx.doi.org/10.1093/bioinformatics/btab147
work_keys_str_mv AT liuweiwang deepviralpredictionofnovelvirushostinteractionsfromproteinsequencesandinfectiousdiseasephenotypes
AT kafkassenay deepviralpredictionofnovelvirushostinteractionsfromproteinsequencesandinfectiousdiseasephenotypes
AT chenjun deepviralpredictionofnovelvirushostinteractionsfromproteinsequencesandinfectiousdiseasephenotypes
AT dimonaconicholasj deepviralpredictionofnovelvirushostinteractionsfromproteinsequencesandinfectiousdiseasephenotypes
AT tegnerjesper deepviralpredictionofnovelvirushostinteractionsfromproteinsequencesandinfectiousdiseasephenotypes
AT hoehndorfrobert deepviralpredictionofnovelvirushostinteractionsfromproteinsequencesandinfectiousdiseasephenotypes