Cargando…

Accurate prediction of virus-host protein-protein interactions via a Siamese neural network using deep protein sequence embeddings

Prediction and understanding of virus-host protein-protein interactions (PPIs) have relevance for the development of novel therapeutic interventions. In addition, virus-like particles open novel opportunities to deliver therapeutics to targeted cell types and tissues. Given our incomplete knowledge...

Descripción completa

Detalles Bibliográficos
Autores principales: Madan, Sumit, Demina, Victoria, Stapf, Marcus, Ernst, Oliver, Fröhlich, Holger
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9481957/
https://www.ncbi.nlm.nih.gov/pubmed/36124304
http://dx.doi.org/10.1016/j.patter.2022.100551
_version_ 1784791354506739712
author Madan, Sumit
Demina, Victoria
Stapf, Marcus
Ernst, Oliver
Fröhlich, Holger
author_facet Madan, Sumit
Demina, Victoria
Stapf, Marcus
Ernst, Oliver
Fröhlich, Holger
author_sort Madan, Sumit
collection PubMed
description Prediction and understanding of virus-host protein-protein interactions (PPIs) have relevance for the development of novel therapeutic interventions. In addition, virus-like particles open novel opportunities to deliver therapeutics to targeted cell types and tissues. Given our incomplete knowledge of PPIs on the one hand and the cost and time associated with experimental procedures on the other, we here propose a deep learning approach to predict virus-host PPIs. Our method (Siamese Tailored deep sequence Embedding of Proteins [STEP]) is based on recent deep protein sequence embedding techniques, which we integrate into a Siamese neural network. After showing the state-of-the-art performance of STEP on external datasets, we apply it to two use cases, severe acute respiratory syndrome coronavirus 2 and John Cunningham polyomavirus, to predict virus-host PPIs. Altogether our work highlights the potential of deep sequence embedding techniques originating from the field of NLP as well as explainable artificial intelligence methods for the analysis of biological sequences.
format Online
Article
Text
id pubmed-9481957
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-94819572022-09-18 Accurate prediction of virus-host protein-protein interactions via a Siamese neural network using deep protein sequence embeddings Madan, Sumit Demina, Victoria Stapf, Marcus Ernst, Oliver Fröhlich, Holger Patterns (N Y) Article Prediction and understanding of virus-host protein-protein interactions (PPIs) have relevance for the development of novel therapeutic interventions. In addition, virus-like particles open novel opportunities to deliver therapeutics to targeted cell types and tissues. Given our incomplete knowledge of PPIs on the one hand and the cost and time associated with experimental procedures on the other, we here propose a deep learning approach to predict virus-host PPIs. Our method (Siamese Tailored deep sequence Embedding of Proteins [STEP]) is based on recent deep protein sequence embedding techniques, which we integrate into a Siamese neural network. After showing the state-of-the-art performance of STEP on external datasets, we apply it to two use cases, severe acute respiratory syndrome coronavirus 2 and John Cunningham polyomavirus, to predict virus-host PPIs. Altogether our work highlights the potential of deep sequence embedding techniques originating from the field of NLP as well as explainable artificial intelligence methods for the analysis of biological sequences. Elsevier 2022-07-31 /pmc/articles/PMC9481957/ /pubmed/36124304 http://dx.doi.org/10.1016/j.patter.2022.100551 Text en © 2022 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Article
Madan, Sumit
Demina, Victoria
Stapf, Marcus
Ernst, Oliver
Fröhlich, Holger
Accurate prediction of virus-host protein-protein interactions via a Siamese neural network using deep protein sequence embeddings
title Accurate prediction of virus-host protein-protein interactions via a Siamese neural network using deep protein sequence embeddings
title_full Accurate prediction of virus-host protein-protein interactions via a Siamese neural network using deep protein sequence embeddings
title_fullStr Accurate prediction of virus-host protein-protein interactions via a Siamese neural network using deep protein sequence embeddings
title_full_unstemmed Accurate prediction of virus-host protein-protein interactions via a Siamese neural network using deep protein sequence embeddings
title_short Accurate prediction of virus-host protein-protein interactions via a Siamese neural network using deep protein sequence embeddings
title_sort accurate prediction of virus-host protein-protein interactions via a siamese neural network using deep protein sequence embeddings
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9481957/
https://www.ncbi.nlm.nih.gov/pubmed/36124304
http://dx.doi.org/10.1016/j.patter.2022.100551
work_keys_str_mv AT madansumit accuratepredictionofvirushostproteinproteininteractionsviaasiameseneuralnetworkusingdeepproteinsequenceembeddings
AT deminavictoria accuratepredictionofvirushostproteinproteininteractionsviaasiameseneuralnetworkusingdeepproteinsequenceembeddings
AT stapfmarcus accuratepredictionofvirushostproteinproteininteractionsviaasiameseneuralnetworkusingdeepproteinsequenceembeddings
AT ernstoliver accuratepredictionofvirushostproteinproteininteractionsviaasiameseneuralnetworkusingdeepproteinsequenceembeddings
AT frohlichholger accuratepredictionofvirushostproteinproteininteractionsviaasiameseneuralnetworkusingdeepproteinsequenceembeddings