Cargando…

Predicting residues involved in anti-DNA autoantibodies with limited neural networks

ABSTRACT: Computer-aided rational vaccine design (RVD) and synthetic pharmacology are rapidly developing fields that leverage existing datasets for developing compounds of interest. Computational proteomics utilizes algorithms and models to probe proteins for functional prediction. A potentially str...

Descripción completa

Detalles Bibliográficos
Autores principales:	St. Clair, Rachel, Teti, Michael, Pavlovic, Mirjana, Hahn, William, Barenholtz, Elan
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Springer Berlin Heidelberg 2022
Materias:	Original Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8932093/ https://www.ncbi.nlm.nih.gov/pubmed/35303216 http://dx.doi.org/10.1007/s11517-022-02539-7

_version_	1784671384682627072
author	St. Clair, Rachel Teti, Michael Pavlovic, Mirjana Hahn, William Barenholtz, Elan
author_facet	St. Clair, Rachel Teti, Michael Pavlovic, Mirjana Hahn, William Barenholtz, Elan
author_sort	St. Clair, Rachel
collection	PubMed
description	ABSTRACT: Computer-aided rational vaccine design (RVD) and synthetic pharmacology are rapidly developing fields that leverage existing datasets for developing compounds of interest. Computational proteomics utilizes algorithms and models to probe proteins for functional prediction. A potentially strong target for computational approach is autoimmune antibodies, which are the result of broken tolerance in the immune system where it cannot distinguish “self” from “non-self” resulting in attack of its own structures (proteins and DNA, mainly). The information on structure, function, and pathogenicity of autoantibodies may assist in engineering RVD against autoimmune diseases. Current computational approaches exploit large datasets curated with extensive domain knowledge, most of which include the need for many resources and have been applied indirectly to problems of interest for DNA, RNA, and monomer protein binding. We present a novel method for discovering potential binding sites. We employed long short-term memory (LSTM) models trained on FASTA primary sequences to predict protein binding in DNA-binding hydrolytic antibodies (abzymes). We also employed CNN models applied to the same dataset for comparison with LSTM. While the CNN model outperformed the LSTM on the primary task of binding prediction, analysis of internal model representations of both models showed that the LSTM models recovered sub-sequences that were strongly correlated with sites known to be involved in binding. These results demonstrate that analysis of internal processes of LSTM models may serve as a powerful tool for primary sequence analysis. GRAPHICAL ABSTRACT: [Image: see text]
format	Online Article Text
id	pubmed-8932093
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Springer Berlin Heidelberg
record_format	MEDLINE/PubMed
spelling	pubmed-89320932022-03-18 Predicting residues involved in anti-DNA autoantibodies with limited neural networks St. Clair, Rachel Teti, Michael Pavlovic, Mirjana Hahn, William Barenholtz, Elan Med Biol Eng Comput Original Article ABSTRACT: Computer-aided rational vaccine design (RVD) and synthetic pharmacology are rapidly developing fields that leverage existing datasets for developing compounds of interest. Computational proteomics utilizes algorithms and models to probe proteins for functional prediction. A potentially strong target for computational approach is autoimmune antibodies, which are the result of broken tolerance in the immune system where it cannot distinguish “self” from “non-self” resulting in attack of its own structures (proteins and DNA, mainly). The information on structure, function, and pathogenicity of autoantibodies may assist in engineering RVD against autoimmune diseases. Current computational approaches exploit large datasets curated with extensive domain knowledge, most of which include the need for many resources and have been applied indirectly to problems of interest for DNA, RNA, and monomer protein binding. We present a novel method for discovering potential binding sites. We employed long short-term memory (LSTM) models trained on FASTA primary sequences to predict protein binding in DNA-binding hydrolytic antibodies (abzymes). We also employed CNN models applied to the same dataset for comparison with LSTM. While the CNN model outperformed the LSTM on the primary task of binding prediction, analysis of internal model representations of both models showed that the LSTM models recovered sub-sequences that were strongly correlated with sites known to be involved in binding. These results demonstrate that analysis of internal processes of LSTM models may serve as a powerful tool for primary sequence analysis. GRAPHICAL ABSTRACT: [Image: see text] Springer Berlin Heidelberg 2022-03-18 2022 /pmc/articles/PMC8932093/ /pubmed/35303216 http://dx.doi.org/10.1007/s11517-022-02539-7 Text en © International Federation for Medical and Biological Engineering 2022 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle	Original Article St. Clair, Rachel Teti, Michael Pavlovic, Mirjana Hahn, William Barenholtz, Elan Predicting residues involved in anti-DNA autoantibodies with limited neural networks
title	Predicting residues involved in anti-DNA autoantibodies with limited neural networks
title_full	Predicting residues involved in anti-DNA autoantibodies with limited neural networks
title_fullStr	Predicting residues involved in anti-DNA autoantibodies with limited neural networks
title_full_unstemmed	Predicting residues involved in anti-DNA autoantibodies with limited neural networks
title_short	Predicting residues involved in anti-DNA autoantibodies with limited neural networks
title_sort	predicting residues involved in anti-dna autoantibodies with limited neural networks
topic	Original Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8932093/ https://www.ncbi.nlm.nih.gov/pubmed/35303216 http://dx.doi.org/10.1007/s11517-022-02539-7
work_keys_str_mv	AT stclairrachel predictingresiduesinvolvedinantidnaautoantibodieswithlimitedneuralnetworks AT tetimichael predictingresiduesinvolvedinantidnaautoantibodieswithlimitedneuralnetworks AT pavlovicmirjana predictingresiduesinvolvedinantidnaautoantibodieswithlimitedneuralnetworks AT hahnwilliam predictingresiduesinvolvedinantidnaautoantibodieswithlimitedneuralnetworks AT barenholtzelan predictingresiduesinvolvedinantidnaautoantibodieswithlimitedneuralnetworks

Predicting residues involved in anti-DNA autoantibodies with limited neural networks

Ejemplares similares