Cargando…

ASCARIS: Positional feature annotation and protein structure-based representation of single amino acid variations

BACKGROUND: Genomic variations may cause deleterious effects on protein functionality and perturb biological processes. Elucidating the effects of variations is critical for developing novel treatment strategies for diseases of genetic origin. Computational approaches have been aiding the work in th...

Descripción completa

Detalles Bibliográficos
Autores principales: Cankara, Fatma, Doğan, Tunca
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Research Network of Computational and Structural Biotechnology 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10562615/
https://www.ncbi.nlm.nih.gov/pubmed/37822561
http://dx.doi.org/10.1016/j.csbj.2023.09.017
_version_ 1785118166926491648
author Cankara, Fatma
Doğan, Tunca
author_facet Cankara, Fatma
Doğan, Tunca
author_sort Cankara, Fatma
collection PubMed
description BACKGROUND: Genomic variations may cause deleterious effects on protein functionality and perturb biological processes. Elucidating the effects of variations is critical for developing novel treatment strategies for diseases of genetic origin. Computational approaches have been aiding the work in this field by modeling and analyzing the mutational landscape. However, new approaches are required, especially for accurate representation and data-centric analysis of sequence variations. METHOD: In this study, we propose ASCARIS (Annotation and StruCture-bAsed RepresentatIon of Single amino acid variations), a method for the featurization (i.e., quantitative representation) of single amino acid variations (SAVs), which could be used for a variety of purposes, such as predicting their functional effects or building multi-omics-based integrative models. ASCARIS utilizes the direct and spatial correspondence between the location of the SAV on the sequence/structure and 30 different types of positional feature annotations (e.g., active/lipidation/glycosylation sites; calcium/metal/DNA binding, inter/transmembrane regions, etc.), along with structural features and physicochemical properties. The main novelty of this method lies in constructing reusable numerical representations of SAVs via functional annotations. RESULTS: We statistically analyzed the relationship between these features and the consequences of variations and found that each carries information in this regard. To investigate potential applications of ASCARIS, we trained variant effect prediction models that utilize our SAV representations as input. We carried out an ablation study and a comparison against the state-of-the-art methods and observed that ASCARIS has a competing and complementary performance against widely-used predictors. ASCARIS can be used alone or in combination with other approaches to represent SAVs from a functional perspective. ASCARIS is available as a programmatic tool at https://github.com/HUBioDataLab/ASCARIS and as a web-service at https://huggingface.co/spaces/HUBioDataLab/ASCARIS.
format Online
Article
Text
id pubmed-10562615
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Research Network of Computational and Structural Biotechnology
record_format MEDLINE/PubMed
spelling pubmed-105626152023-10-11 ASCARIS: Positional feature annotation and protein structure-based representation of single amino acid variations Cankara, Fatma Doğan, Tunca Comput Struct Biotechnol J Software/Web Server Article BACKGROUND: Genomic variations may cause deleterious effects on protein functionality and perturb biological processes. Elucidating the effects of variations is critical for developing novel treatment strategies for diseases of genetic origin. Computational approaches have been aiding the work in this field by modeling and analyzing the mutational landscape. However, new approaches are required, especially for accurate representation and data-centric analysis of sequence variations. METHOD: In this study, we propose ASCARIS (Annotation and StruCture-bAsed RepresentatIon of Single amino acid variations), a method for the featurization (i.e., quantitative representation) of single amino acid variations (SAVs), which could be used for a variety of purposes, such as predicting their functional effects or building multi-omics-based integrative models. ASCARIS utilizes the direct and spatial correspondence between the location of the SAV on the sequence/structure and 30 different types of positional feature annotations (e.g., active/lipidation/glycosylation sites; calcium/metal/DNA binding, inter/transmembrane regions, etc.), along with structural features and physicochemical properties. The main novelty of this method lies in constructing reusable numerical representations of SAVs via functional annotations. RESULTS: We statistically analyzed the relationship between these features and the consequences of variations and found that each carries information in this regard. To investigate potential applications of ASCARIS, we trained variant effect prediction models that utilize our SAV representations as input. We carried out an ablation study and a comparison against the state-of-the-art methods and observed that ASCARIS has a competing and complementary performance against widely-used predictors. ASCARIS can be used alone or in combination with other approaches to represent SAVs from a functional perspective. ASCARIS is available as a programmatic tool at https://github.com/HUBioDataLab/ASCARIS and as a web-service at https://huggingface.co/spaces/HUBioDataLab/ASCARIS. Research Network of Computational and Structural Biotechnology 2023-09-17 /pmc/articles/PMC10562615/ /pubmed/37822561 http://dx.doi.org/10.1016/j.csbj.2023.09.017 Text en © 2023 Published by Elsevier B.V. on behalf of Research Network of Computational and Structural Biotechnology. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Software/Web Server Article
Cankara, Fatma
Doğan, Tunca
ASCARIS: Positional feature annotation and protein structure-based representation of single amino acid variations
title ASCARIS: Positional feature annotation and protein structure-based representation of single amino acid variations
title_full ASCARIS: Positional feature annotation and protein structure-based representation of single amino acid variations
title_fullStr ASCARIS: Positional feature annotation and protein structure-based representation of single amino acid variations
title_full_unstemmed ASCARIS: Positional feature annotation and protein structure-based representation of single amino acid variations
title_short ASCARIS: Positional feature annotation and protein structure-based representation of single amino acid variations
title_sort ascaris: positional feature annotation and protein structure-based representation of single amino acid variations
topic Software/Web Server Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10562615/
https://www.ncbi.nlm.nih.gov/pubmed/37822561
http://dx.doi.org/10.1016/j.csbj.2023.09.017
work_keys_str_mv AT cankarafatma ascarispositionalfeatureannotationandproteinstructurebasedrepresentationofsingleaminoacidvariations
AT dogantunca ascarispositionalfeatureannotationandproteinstructurebasedrepresentationofsingleaminoacidvariations