Cargando…

Enhanced interpretation of 935 hotspot and non-hotspot RAS variants using evidence-based structural bioinformatics

In the current study, we report computational scores for advancing genomic interpretation of disease-associated genomic variation in members of the RAS family of genes. For this purpose, we applied 31 sequence- and 3D structure-based computational scores, chosen by their breadth of biophysical prope...

Descripción completa

Detalles Bibliográficos
Autores principales: Tripathi, Swarnendu, Dsouza, Nikita R., Mathison, Angela J., Leverence, Elise, Urrutia, Raul, Zimmermann, Michael T.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Research Network of Computational and Structural Biotechnology 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8688876/
https://www.ncbi.nlm.nih.gov/pubmed/34976316
http://dx.doi.org/10.1016/j.csbj.2021.12.007
Descripción
Sumario:In the current study, we report computational scores for advancing genomic interpretation of disease-associated genomic variation in members of the RAS family of genes. For this purpose, we applied 31 sequence- and 3D structure-based computational scores, chosen by their breadth of biophysical properties. We parametrized our data by assembling a numerically homogenized experimentally-derived dataset, which when use in our calculations reveal that computational scores using 3D structure highly correlate with experimental measures (e.g., GAP-mediated hydrolysis R(Spearman) = 0.80 and RAF affinity R(spearman) = 0.82), while sequence-based scores are discordant with this data. Performing all-against-all comparisons, we applied this parametrized modeling approach to the study of 935 RAS variants from 7 RAS genes, which led us to identify 4 groups of mutations according to distinct biochemical scores within each group. Each group was comprised of hotspot and non-hotspot KRAS variants, indicating that poorly characterized variants could functionally behave like pathogenic mutations. Combining computational scores using dimensionality reduction indicated that changes to local unfolding propensity associate with changes in enzyme activity by genomic variants. Hence, our systematic approach, combining methodologies from both clinical genomics and 3D structural bioinformatics, represents an expansion for interpreting genomic data, provides information of mechanistic value, and that is transferable to other proteins.