Cargando…

mmCSM-NA: accurately predicting effects of single and multiple mutations on protein–nucleic acid binding affinity

While protein–nucleic acid interactions are pivotal for many crucial biological processes, limited experimental data has made the development of computational approaches to characterise these interactions a challenge. Consequently, most approaches to understand the effects of missense mutations on p...

Descripción completa

Detalles Bibliográficos
Autores principales: Nguyen, Thanh Binh, Myung, Yoochan, de Sá, Alex G C, Pires, Douglas E V, Ascher, David B
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8600011/
https://www.ncbi.nlm.nih.gov/pubmed/34805992
http://dx.doi.org/10.1093/nargab/lqab109
_version_ 1784601059541385216
author Nguyen, Thanh Binh
Myung, Yoochan
de Sá, Alex G C
Pires, Douglas E V
Ascher, David B
author_facet Nguyen, Thanh Binh
Myung, Yoochan
de Sá, Alex G C
Pires, Douglas E V
Ascher, David B
author_sort Nguyen, Thanh Binh
collection PubMed
description While protein–nucleic acid interactions are pivotal for many crucial biological processes, limited experimental data has made the development of computational approaches to characterise these interactions a challenge. Consequently, most approaches to understand the effects of missense mutations on protein-nucleic acid affinity have focused on single-point mutations and have presented a limited performance on independent data sets. To overcome this, we have curated the largest dataset of experimentally measured effects of mutations on nucleic acid binding affinity to date, encompassing 856 single-point mutations and 141 multiple-point mutations across 155 experimentally solved complexes. This was used in combination with an optimized version of our graph-based signatures to develop mmCSM-NA (http://biosig.unimelb.edu.au/mmcsm_na), the first scalable method capable of quantitatively and accurately predicting the effects of multiple-point mutations on nucleic acid binding affinities. mmCSM-NA obtained a Pearson's correlation of up to 0.67 (RMSE of 1.06 Kcal/mol) on single-point mutations under cross-validation, and up to 0.65 on independent non-redundant datasets of multiple-point mutations (RMSE of 1.12 kcal/mol), outperforming similar tools. mmCSM-NA is freely available as an easy-to-use web-server and API. We believe it will be an invaluable tool to shed light on the role of mutations affecting protein–nucleic acid interactions in diseases.
format Online
Article
Text
id pubmed-8600011
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-86000112021-11-18 mmCSM-NA: accurately predicting effects of single and multiple mutations on protein–nucleic acid binding affinity Nguyen, Thanh Binh Myung, Yoochan de Sá, Alex G C Pires, Douglas E V Ascher, David B NAR Genom Bioinform Standard Article While protein–nucleic acid interactions are pivotal for many crucial biological processes, limited experimental data has made the development of computational approaches to characterise these interactions a challenge. Consequently, most approaches to understand the effects of missense mutations on protein-nucleic acid affinity have focused on single-point mutations and have presented a limited performance on independent data sets. To overcome this, we have curated the largest dataset of experimentally measured effects of mutations on nucleic acid binding affinity to date, encompassing 856 single-point mutations and 141 multiple-point mutations across 155 experimentally solved complexes. This was used in combination with an optimized version of our graph-based signatures to develop mmCSM-NA (http://biosig.unimelb.edu.au/mmcsm_na), the first scalable method capable of quantitatively and accurately predicting the effects of multiple-point mutations on nucleic acid binding affinities. mmCSM-NA obtained a Pearson's correlation of up to 0.67 (RMSE of 1.06 Kcal/mol) on single-point mutations under cross-validation, and up to 0.65 on independent non-redundant datasets of multiple-point mutations (RMSE of 1.12 kcal/mol), outperforming similar tools. mmCSM-NA is freely available as an easy-to-use web-server and API. We believe it will be an invaluable tool to shed light on the role of mutations affecting protein–nucleic acid interactions in diseases. Oxford University Press 2021-11-17 /pmc/articles/PMC8600011/ /pubmed/34805992 http://dx.doi.org/10.1093/nargab/lqab109 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Standard Article
Nguyen, Thanh Binh
Myung, Yoochan
de Sá, Alex G C
Pires, Douglas E V
Ascher, David B
mmCSM-NA: accurately predicting effects of single and multiple mutations on protein–nucleic acid binding affinity
title mmCSM-NA: accurately predicting effects of single and multiple mutations on protein–nucleic acid binding affinity
title_full mmCSM-NA: accurately predicting effects of single and multiple mutations on protein–nucleic acid binding affinity
title_fullStr mmCSM-NA: accurately predicting effects of single and multiple mutations on protein–nucleic acid binding affinity
title_full_unstemmed mmCSM-NA: accurately predicting effects of single and multiple mutations on protein–nucleic acid binding affinity
title_short mmCSM-NA: accurately predicting effects of single and multiple mutations on protein–nucleic acid binding affinity
title_sort mmcsm-na: accurately predicting effects of single and multiple mutations on protein–nucleic acid binding affinity
topic Standard Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8600011/
https://www.ncbi.nlm.nih.gov/pubmed/34805992
http://dx.doi.org/10.1093/nargab/lqab109
work_keys_str_mv AT nguyenthanhbinh mmcsmnaaccuratelypredictingeffectsofsingleandmultiplemutationsonproteinnucleicacidbindingaffinity
AT myungyoochan mmcsmnaaccuratelypredictingeffectsofsingleandmultiplemutationsonproteinnucleicacidbindingaffinity
AT desaalexgc mmcsmnaaccuratelypredictingeffectsofsingleandmultiplemutationsonproteinnucleicacidbindingaffinity
AT piresdouglasev mmcsmnaaccuratelypredictingeffectsofsingleandmultiplemutationsonproteinnucleicacidbindingaffinity
AT ascherdavidb mmcsmnaaccuratelypredictingeffectsofsingleandmultiplemutationsonproteinnucleicacidbindingaffinity