Cargando…

Using an integrative machine learning approach utilising homology modelling to clinically interpret genetic variants: CACNA1F as an exemplar

Advances in DNA sequencing technologies have revolutionised rare disease diagnostics and have led to a dramatic increase in the volume of available genomic data. A key challenge that needs to be overcome to realise the full potential of these technologies is that of precisely predicting the effect o...

Descripción completa

Detalles Bibliográficos
Autores principales: Sallah, Shalaw R., Sergouniotis, Panagiotis I., Barton, Stephanie, Ramsden, Simon, Taylor, Rachel L., Safadi, Amro, Kabir, Mitra, Ellingford, Jamie M., Lench, Nick, Lovell, Simon C., Black, Graeme C. M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7608274/
https://www.ncbi.nlm.nih.gov/pubmed/32313206
http://dx.doi.org/10.1038/s41431-020-0623-y
_version_ 1783604802596372480
author Sallah, Shalaw R.
Sergouniotis, Panagiotis I.
Barton, Stephanie
Ramsden, Simon
Taylor, Rachel L.
Safadi, Amro
Kabir, Mitra
Ellingford, Jamie M.
Lench, Nick
Lovell, Simon C.
Black, Graeme C. M.
author_facet Sallah, Shalaw R.
Sergouniotis, Panagiotis I.
Barton, Stephanie
Ramsden, Simon
Taylor, Rachel L.
Safadi, Amro
Kabir, Mitra
Ellingford, Jamie M.
Lench, Nick
Lovell, Simon C.
Black, Graeme C. M.
author_sort Sallah, Shalaw R.
collection PubMed
description Advances in DNA sequencing technologies have revolutionised rare disease diagnostics and have led to a dramatic increase in the volume of available genomic data. A key challenge that needs to be overcome to realise the full potential of these technologies is that of precisely predicting the effect of genetic variants on molecular and organismal phenotypes. Notably, despite recent progress, there is still a lack of robust in silico tools that accurately assign clinical significance to variants. Genetic alterations in the CACNA1F gene are the commonest cause of X-linked incomplete Congenital Stationary Night Blindness (iCSNB), a condition associated with non-progressive visual impairment. We combined genetic and homology modelling data to produce CACNA1F-vp, an in silico model that differentiates disease-implicated from benign missense CACNA1F changes. CACNA1F-vp predicts variant effects on the structure of the CACNA1F encoded protein (a calcium channel) using parameters based upon changes in amino acid properties; these include size, charge, hydrophobicity, and position. The model produces an overall score for each variant that can be used to predict its pathogenicity. CACNA1F-vp outperformed four other tools in identifying disease-implicated variants (area under receiver operating characteristic and precision recall curves = 0.84; Matthews correlation coefficient = 0.52) using a tenfold cross-validation technique. We consider this protein-specific model to be a robust stand-alone diagnostic classifier that could be replicated in other proteins and could enable precise and timely diagnosis.
format Online
Article
Text
id pubmed-7608274
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-76082742020-11-05 Using an integrative machine learning approach utilising homology modelling to clinically interpret genetic variants: CACNA1F as an exemplar Sallah, Shalaw R. Sergouniotis, Panagiotis I. Barton, Stephanie Ramsden, Simon Taylor, Rachel L. Safadi, Amro Kabir, Mitra Ellingford, Jamie M. Lench, Nick Lovell, Simon C. Black, Graeme C. M. Eur J Hum Genet Article Advances in DNA sequencing technologies have revolutionised rare disease diagnostics and have led to a dramatic increase in the volume of available genomic data. A key challenge that needs to be overcome to realise the full potential of these technologies is that of precisely predicting the effect of genetic variants on molecular and organismal phenotypes. Notably, despite recent progress, there is still a lack of robust in silico tools that accurately assign clinical significance to variants. Genetic alterations in the CACNA1F gene are the commonest cause of X-linked incomplete Congenital Stationary Night Blindness (iCSNB), a condition associated with non-progressive visual impairment. We combined genetic and homology modelling data to produce CACNA1F-vp, an in silico model that differentiates disease-implicated from benign missense CACNA1F changes. CACNA1F-vp predicts variant effects on the structure of the CACNA1F encoded protein (a calcium channel) using parameters based upon changes in amino acid properties; these include size, charge, hydrophobicity, and position. The model produces an overall score for each variant that can be used to predict its pathogenicity. CACNA1F-vp outperformed four other tools in identifying disease-implicated variants (area under receiver operating characteristic and precision recall curves = 0.84; Matthews correlation coefficient = 0.52) using a tenfold cross-validation technique. We consider this protein-specific model to be a robust stand-alone diagnostic classifier that could be replicated in other proteins and could enable precise and timely diagnosis. Springer International Publishing 2020-04-20 2020-09 /pmc/articles/PMC7608274/ /pubmed/32313206 http://dx.doi.org/10.1038/s41431-020-0623-y Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Sallah, Shalaw R.
Sergouniotis, Panagiotis I.
Barton, Stephanie
Ramsden, Simon
Taylor, Rachel L.
Safadi, Amro
Kabir, Mitra
Ellingford, Jamie M.
Lench, Nick
Lovell, Simon C.
Black, Graeme C. M.
Using an integrative machine learning approach utilising homology modelling to clinically interpret genetic variants: CACNA1F as an exemplar
title Using an integrative machine learning approach utilising homology modelling to clinically interpret genetic variants: CACNA1F as an exemplar
title_full Using an integrative machine learning approach utilising homology modelling to clinically interpret genetic variants: CACNA1F as an exemplar
title_fullStr Using an integrative machine learning approach utilising homology modelling to clinically interpret genetic variants: CACNA1F as an exemplar
title_full_unstemmed Using an integrative machine learning approach utilising homology modelling to clinically interpret genetic variants: CACNA1F as an exemplar
title_short Using an integrative machine learning approach utilising homology modelling to clinically interpret genetic variants: CACNA1F as an exemplar
title_sort using an integrative machine learning approach utilising homology modelling to clinically interpret genetic variants: cacna1f as an exemplar
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7608274/
https://www.ncbi.nlm.nih.gov/pubmed/32313206
http://dx.doi.org/10.1038/s41431-020-0623-y
work_keys_str_mv AT sallahshalawr usinganintegrativemachinelearningapproachutilisinghomologymodellingtoclinicallyinterpretgeneticvariantscacna1fasanexemplar
AT sergouniotispanagiotisi usinganintegrativemachinelearningapproachutilisinghomologymodellingtoclinicallyinterpretgeneticvariantscacna1fasanexemplar
AT bartonstephanie usinganintegrativemachinelearningapproachutilisinghomologymodellingtoclinicallyinterpretgeneticvariantscacna1fasanexemplar
AT ramsdensimon usinganintegrativemachinelearningapproachutilisinghomologymodellingtoclinicallyinterpretgeneticvariantscacna1fasanexemplar
AT taylorrachell usinganintegrativemachinelearningapproachutilisinghomologymodellingtoclinicallyinterpretgeneticvariantscacna1fasanexemplar
AT safadiamro usinganintegrativemachinelearningapproachutilisinghomologymodellingtoclinicallyinterpretgeneticvariantscacna1fasanexemplar
AT kabirmitra usinganintegrativemachinelearningapproachutilisinghomologymodellingtoclinicallyinterpretgeneticvariantscacna1fasanexemplar
AT ellingfordjamiem usinganintegrativemachinelearningapproachutilisinghomologymodellingtoclinicallyinterpretgeneticvariantscacna1fasanexemplar
AT lenchnick usinganintegrativemachinelearningapproachutilisinghomologymodellingtoclinicallyinterpretgeneticvariantscacna1fasanexemplar
AT lovellsimonc usinganintegrativemachinelearningapproachutilisinghomologymodellingtoclinicallyinterpretgeneticvariantscacna1fasanexemplar
AT blackgraemecm usinganintegrativemachinelearningapproachutilisinghomologymodellingtoclinicallyinterpretgeneticvariantscacna1fasanexemplar