Cargando…
Using an integrative machine learning approach utilising homology modelling to clinically interpret genetic variants: CACNA1F as an exemplar
Advances in DNA sequencing technologies have revolutionised rare disease diagnostics and have led to a dramatic increase in the volume of available genomic data. A key challenge that needs to be overcome to realise the full potential of these technologies is that of precisely predicting the effect o...
Autores principales: | , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer International Publishing
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7608274/ https://www.ncbi.nlm.nih.gov/pubmed/32313206 http://dx.doi.org/10.1038/s41431-020-0623-y |
_version_ | 1783604802596372480 |
---|---|
author | Sallah, Shalaw R. Sergouniotis, Panagiotis I. Barton, Stephanie Ramsden, Simon Taylor, Rachel L. Safadi, Amro Kabir, Mitra Ellingford, Jamie M. Lench, Nick Lovell, Simon C. Black, Graeme C. M. |
author_facet | Sallah, Shalaw R. Sergouniotis, Panagiotis I. Barton, Stephanie Ramsden, Simon Taylor, Rachel L. Safadi, Amro Kabir, Mitra Ellingford, Jamie M. Lench, Nick Lovell, Simon C. Black, Graeme C. M. |
author_sort | Sallah, Shalaw R. |
collection | PubMed |
description | Advances in DNA sequencing technologies have revolutionised rare disease diagnostics and have led to a dramatic increase in the volume of available genomic data. A key challenge that needs to be overcome to realise the full potential of these technologies is that of precisely predicting the effect of genetic variants on molecular and organismal phenotypes. Notably, despite recent progress, there is still a lack of robust in silico tools that accurately assign clinical significance to variants. Genetic alterations in the CACNA1F gene are the commonest cause of X-linked incomplete Congenital Stationary Night Blindness (iCSNB), a condition associated with non-progressive visual impairment. We combined genetic and homology modelling data to produce CACNA1F-vp, an in silico model that differentiates disease-implicated from benign missense CACNA1F changes. CACNA1F-vp predicts variant effects on the structure of the CACNA1F encoded protein (a calcium channel) using parameters based upon changes in amino acid properties; these include size, charge, hydrophobicity, and position. The model produces an overall score for each variant that can be used to predict its pathogenicity. CACNA1F-vp outperformed four other tools in identifying disease-implicated variants (area under receiver operating characteristic and precision recall curves = 0.84; Matthews correlation coefficient = 0.52) using a tenfold cross-validation technique. We consider this protein-specific model to be a robust stand-alone diagnostic classifier that could be replicated in other proteins and could enable precise and timely diagnosis. |
format | Online Article Text |
id | pubmed-7608274 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Springer International Publishing |
record_format | MEDLINE/PubMed |
spelling | pubmed-76082742020-11-05 Using an integrative machine learning approach utilising homology modelling to clinically interpret genetic variants: CACNA1F as an exemplar Sallah, Shalaw R. Sergouniotis, Panagiotis I. Barton, Stephanie Ramsden, Simon Taylor, Rachel L. Safadi, Amro Kabir, Mitra Ellingford, Jamie M. Lench, Nick Lovell, Simon C. Black, Graeme C. M. Eur J Hum Genet Article Advances in DNA sequencing technologies have revolutionised rare disease diagnostics and have led to a dramatic increase in the volume of available genomic data. A key challenge that needs to be overcome to realise the full potential of these technologies is that of precisely predicting the effect of genetic variants on molecular and organismal phenotypes. Notably, despite recent progress, there is still a lack of robust in silico tools that accurately assign clinical significance to variants. Genetic alterations in the CACNA1F gene are the commonest cause of X-linked incomplete Congenital Stationary Night Blindness (iCSNB), a condition associated with non-progressive visual impairment. We combined genetic and homology modelling data to produce CACNA1F-vp, an in silico model that differentiates disease-implicated from benign missense CACNA1F changes. CACNA1F-vp predicts variant effects on the structure of the CACNA1F encoded protein (a calcium channel) using parameters based upon changes in amino acid properties; these include size, charge, hydrophobicity, and position. The model produces an overall score for each variant that can be used to predict its pathogenicity. CACNA1F-vp outperformed four other tools in identifying disease-implicated variants (area under receiver operating characteristic and precision recall curves = 0.84; Matthews correlation coefficient = 0.52) using a tenfold cross-validation technique. We consider this protein-specific model to be a robust stand-alone diagnostic classifier that could be replicated in other proteins and could enable precise and timely diagnosis. Springer International Publishing 2020-04-20 2020-09 /pmc/articles/PMC7608274/ /pubmed/32313206 http://dx.doi.org/10.1038/s41431-020-0623-y Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Sallah, Shalaw R. Sergouniotis, Panagiotis I. Barton, Stephanie Ramsden, Simon Taylor, Rachel L. Safadi, Amro Kabir, Mitra Ellingford, Jamie M. Lench, Nick Lovell, Simon C. Black, Graeme C. M. Using an integrative machine learning approach utilising homology modelling to clinically interpret genetic variants: CACNA1F as an exemplar |
title | Using an integrative machine learning approach utilising homology modelling to clinically interpret genetic variants: CACNA1F as an exemplar |
title_full | Using an integrative machine learning approach utilising homology modelling to clinically interpret genetic variants: CACNA1F as an exemplar |
title_fullStr | Using an integrative machine learning approach utilising homology modelling to clinically interpret genetic variants: CACNA1F as an exemplar |
title_full_unstemmed | Using an integrative machine learning approach utilising homology modelling to clinically interpret genetic variants: CACNA1F as an exemplar |
title_short | Using an integrative machine learning approach utilising homology modelling to clinically interpret genetic variants: CACNA1F as an exemplar |
title_sort | using an integrative machine learning approach utilising homology modelling to clinically interpret genetic variants: cacna1f as an exemplar |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7608274/ https://www.ncbi.nlm.nih.gov/pubmed/32313206 http://dx.doi.org/10.1038/s41431-020-0623-y |
work_keys_str_mv | AT sallahshalawr usinganintegrativemachinelearningapproachutilisinghomologymodellingtoclinicallyinterpretgeneticvariantscacna1fasanexemplar AT sergouniotispanagiotisi usinganintegrativemachinelearningapproachutilisinghomologymodellingtoclinicallyinterpretgeneticvariantscacna1fasanexemplar AT bartonstephanie usinganintegrativemachinelearningapproachutilisinghomologymodellingtoclinicallyinterpretgeneticvariantscacna1fasanexemplar AT ramsdensimon usinganintegrativemachinelearningapproachutilisinghomologymodellingtoclinicallyinterpretgeneticvariantscacna1fasanexemplar AT taylorrachell usinganintegrativemachinelearningapproachutilisinghomologymodellingtoclinicallyinterpretgeneticvariantscacna1fasanexemplar AT safadiamro usinganintegrativemachinelearningapproachutilisinghomologymodellingtoclinicallyinterpretgeneticvariantscacna1fasanexemplar AT kabirmitra usinganintegrativemachinelearningapproachutilisinghomologymodellingtoclinicallyinterpretgeneticvariantscacna1fasanexemplar AT ellingfordjamiem usinganintegrativemachinelearningapproachutilisinghomologymodellingtoclinicallyinterpretgeneticvariantscacna1fasanexemplar AT lenchnick usinganintegrativemachinelearningapproachutilisinghomologymodellingtoclinicallyinterpretgeneticvariantscacna1fasanexemplar AT lovellsimonc usinganintegrativemachinelearningapproachutilisinghomologymodellingtoclinicallyinterpretgeneticvariantscacna1fasanexemplar AT blackgraemecm usinganintegrativemachinelearningapproachutilisinghomologymodellingtoclinicallyinterpretgeneticvariantscacna1fasanexemplar |