Cargando…

Genome bioinformatic analysis of nonsynonymous SNPs

BACKGROUND: Genome-wide association studies of common diseases for common, low penetrance causal variants are underway. A proportion of these will alter protein sequences, the most common of which is the non-synonymous single nucleotide polymorphism (nsSNP). It would be an advantage if the functiona...

Descripción completa

Detalles Bibliográficos
Autores principales: Burke, David F, Worth, Catherine L, Priego, Eva-Maria, Cheng, Tammy, Smink, Luc J, Todd, John A, Blundell, Tom L
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1978506/
https://www.ncbi.nlm.nih.gov/pubmed/17708757
http://dx.doi.org/10.1186/1471-2105-8-301
_version_ 1782135408071540736
author Burke, David F
Worth, Catherine L
Priego, Eva-Maria
Cheng, Tammy
Smink, Luc J
Todd, John A
Blundell, Tom L
author_facet Burke, David F
Worth, Catherine L
Priego, Eva-Maria
Cheng, Tammy
Smink, Luc J
Todd, John A
Blundell, Tom L
author_sort Burke, David F
collection PubMed
description BACKGROUND: Genome-wide association studies of common diseases for common, low penetrance causal variants are underway. A proportion of these will alter protein sequences, the most common of which is the non-synonymous single nucleotide polymorphism (nsSNP). It would be an advantage if the functional effects of an nsSNP on protein structure and function could be predicted, both for the final identification process of a causal variant in a disease-associated chromosome region, and in further functional analyses of the nsSNP and its disease-associated protein. RESULTS: In the present report we have compared and contrasted structure- and sequence-based methods of prediction to over 5500 genes carrying nearly 24,000 nsSNPs, by employing an automatic comparative modelling procedure to build models for the genes. The nsSNP information came from two sources, the OMIM database which are rare (minor allele frequency, MAF, < 0.01) and are known to cause penetrant, monogenic diseases. Secondly, nsSNP information came from dbSNP125, for which the vast majority of nsSNPs, mostly MAF > 0.05, have no known link to a disease. For over 40% of the nsSNPs, structure-based methods predicted which of these sequence changes are likely to either disrupt the structure of the protein or interfere with the function or interactions of the protein. For the remaining 60%, we generated sequence-based predictions. CONCLUSION: We show that, in general, the prediction tools are able distinguish disease causing mutations from those mutations which are thought to have a neutral affect. We give examples of mutations in genes that are predicted to be deleterious and may have a role in disease. Contrary to previous reports, we also show that rare mutations are consistently predicted to be deleterious as often as commonly occurring nsSNPs.
format Text
id pubmed-1978506
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-19785062007-09-19 Genome bioinformatic analysis of nonsynonymous SNPs Burke, David F Worth, Catherine L Priego, Eva-Maria Cheng, Tammy Smink, Luc J Todd, John A Blundell, Tom L BMC Bioinformatics Methodology Article BACKGROUND: Genome-wide association studies of common diseases for common, low penetrance causal variants are underway. A proportion of these will alter protein sequences, the most common of which is the non-synonymous single nucleotide polymorphism (nsSNP). It would be an advantage if the functional effects of an nsSNP on protein structure and function could be predicted, both for the final identification process of a causal variant in a disease-associated chromosome region, and in further functional analyses of the nsSNP and its disease-associated protein. RESULTS: In the present report we have compared and contrasted structure- and sequence-based methods of prediction to over 5500 genes carrying nearly 24,000 nsSNPs, by employing an automatic comparative modelling procedure to build models for the genes. The nsSNP information came from two sources, the OMIM database which are rare (minor allele frequency, MAF, < 0.01) and are known to cause penetrant, monogenic diseases. Secondly, nsSNP information came from dbSNP125, for which the vast majority of nsSNPs, mostly MAF > 0.05, have no known link to a disease. For over 40% of the nsSNPs, structure-based methods predicted which of these sequence changes are likely to either disrupt the structure of the protein or interfere with the function or interactions of the protein. For the remaining 60%, we generated sequence-based predictions. CONCLUSION: We show that, in general, the prediction tools are able distinguish disease causing mutations from those mutations which are thought to have a neutral affect. We give examples of mutations in genes that are predicted to be deleterious and may have a role in disease. Contrary to previous reports, we also show that rare mutations are consistently predicted to be deleterious as often as commonly occurring nsSNPs. BioMed Central 2007-08-20 /pmc/articles/PMC1978506/ /pubmed/17708757 http://dx.doi.org/10.1186/1471-2105-8-301 Text en Copyright © 2007 Burke et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Burke, David F
Worth, Catherine L
Priego, Eva-Maria
Cheng, Tammy
Smink, Luc J
Todd, John A
Blundell, Tom L
Genome bioinformatic analysis of nonsynonymous SNPs
title Genome bioinformatic analysis of nonsynonymous SNPs
title_full Genome bioinformatic analysis of nonsynonymous SNPs
title_fullStr Genome bioinformatic analysis of nonsynonymous SNPs
title_full_unstemmed Genome bioinformatic analysis of nonsynonymous SNPs
title_short Genome bioinformatic analysis of nonsynonymous SNPs
title_sort genome bioinformatic analysis of nonsynonymous snps
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1978506/
https://www.ncbi.nlm.nih.gov/pubmed/17708757
http://dx.doi.org/10.1186/1471-2105-8-301
work_keys_str_mv AT burkedavidf genomebioinformaticanalysisofnonsynonymoussnps
AT worthcatherinel genomebioinformaticanalysisofnonsynonymoussnps
AT priegoevamaria genomebioinformaticanalysisofnonsynonymoussnps
AT chengtammy genomebioinformaticanalysisofnonsynonymoussnps
AT sminklucj genomebioinformaticanalysisofnonsynonymoussnps
AT toddjohna genomebioinformaticanalysisofnonsynonymoussnps
AT blundelltoml genomebioinformaticanalysisofnonsynonymoussnps