Cargando…

A Simple Method for Analyzing Exome Sequencing Data Shows Distinct Levels of Nonsynonymous Variation for Human Immune and Nervous System Genes

To measure the strength of natural selection that acts upon single nucleotide variants (SNVs) in a set of human genes, we calculate the ratio between nonsynonymous SNVs (nsSNVs) per nonsynonymous site and synonymous SNVs (sSNVs) per synonymous site. We transform this ratio with a respective factor f...

Descripción completa

Detalles Bibliográficos
Autores principales: Freudenberg, Jan, Gregersen, Peter K., Freudenberg-Hua, Yun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3368947/
https://www.ncbi.nlm.nih.gov/pubmed/22701602
http://dx.doi.org/10.1371/journal.pone.0038087
_version_ 1782235017450094592
author Freudenberg, Jan
Gregersen, Peter K.
Freudenberg-Hua, Yun
author_facet Freudenberg, Jan
Gregersen, Peter K.
Freudenberg-Hua, Yun
author_sort Freudenberg, Jan
collection PubMed
description To measure the strength of natural selection that acts upon single nucleotide variants (SNVs) in a set of human genes, we calculate the ratio between nonsynonymous SNVs (nsSNVs) per nonsynonymous site and synonymous SNVs (sSNVs) per synonymous site. We transform this ratio with a respective factor f that corrects for the bias of synonymous sites towards transitions in the genetic code and different mutation rates for transitions and transversions. This method approximates the relative density of nsSNVs (rdnsv) in comparison with the neutral expectation as inferred from the density of sSNVs. Using SNVs from a diploid genome and 200 exomes, we apply our method to immune system genes (ISGs), nervous system genes (NSGs), randomly sampled genes (RSGs), and gene ontology annotated genes. The estimate of rdnsv in an individual exome is around 20% for NSGs and 30–40% for ISGs and RSGs. This smaller rdnsv of NSGs indicates overall stronger purifying selection. To quantify the relative shift of nsSNVs towards rare variants, we next fit a linear regression model to the estimates of rdnsv over different SNV allele frequency bins. The obtained regression models show a negative slope for NSGs, ISGs and RSGs, supporting an influence of purifying selection on the frequency spectrum of segregating nsSNVs. The y-intercept of the model predicts rdnsv for an allele frequency close to 0. This parameter can be interpreted as the proportion of nonsynonymous sites where mutations are tolerated to segregate with an allele frequency notably greater than 0 in the population, given the performed normalization of the observed nsSNV to sSNV ratio. A smaller y-intercept is displayed by NSGs, indicating more nonsynonymous sites under strong negative selection. This predicts more monogenically inherited or de-novo mutation diseases that affect the nervous system.
format Online
Article
Text
id pubmed-3368947
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-33689472012-06-13 A Simple Method for Analyzing Exome Sequencing Data Shows Distinct Levels of Nonsynonymous Variation for Human Immune and Nervous System Genes Freudenberg, Jan Gregersen, Peter K. Freudenberg-Hua, Yun PLoS One Research Article To measure the strength of natural selection that acts upon single nucleotide variants (SNVs) in a set of human genes, we calculate the ratio between nonsynonymous SNVs (nsSNVs) per nonsynonymous site and synonymous SNVs (sSNVs) per synonymous site. We transform this ratio with a respective factor f that corrects for the bias of synonymous sites towards transitions in the genetic code and different mutation rates for transitions and transversions. This method approximates the relative density of nsSNVs (rdnsv) in comparison with the neutral expectation as inferred from the density of sSNVs. Using SNVs from a diploid genome and 200 exomes, we apply our method to immune system genes (ISGs), nervous system genes (NSGs), randomly sampled genes (RSGs), and gene ontology annotated genes. The estimate of rdnsv in an individual exome is around 20% for NSGs and 30–40% for ISGs and RSGs. This smaller rdnsv of NSGs indicates overall stronger purifying selection. To quantify the relative shift of nsSNVs towards rare variants, we next fit a linear regression model to the estimates of rdnsv over different SNV allele frequency bins. The obtained regression models show a negative slope for NSGs, ISGs and RSGs, supporting an influence of purifying selection on the frequency spectrum of segregating nsSNVs. The y-intercept of the model predicts rdnsv for an allele frequency close to 0. This parameter can be interpreted as the proportion of nonsynonymous sites where mutations are tolerated to segregate with an allele frequency notably greater than 0 in the population, given the performed normalization of the observed nsSNV to sSNV ratio. A smaller y-intercept is displayed by NSGs, indicating more nonsynonymous sites under strong negative selection. This predicts more monogenically inherited or de-novo mutation diseases that affect the nervous system. Public Library of Science 2012-06-06 /pmc/articles/PMC3368947/ /pubmed/22701602 http://dx.doi.org/10.1371/journal.pone.0038087 Text en Freudenberg et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Freudenberg, Jan
Gregersen, Peter K.
Freudenberg-Hua, Yun
A Simple Method for Analyzing Exome Sequencing Data Shows Distinct Levels of Nonsynonymous Variation for Human Immune and Nervous System Genes
title A Simple Method for Analyzing Exome Sequencing Data Shows Distinct Levels of Nonsynonymous Variation for Human Immune and Nervous System Genes
title_full A Simple Method for Analyzing Exome Sequencing Data Shows Distinct Levels of Nonsynonymous Variation for Human Immune and Nervous System Genes
title_fullStr A Simple Method for Analyzing Exome Sequencing Data Shows Distinct Levels of Nonsynonymous Variation for Human Immune and Nervous System Genes
title_full_unstemmed A Simple Method for Analyzing Exome Sequencing Data Shows Distinct Levels of Nonsynonymous Variation for Human Immune and Nervous System Genes
title_short A Simple Method for Analyzing Exome Sequencing Data Shows Distinct Levels of Nonsynonymous Variation for Human Immune and Nervous System Genes
title_sort simple method for analyzing exome sequencing data shows distinct levels of nonsynonymous variation for human immune and nervous system genes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3368947/
https://www.ncbi.nlm.nih.gov/pubmed/22701602
http://dx.doi.org/10.1371/journal.pone.0038087
work_keys_str_mv AT freudenbergjan asimplemethodforanalyzingexomesequencingdatashowsdistinctlevelsofnonsynonymousvariationforhumanimmuneandnervoussystemgenes
AT gregersenpeterk asimplemethodforanalyzingexomesequencingdatashowsdistinctlevelsofnonsynonymousvariationforhumanimmuneandnervoussystemgenes
AT freudenberghuayun asimplemethodforanalyzingexomesequencingdatashowsdistinctlevelsofnonsynonymousvariationforhumanimmuneandnervoussystemgenes
AT freudenbergjan simplemethodforanalyzingexomesequencingdatashowsdistinctlevelsofnonsynonymousvariationforhumanimmuneandnervoussystemgenes
AT gregersenpeterk simplemethodforanalyzingexomesequencingdatashowsdistinctlevelsofnonsynonymousvariationforhumanimmuneandnervoussystemgenes
AT freudenberghuayun simplemethodforanalyzingexomesequencingdatashowsdistinctlevelsofnonsynonymousvariationforhumanimmuneandnervoussystemgenes