Cargando…
Approximation to the Distribution of Fitness Effects across Functional Categories in Human Segregating Polymorphisms
Quantifying the proportion of polymorphic mutations that are deleterious or neutral is of fundamental importance to our understanding of evolution, disease genetics and the maintenance of variation genome-wide. Here, we develop an approximation to the distribution of fitness effects (DFE) of segrega...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4222666/ https://www.ncbi.nlm.nih.gov/pubmed/25375159 http://dx.doi.org/10.1371/journal.pgen.1004697 |
_version_ | 1782343079055851520 |
---|---|
author | Racimo, Fernando Schraiber, Joshua G. |
author_facet | Racimo, Fernando Schraiber, Joshua G. |
author_sort | Racimo, Fernando |
collection | PubMed |
description | Quantifying the proportion of polymorphic mutations that are deleterious or neutral is of fundamental importance to our understanding of evolution, disease genetics and the maintenance of variation genome-wide. Here, we develop an approximation to the distribution of fitness effects (DFE) of segregating single-nucleotide mutations in humans. Unlike previous methods, we do not assume that synonymous mutations are neutral or not strongly selected, and we do not rely on fitting the DFE of all new nonsynonymous mutations to a single probability distribution, which is poorly motivated on a biological level. We rely on a previously developed method that utilizes a variety of published annotations (including conservation scores, protein deleteriousness estimates and regulatory data) to score all mutations in the human genome based on how likely they are to be affected by negative selection, controlling for mutation rate. We map this and other conservation scores to a scale of fitness coefficients via maximum likelihood using diffusion theory and a Poisson random field model on SNP data. Our method serves to approximate the deleterious DFE of mutations that are segregating, regardless of their genomic consequence. We can then compare the proportion of mutations that are negatively selected or neutral across various categories, including different types of regulatory sites. We observe that the distribution of intergenic polymorphisms is highly peaked at neutrality, while the distribution of nonsynonymous polymorphisms has a second peak at [Image: see text]. Other types of polymorphisms have shapes that fall roughly in between these two. We find that transcriptional start sites, strong CTCF-enriched elements and enhancers are the regulatory categories with the largest proportion of deleterious polymorphisms. |
format | Online Article Text |
id | pubmed-4222666 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-42226662014-11-13 Approximation to the Distribution of Fitness Effects across Functional Categories in Human Segregating Polymorphisms Racimo, Fernando Schraiber, Joshua G. PLoS Genet Research Article Quantifying the proportion of polymorphic mutations that are deleterious or neutral is of fundamental importance to our understanding of evolution, disease genetics and the maintenance of variation genome-wide. Here, we develop an approximation to the distribution of fitness effects (DFE) of segregating single-nucleotide mutations in humans. Unlike previous methods, we do not assume that synonymous mutations are neutral or not strongly selected, and we do not rely on fitting the DFE of all new nonsynonymous mutations to a single probability distribution, which is poorly motivated on a biological level. We rely on a previously developed method that utilizes a variety of published annotations (including conservation scores, protein deleteriousness estimates and regulatory data) to score all mutations in the human genome based on how likely they are to be affected by negative selection, controlling for mutation rate. We map this and other conservation scores to a scale of fitness coefficients via maximum likelihood using diffusion theory and a Poisson random field model on SNP data. Our method serves to approximate the deleterious DFE of mutations that are segregating, regardless of their genomic consequence. We can then compare the proportion of mutations that are negatively selected or neutral across various categories, including different types of regulatory sites. We observe that the distribution of intergenic polymorphisms is highly peaked at neutrality, while the distribution of nonsynonymous polymorphisms has a second peak at [Image: see text]. Other types of polymorphisms have shapes that fall roughly in between these two. We find that transcriptional start sites, strong CTCF-enriched elements and enhancers are the regulatory categories with the largest proportion of deleterious polymorphisms. Public Library of Science 2014-11-06 /pmc/articles/PMC4222666/ /pubmed/25375159 http://dx.doi.org/10.1371/journal.pgen.1004697 Text en © 2014 Racimo, Schraiber http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Racimo, Fernando Schraiber, Joshua G. Approximation to the Distribution of Fitness Effects across Functional Categories in Human Segregating Polymorphisms |
title | Approximation to the Distribution of Fitness Effects across Functional Categories in Human Segregating Polymorphisms |
title_full | Approximation to the Distribution of Fitness Effects across Functional Categories in Human Segregating Polymorphisms |
title_fullStr | Approximation to the Distribution of Fitness Effects across Functional Categories in Human Segregating Polymorphisms |
title_full_unstemmed | Approximation to the Distribution of Fitness Effects across Functional Categories in Human Segregating Polymorphisms |
title_short | Approximation to the Distribution of Fitness Effects across Functional Categories in Human Segregating Polymorphisms |
title_sort | approximation to the distribution of fitness effects across functional categories in human segregating polymorphisms |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4222666/ https://www.ncbi.nlm.nih.gov/pubmed/25375159 http://dx.doi.org/10.1371/journal.pgen.1004697 |
work_keys_str_mv | AT racimofernando approximationtothedistributionoffitnesseffectsacrossfunctionalcategoriesinhumansegregatingpolymorphisms AT schraiberjoshuag approximationtothedistributionoffitnesseffectsacrossfunctionalcategoriesinhumansegregatingpolymorphisms |