8.2% of the Human Genome Is Constrained: Variation in Rates of Turnover across Functional Element Classes in the Human Lineage

Bibliographic Details
Main authors: Rands, Chris M., Meader, Stephen, Ponting, Chris P., Lunter, Gerton
Format: Online Article Text
Language: English
Published: Public Library of Science 2014
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4109858/
https://www.ncbi.nlm.nih.gov/pubmed/25057982
http://dx.doi.org/10.1371/journal.pgen.1004525
collection PubMed
description Ten years on from the finishing of the human reference genome sequence, it remains unclear what fraction of the human genome confers function, where this sequence resides, and how much is shared with other mammalian species. When addressing these questions, functional sequence has often been equated with pan-mammalian conserved sequence. However, functional elements that are short-lived, including those contributing to species-specific biology, will not leave a footprint of long-lasting negative selection. Here, we address these issues by identifying and characterising sequence that has been constrained with respect to insertions and deletions for pairs of eutherian genomes over a range of divergences. Within noncoding sequence, we find increasing amounts of mutually constrained sequence as species pairs become more closely related, indicating that noncoding constrained sequence turns over rapidly. We estimate that half of present-day noncoding constrained sequence has been gained or lost in approximately the last 130 million years (half-life in units of divergence time, d(1/2) = 0.25–0.31). While enriched with ENCODE biochemical annotations, much of the short-lived constrained sequences we identify are not detected by models optimized for wider pan-mammalian conservation. Constrained DNase 1 hypersensitivity sites, promoters and untranslated regions have been more evolutionarily stable than long noncoding RNA loci which have turned over especially rapidly. By contrast, protein coding sequence has been highly stable, with an estimated half-life of over a billion years (d(1/2) = 2.1–5.0). From extrapolations we estimate that 8.2% (7.1–9.2%) of the human genome is presently subject to negative selection and thus is likely to be functional, while only 2.2% has maintained constraint in both human and mouse since these species diverged. These results reveal that the evolutionary history of the human genome has been highly dynamic, particularly for its noncoding yet biologically functional fraction.
id pubmed-4109858
institution National Center for Biotechnology Information
record_format MEDLINE/PubMed
spelling pubmed-4109858 2014-07-29
PLoS Genet Research Article
Public Library of Science 2014-07-24
Text en © 2014 Rands et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
title 8.2% of the Human Genome Is Constrained: Variation in Rates of Turnover across Functional Element Classes in the Human Lineage
topic Research Article