Cargando…
Local conservation scores without a priori assumptions on neutral substitution rates
BACKGROUND: Comparative genomics aims to detect signals of evolutionary conservation as an indicator of functional constraint. Surprisingly, results of the ENCODE project revealed that about half of the experimentally verified functional elements found in non-coding DNA were classified as unconstrai...
Autores principales: | , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2008
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2375903/ https://www.ncbi.nlm.nih.gov/pubmed/18405366 http://dx.doi.org/10.1186/1471-2105-9-190 |
_version_ | 1782154671353233408 |
---|---|
author | Dingel, Janis Hanus, Pavol Leonardi, Niccolò Hagenauer, Joachim Zech, Jürgen Mueller, Jakob C |
author_facet | Dingel, Janis Hanus, Pavol Leonardi, Niccolò Hagenauer, Joachim Zech, Jürgen Mueller, Jakob C |
author_sort | Dingel, Janis |
collection | PubMed |
description | BACKGROUND: Comparative genomics aims to detect signals of evolutionary conservation as an indicator of functional constraint. Surprisingly, results of the ENCODE project revealed that about half of the experimentally verified functional elements found in non-coding DNA were classified as unconstrained by computational predictions. Following this observation, it has been hypothesized that this may be partly explained by biased estimates on neutral evolutionary rates used by existing sequence conservation metrics. All methods we are aware of rely on a comparison with the neutral rate and conservation is estimated by measuring the deviation of a particular genomic region from this rate. Consequently, it is a reasonable assumption that inaccurate neutral rate estimates may lead to biased conservation and constraint estimates. RESULTS: We propose a conservation signal that is produced by local Maximum Likelihood estimation of evolutionary parameters using an optimized sliding window and present a Kullback-Leibler projection that allows multiple different estimated parameters to be transformed into a conservation measure. This conservation measure does not rely on assumptions about neutral evolutionary substitution rates and little a priori assumptions on the properties of the conserved regions are imposed. We show the accuracy of our approach (KuLCons) on synthetic data and compare it to the scores generated by state-of-the-art methods (phastCons, GERP, SCONE) in an ENCODE region. We find that KuLCons is most often in agreement with the conservation/constraint signatures detected by GERP and SCONE while qualitatively very different patterns from phastCons are observed. Opposed to standard methods KuLCons can be extended to more complex evolutionary models, e.g. taking insertion and deletion events into account and corresponding results show that scores obtained under this model can diverge significantly from scores using the simpler model. CONCLUSION: Our results suggest that discriminating among the different degrees of conservation is possible without making assumptions about neutral rates. We find, however, that it cannot be expected to discover considerably different constraint regions than GERP and SCONE. Consequently, we conclude that the reported discrepancies between experimentally verified functional and computationally identified constraint elements are likely not to be explained by biased neutral rate estimates. |
format | Text |
id | pubmed-2375903 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2008 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-23759032008-05-12 Local conservation scores without a priori assumptions on neutral substitution rates Dingel, Janis Hanus, Pavol Leonardi, Niccolò Hagenauer, Joachim Zech, Jürgen Mueller, Jakob C BMC Bioinformatics Research Article BACKGROUND: Comparative genomics aims to detect signals of evolutionary conservation as an indicator of functional constraint. Surprisingly, results of the ENCODE project revealed that about half of the experimentally verified functional elements found in non-coding DNA were classified as unconstrained by computational predictions. Following this observation, it has been hypothesized that this may be partly explained by biased estimates on neutral evolutionary rates used by existing sequence conservation metrics. All methods we are aware of rely on a comparison with the neutral rate and conservation is estimated by measuring the deviation of a particular genomic region from this rate. Consequently, it is a reasonable assumption that inaccurate neutral rate estimates may lead to biased conservation and constraint estimates. RESULTS: We propose a conservation signal that is produced by local Maximum Likelihood estimation of evolutionary parameters using an optimized sliding window and present a Kullback-Leibler projection that allows multiple different estimated parameters to be transformed into a conservation measure. This conservation measure does not rely on assumptions about neutral evolutionary substitution rates and little a priori assumptions on the properties of the conserved regions are imposed. We show the accuracy of our approach (KuLCons) on synthetic data and compare it to the scores generated by state-of-the-art methods (phastCons, GERP, SCONE) in an ENCODE region. We find that KuLCons is most often in agreement with the conservation/constraint signatures detected by GERP and SCONE while qualitatively very different patterns from phastCons are observed. Opposed to standard methods KuLCons can be extended to more complex evolutionary models, e.g. taking insertion and deletion events into account and corresponding results show that scores obtained under this model can diverge significantly from scores using the simpler model. CONCLUSION: Our results suggest that discriminating among the different degrees of conservation is possible without making assumptions about neutral rates. We find, however, that it cannot be expected to discover considerably different constraint regions than GERP and SCONE. Consequently, we conclude that the reported discrepancies between experimentally verified functional and computationally identified constraint elements are likely not to be explained by biased neutral rate estimates. BioMed Central 2008-04-11 /pmc/articles/PMC2375903/ /pubmed/18405366 http://dx.doi.org/10.1186/1471-2105-9-190 Text en Copyright © 2008 Dingel et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Dingel, Janis Hanus, Pavol Leonardi, Niccolò Hagenauer, Joachim Zech, Jürgen Mueller, Jakob C Local conservation scores without a priori assumptions on neutral substitution rates |
title | Local conservation scores without a priori assumptions on neutral substitution rates |
title_full | Local conservation scores without a priori assumptions on neutral substitution rates |
title_fullStr | Local conservation scores without a priori assumptions on neutral substitution rates |
title_full_unstemmed | Local conservation scores without a priori assumptions on neutral substitution rates |
title_short | Local conservation scores without a priori assumptions on neutral substitution rates |
title_sort | local conservation scores without a priori assumptions on neutral substitution rates |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2375903/ https://www.ncbi.nlm.nih.gov/pubmed/18405366 http://dx.doi.org/10.1186/1471-2105-9-190 |
work_keys_str_mv | AT dingeljanis localconservationscoreswithoutaprioriassumptionsonneutralsubstitutionrates AT hanuspavol localconservationscoreswithoutaprioriassumptionsonneutralsubstitutionrates AT leonardiniccolo localconservationscoreswithoutaprioriassumptionsonneutralsubstitutionrates AT hagenauerjoachim localconservationscoreswithoutaprioriassumptionsonneutralsubstitutionrates AT zechjurgen localconservationscoreswithoutaprioriassumptionsonneutralsubstitutionrates AT muellerjakobc localconservationscoreswithoutaprioriassumptionsonneutralsubstitutionrates |