Cargando…

Local conservation scores without a priori assumptions on neutral substitution rates

BACKGROUND: Comparative genomics aims to detect signals of evolutionary conservation as an indicator of functional constraint. Surprisingly, results of the ENCODE project revealed that about half of the experimentally verified functional elements found in non-coding DNA were classified as unconstrai...

Descripción completa

Detalles Bibliográficos
Autores principales: Dingel, Janis, Hanus, Pavol, Leonardi, Niccolò, Hagenauer, Joachim, Zech, Jürgen, Mueller, Jakob C
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2375903/
https://www.ncbi.nlm.nih.gov/pubmed/18405366
http://dx.doi.org/10.1186/1471-2105-9-190
_version_ 1782154671353233408
author Dingel, Janis
Hanus, Pavol
Leonardi, Niccolò
Hagenauer, Joachim
Zech, Jürgen
Mueller, Jakob C
author_facet Dingel, Janis
Hanus, Pavol
Leonardi, Niccolò
Hagenauer, Joachim
Zech, Jürgen
Mueller, Jakob C
author_sort Dingel, Janis
collection PubMed
description BACKGROUND: Comparative genomics aims to detect signals of evolutionary conservation as an indicator of functional constraint. Surprisingly, results of the ENCODE project revealed that about half of the experimentally verified functional elements found in non-coding DNA were classified as unconstrained by computational predictions. Following this observation, it has been hypothesized that this may be partly explained by biased estimates on neutral evolutionary rates used by existing sequence conservation metrics. All methods we are aware of rely on a comparison with the neutral rate and conservation is estimated by measuring the deviation of a particular genomic region from this rate. Consequently, it is a reasonable assumption that inaccurate neutral rate estimates may lead to biased conservation and constraint estimates. RESULTS: We propose a conservation signal that is produced by local Maximum Likelihood estimation of evolutionary parameters using an optimized sliding window and present a Kullback-Leibler projection that allows multiple different estimated parameters to be transformed into a conservation measure. This conservation measure does not rely on assumptions about neutral evolutionary substitution rates and little a priori assumptions on the properties of the conserved regions are imposed. We show the accuracy of our approach (KuLCons) on synthetic data and compare it to the scores generated by state-of-the-art methods (phastCons, GERP, SCONE) in an ENCODE region. We find that KuLCons is most often in agreement with the conservation/constraint signatures detected by GERP and SCONE while qualitatively very different patterns from phastCons are observed. Opposed to standard methods KuLCons can be extended to more complex evolutionary models, e.g. taking insertion and deletion events into account and corresponding results show that scores obtained under this model can diverge significantly from scores using the simpler model. CONCLUSION: Our results suggest that discriminating among the different degrees of conservation is possible without making assumptions about neutral rates. We find, however, that it cannot be expected to discover considerably different constraint regions than GERP and SCONE. Consequently, we conclude that the reported discrepancies between experimentally verified functional and computationally identified constraint elements are likely not to be explained by biased neutral rate estimates.
format Text
id pubmed-2375903
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-23759032008-05-12 Local conservation scores without a priori assumptions on neutral substitution rates Dingel, Janis Hanus, Pavol Leonardi, Niccolò Hagenauer, Joachim Zech, Jürgen Mueller, Jakob C BMC Bioinformatics Research Article BACKGROUND: Comparative genomics aims to detect signals of evolutionary conservation as an indicator of functional constraint. Surprisingly, results of the ENCODE project revealed that about half of the experimentally verified functional elements found in non-coding DNA were classified as unconstrained by computational predictions. Following this observation, it has been hypothesized that this may be partly explained by biased estimates on neutral evolutionary rates used by existing sequence conservation metrics. All methods we are aware of rely on a comparison with the neutral rate and conservation is estimated by measuring the deviation of a particular genomic region from this rate. Consequently, it is a reasonable assumption that inaccurate neutral rate estimates may lead to biased conservation and constraint estimates. RESULTS: We propose a conservation signal that is produced by local Maximum Likelihood estimation of evolutionary parameters using an optimized sliding window and present a Kullback-Leibler projection that allows multiple different estimated parameters to be transformed into a conservation measure. This conservation measure does not rely on assumptions about neutral evolutionary substitution rates and little a priori assumptions on the properties of the conserved regions are imposed. We show the accuracy of our approach (KuLCons) on synthetic data and compare it to the scores generated by state-of-the-art methods (phastCons, GERP, SCONE) in an ENCODE region. We find that KuLCons is most often in agreement with the conservation/constraint signatures detected by GERP and SCONE while qualitatively very different patterns from phastCons are observed. Opposed to standard methods KuLCons can be extended to more complex evolutionary models, e.g. taking insertion and deletion events into account and corresponding results show that scores obtained under this model can diverge significantly from scores using the simpler model. CONCLUSION: Our results suggest that discriminating among the different degrees of conservation is possible without making assumptions about neutral rates. We find, however, that it cannot be expected to discover considerably different constraint regions than GERP and SCONE. Consequently, we conclude that the reported discrepancies between experimentally verified functional and computationally identified constraint elements are likely not to be explained by biased neutral rate estimates. BioMed Central 2008-04-11 /pmc/articles/PMC2375903/ /pubmed/18405366 http://dx.doi.org/10.1186/1471-2105-9-190 Text en Copyright © 2008 Dingel et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Dingel, Janis
Hanus, Pavol
Leonardi, Niccolò
Hagenauer, Joachim
Zech, Jürgen
Mueller, Jakob C
Local conservation scores without a priori assumptions on neutral substitution rates
title Local conservation scores without a priori assumptions on neutral substitution rates
title_full Local conservation scores without a priori assumptions on neutral substitution rates
title_fullStr Local conservation scores without a priori assumptions on neutral substitution rates
title_full_unstemmed Local conservation scores without a priori assumptions on neutral substitution rates
title_short Local conservation scores without a priori assumptions on neutral substitution rates
title_sort local conservation scores without a priori assumptions on neutral substitution rates
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2375903/
https://www.ncbi.nlm.nih.gov/pubmed/18405366
http://dx.doi.org/10.1186/1471-2105-9-190
work_keys_str_mv AT dingeljanis localconservationscoreswithoutaprioriassumptionsonneutralsubstitutionrates
AT hanuspavol localconservationscoreswithoutaprioriassumptionsonneutralsubstitutionrates
AT leonardiniccolo localconservationscoreswithoutaprioriassumptionsonneutralsubstitutionrates
AT hagenauerjoachim localconservationscoreswithoutaprioriassumptionsonneutralsubstitutionrates
AT zechjurgen localconservationscoreswithoutaprioriassumptionsonneutralsubstitutionrates
AT muellerjakobc localconservationscoreswithoutaprioriassumptionsonneutralsubstitutionrates