Cargando…

Predicting site-specific human selective pressure using evolutionary signatures

Motivation: The identification of non-coding functional regions of the human genome remains one of the main challenges of genomics. By observing how a given region evolved over time, one can detect signs of negative or positive selection hinting that the region may be functional. With the quickly in...

Descripción completa

Detalles Bibliográficos
Autores principales: Sadri, Javad, Diallo, Abdoulaye Banire, Blanchette, Mathieu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3117352/
https://www.ncbi.nlm.nih.gov/pubmed/21685080
http://dx.doi.org/10.1093/bioinformatics/btr241
_version_ 1782206320757178368
author Sadri, Javad
Diallo, Abdoulaye Banire
Blanchette, Mathieu
author_facet Sadri, Javad
Diallo, Abdoulaye Banire
Blanchette, Mathieu
author_sort Sadri, Javad
collection PubMed
description Motivation: The identification of non-coding functional regions of the human genome remains one of the main challenges of genomics. By observing how a given region evolved over time, one can detect signs of negative or positive selection hinting that the region may be functional. With the quickly increasing number of vertebrate genomes to compare with our own, this type of approach is set to become extremely powerful, provided the right analytical tools are available. Results: A large number of approaches have been proposed to measure signs of past selective pressure, usually in the form of reduced mutation rate. Here, we propose a radically different approach to the detection of non-coding functional region: instead of measuring past evolutionary rates, we build a machine learning classifier to predict current substitution rates in human based on the inferred evolutionary events that affected the region during vertebrate evolution. We show that different types of evolutionary events, occurring along different branches of the phylogenetic tree, bring very different amounts of information. We propose a number of simple machine learning classifiers and show that a Support-Vector Machine (SVM) predictor clearly outperforms existing tools at predicting human non-coding functional sites. Comparison to external evidences of selection and regulatory function confirms that these SVM predictions are more accurate than those of other approaches. Availability: The predictor and predictions made are available at http://www.mcb.mcgill.ca/~blanchem/sadri. Contact: blanchem@mcb.mcgill.ca
format Online
Article
Text
id pubmed-3117352
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-31173522011-06-17 Predicting site-specific human selective pressure using evolutionary signatures Sadri, Javad Diallo, Abdoulaye Banire Blanchette, Mathieu Bioinformatics Ismb/Eccb 2011 Proceedings Papers Committee July 17 to July 19, 2011, Vienna, Austria Motivation: The identification of non-coding functional regions of the human genome remains one of the main challenges of genomics. By observing how a given region evolved over time, one can detect signs of negative or positive selection hinting that the region may be functional. With the quickly increasing number of vertebrate genomes to compare with our own, this type of approach is set to become extremely powerful, provided the right analytical tools are available. Results: A large number of approaches have been proposed to measure signs of past selective pressure, usually in the form of reduced mutation rate. Here, we propose a radically different approach to the detection of non-coding functional region: instead of measuring past evolutionary rates, we build a machine learning classifier to predict current substitution rates in human based on the inferred evolutionary events that affected the region during vertebrate evolution. We show that different types of evolutionary events, occurring along different branches of the phylogenetic tree, bring very different amounts of information. We propose a number of simple machine learning classifiers and show that a Support-Vector Machine (SVM) predictor clearly outperforms existing tools at predicting human non-coding functional sites. Comparison to external evidences of selection and regulatory function confirms that these SVM predictions are more accurate than those of other approaches. Availability: The predictor and predictions made are available at http://www.mcb.mcgill.ca/~blanchem/sadri. Contact: blanchem@mcb.mcgill.ca Oxford University Press 2011-07-01 2011-06-14 /pmc/articles/PMC3117352/ /pubmed/21685080 http://dx.doi.org/10.1093/bioinformatics/btr241 Text en © The Author(s) 2011. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.5 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Ismb/Eccb 2011 Proceedings Papers Committee July 17 to July 19, 2011, Vienna, Austria
Sadri, Javad
Diallo, Abdoulaye Banire
Blanchette, Mathieu
Predicting site-specific human selective pressure using evolutionary signatures
title Predicting site-specific human selective pressure using evolutionary signatures
title_full Predicting site-specific human selective pressure using evolutionary signatures
title_fullStr Predicting site-specific human selective pressure using evolutionary signatures
title_full_unstemmed Predicting site-specific human selective pressure using evolutionary signatures
title_short Predicting site-specific human selective pressure using evolutionary signatures
title_sort predicting site-specific human selective pressure using evolutionary signatures
topic Ismb/Eccb 2011 Proceedings Papers Committee July 17 to July 19, 2011, Vienna, Austria
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3117352/
https://www.ncbi.nlm.nih.gov/pubmed/21685080
http://dx.doi.org/10.1093/bioinformatics/btr241
work_keys_str_mv AT sadrijavad predictingsitespecifichumanselectivepressureusingevolutionarysignatures
AT dialloabdoulayebanire predictingsitespecifichumanselectivepressureusingevolutionarysignatures
AT blanchettemathieu predictingsitespecifichumanselectivepressureusingevolutionarysignatures