Cargando…

HIPred: an integrative approach to predicting haploinsufficient genes

MOTIVATION: A major cause of autosomal dominant disease is haploinsufficiency, whereby a single copy of a gene is not sufficient to maintain the normal function of the gene. A large proportion of existing methods for predicting haploinsufficiency incorporate biological networks, e.g. protein-protein...

Descripción completa

Detalles Bibliográficos
Autores principales: Shihab, Hashem A, Rogers, Mark F, Campbell, Colin, Gaunt, Tom R
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5581952/
https://www.ncbi.nlm.nih.gov/pubmed/28137713
http://dx.doi.org/10.1093/bioinformatics/btx028
_version_ 1783261120510820352
author Shihab, Hashem A
Rogers, Mark F
Campbell, Colin
Gaunt, Tom R
author_facet Shihab, Hashem A
Rogers, Mark F
Campbell, Colin
Gaunt, Tom R
author_sort Shihab, Hashem A
collection PubMed
description MOTIVATION: A major cause of autosomal dominant disease is haploinsufficiency, whereby a single copy of a gene is not sufficient to maintain the normal function of the gene. A large proportion of existing methods for predicting haploinsufficiency incorporate biological networks, e.g. protein-protein interaction networks that have recently been shown to introduce study bias. As a result, these methods tend to perform best on well-studied genes, but underperform on less studied genes. The advent of large genome sequencing consortia, such as the 1000 genomes project, NHLBI Exome Sequencing Project and the Exome Aggregation Consortium creates an urgent need for unbiased haploinsufficiency prediction methods. RESULTS: Here, we describe a machine learning approach, called HIPred, that integrates genomic and evolutionary information from ENSEMBL, with functional annotations from the Encyclopaedia of DNA Elements consortium and the NIH Roadmap Epigenomics Project to predict haploinsufficiency, without the study bias described earlier. We benchmark HIPred using several datasets and show that our unbiased method performs as well as, and in most cases, outperforms existing biased algorithms. AVAILABILITY AND IMPLEMENTATION: HIPred scores for all gene identifiers are available at: https://github.com/HAShihab/HIPred. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-5581952
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-55819522017-09-03 HIPred: an integrative approach to predicting haploinsufficient genes Shihab, Hashem A Rogers, Mark F Campbell, Colin Gaunt, Tom R Bioinformatics Original Papers MOTIVATION: A major cause of autosomal dominant disease is haploinsufficiency, whereby a single copy of a gene is not sufficient to maintain the normal function of the gene. A large proportion of existing methods for predicting haploinsufficiency incorporate biological networks, e.g. protein-protein interaction networks that have recently been shown to introduce study bias. As a result, these methods tend to perform best on well-studied genes, but underperform on less studied genes. The advent of large genome sequencing consortia, such as the 1000 genomes project, NHLBI Exome Sequencing Project and the Exome Aggregation Consortium creates an urgent need for unbiased haploinsufficiency prediction methods. RESULTS: Here, we describe a machine learning approach, called HIPred, that integrates genomic and evolutionary information from ENSEMBL, with functional annotations from the Encyclopaedia of DNA Elements consortium and the NIH Roadmap Epigenomics Project to predict haploinsufficiency, without the study bias described earlier. We benchmark HIPred using several datasets and show that our unbiased method performs as well as, and in most cases, outperforms existing biased algorithms. AVAILABILITY AND IMPLEMENTATION: HIPred scores for all gene identifiers are available at: https://github.com/HAShihab/HIPred. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2017-06-15 2017-01-30 /pmc/articles/PMC5581952/ /pubmed/28137713 http://dx.doi.org/10.1093/bioinformatics/btx028 Text en © The Author 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Shihab, Hashem A
Rogers, Mark F
Campbell, Colin
Gaunt, Tom R
HIPred: an integrative approach to predicting haploinsufficient genes
title HIPred: an integrative approach to predicting haploinsufficient genes
title_full HIPred: an integrative approach to predicting haploinsufficient genes
title_fullStr HIPred: an integrative approach to predicting haploinsufficient genes
title_full_unstemmed HIPred: an integrative approach to predicting haploinsufficient genes
title_short HIPred: an integrative approach to predicting haploinsufficient genes
title_sort hipred: an integrative approach to predicting haploinsufficient genes
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5581952/
https://www.ncbi.nlm.nih.gov/pubmed/28137713
http://dx.doi.org/10.1093/bioinformatics/btx028
work_keys_str_mv AT shihabhashema hipredanintegrativeapproachtopredictinghaploinsufficientgenes
AT rogersmarkf hipredanintegrativeapproachtopredictinghaploinsufficientgenes
AT campbellcolin hipredanintegrativeapproachtopredictinghaploinsufficientgenes
AT gaunttomr hipredanintegrativeapproachtopredictinghaploinsufficientgenes