Cargando…

Annotation of Human Exome Gene Variants with Consensus Pathogenicity

A novel approach is developed to address the challenge of annotating with phenotypic effects those exome variants for which relevant empirical data are lacking or minimal. The predictive annotation method is implemented as a stacked ensemble of supervised base-learners, including distributed random...

Descripción completa

Detalles Bibliográficos
Autores principales: Jaravine, Victor, Balmford, James, Metzger, Patrick, Boerries, Melanie, Binder, Harald, Boeker, Martin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7563776/
https://www.ncbi.nlm.nih.gov/pubmed/32938008
http://dx.doi.org/10.3390/genes11091076
_version_ 1783595564069289984
author Jaravine, Victor
Balmford, James
Metzger, Patrick
Boerries, Melanie
Binder, Harald
Boeker, Martin
author_facet Jaravine, Victor
Balmford, James
Metzger, Patrick
Boerries, Melanie
Binder, Harald
Boeker, Martin
author_sort Jaravine, Victor
collection PubMed
description A novel approach is developed to address the challenge of annotating with phenotypic effects those exome variants for which relevant empirical data are lacking or minimal. The predictive annotation method is implemented as a stacked ensemble of supervised base-learners, including distributed random forest and gradient boosting machines. Ensemble models were trained and cross-validated on evidence-based categorical variant effect annotations from the ClinVar database, and were applied to 84 million non-synonymous single nucleotide variants (SNVs). The consensus model combined 39 functional mutation impacts, cross-species conservation score, and gene indispensability score. The indispensability score, accounting for differences in variant pathogenicities including in essential and mutation-tolerant genes, considerably improved the predictions. The consensus combination is consistent with as many input scores as possible while minimizing false predictions. The input scores are ranked based on their ability to predict effects. The score rankings and categorical phenotypic variant effect predictions are aimed for direct use in clinical and biological applications to prioritize human exome variants and mutations.
format Online
Article
Text
id pubmed-7563776
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-75637762020-10-27 Annotation of Human Exome Gene Variants with Consensus Pathogenicity Jaravine, Victor Balmford, James Metzger, Patrick Boerries, Melanie Binder, Harald Boeker, Martin Genes (Basel) Article A novel approach is developed to address the challenge of annotating with phenotypic effects those exome variants for which relevant empirical data are lacking or minimal. The predictive annotation method is implemented as a stacked ensemble of supervised base-learners, including distributed random forest and gradient boosting machines. Ensemble models were trained and cross-validated on evidence-based categorical variant effect annotations from the ClinVar database, and were applied to 84 million non-synonymous single nucleotide variants (SNVs). The consensus model combined 39 functional mutation impacts, cross-species conservation score, and gene indispensability score. The indispensability score, accounting for differences in variant pathogenicities including in essential and mutation-tolerant genes, considerably improved the predictions. The consensus combination is consistent with as many input scores as possible while minimizing false predictions. The input scores are ranked based on their ability to predict effects. The score rankings and categorical phenotypic variant effect predictions are aimed for direct use in clinical and biological applications to prioritize human exome variants and mutations. MDPI 2020-09-14 /pmc/articles/PMC7563776/ /pubmed/32938008 http://dx.doi.org/10.3390/genes11091076 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Jaravine, Victor
Balmford, James
Metzger, Patrick
Boerries, Melanie
Binder, Harald
Boeker, Martin
Annotation of Human Exome Gene Variants with Consensus Pathogenicity
title Annotation of Human Exome Gene Variants with Consensus Pathogenicity
title_full Annotation of Human Exome Gene Variants with Consensus Pathogenicity
title_fullStr Annotation of Human Exome Gene Variants with Consensus Pathogenicity
title_full_unstemmed Annotation of Human Exome Gene Variants with Consensus Pathogenicity
title_short Annotation of Human Exome Gene Variants with Consensus Pathogenicity
title_sort annotation of human exome gene variants with consensus pathogenicity
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7563776/
https://www.ncbi.nlm.nih.gov/pubmed/32938008
http://dx.doi.org/10.3390/genes11091076
work_keys_str_mv AT jaravinevictor annotationofhumanexomegenevariantswithconsensuspathogenicity
AT balmfordjames annotationofhumanexomegenevariantswithconsensuspathogenicity
AT metzgerpatrick annotationofhumanexomegenevariantswithconsensuspathogenicity
AT boerriesmelanie annotationofhumanexomegenevariantswithconsensuspathogenicity
AT binderharald annotationofhumanexomegenevariantswithconsensuspathogenicity
AT boekermartin annotationofhumanexomegenevariantswithconsensuspathogenicity