Cargando…

VarElect: the phenotype-based variation prioritizer of the GeneCards Suite

BACKGROUND: Next generation sequencing (NGS) provides a key technology for deciphering the genetic underpinnings of human diseases. Typical NGS analyses of a patient depict tens of thousands non-reference coding variants, but only one or very few are expected to be significant for the relevant disor...

Descripción completa

Detalles Bibliográficos
Autores principales: Stelzer, Gil, Plaschkes, Inbar, Oz-Levi, Danit, Alkelai, Anna, Olender, Tsviya, Zimmerman, Shahar, Twik, Michal, Belinky, Frida, Fishilevich, Simon, Nudel, Ron, Guan-Golan, Yaron, Warshawsky, David, Dahary, Dvir, Kohn, Asher, Mazor, Yaron, Kaplan, Sergey, Iny Stein, Tsippi, Baris, Hagit N., Rappaport, Noa, Safran, Marilyn, Lancet, Doron
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4928145/
https://www.ncbi.nlm.nih.gov/pubmed/27357693
http://dx.doi.org/10.1186/s12864-016-2722-2
_version_ 1782440387580788736
author Stelzer, Gil
Plaschkes, Inbar
Oz-Levi, Danit
Alkelai, Anna
Olender, Tsviya
Zimmerman, Shahar
Twik, Michal
Belinky, Frida
Fishilevich, Simon
Nudel, Ron
Guan-Golan, Yaron
Warshawsky, David
Dahary, Dvir
Kohn, Asher
Mazor, Yaron
Kaplan, Sergey
Iny Stein, Tsippi
Baris, Hagit N.
Rappaport, Noa
Safran, Marilyn
Lancet, Doron
author_facet Stelzer, Gil
Plaschkes, Inbar
Oz-Levi, Danit
Alkelai, Anna
Olender, Tsviya
Zimmerman, Shahar
Twik, Michal
Belinky, Frida
Fishilevich, Simon
Nudel, Ron
Guan-Golan, Yaron
Warshawsky, David
Dahary, Dvir
Kohn, Asher
Mazor, Yaron
Kaplan, Sergey
Iny Stein, Tsippi
Baris, Hagit N.
Rappaport, Noa
Safran, Marilyn
Lancet, Doron
author_sort Stelzer, Gil
collection PubMed
description BACKGROUND: Next generation sequencing (NGS) provides a key technology for deciphering the genetic underpinnings of human diseases. Typical NGS analyses of a patient depict tens of thousands non-reference coding variants, but only one or very few are expected to be significant for the relevant disorder. In a filtering stage, one employs family segregation, rarity in the population, predicted protein impact and evolutionary conservation as a means for shortening the variation list. However, narrowing down further towards culprit disease genes usually entails laborious seeking of gene-phenotype relationships, consulting numerous separate databases. Thus, a major challenge is to transition from the few hundred shortlisted genes to the most viable disease-causing candidates. RESULTS: We describe a novel tool, VarElect (http://ve.genecards.org), a comprehensive phenotype-dependent variant/gene prioritizer, based on the widely-used GeneCards, which helps rapidly identify causal mutations with extensive evidence. The GeneCards suite offers an effective and speedy alternative, whereby >120 gene-centric automatically-mined data sources are jointly available for the task. VarElect cashes on this wealth of information, as well as on GeneCards’ powerful free-text Boolean search and scoring capabilities, proficiently matching variant-containing genes to submitted disease/symptom keywords. The tool also leverages the rich disease and pathway information of MalaCards, the human disease database, and PathCards, the unified pathway (SuperPaths) database, both within the GeneCards Suite. The VarElect algorithm infers direct as well as indirect links between genes and phenotypes, the latter benefitting from GeneCards’ diverse gene-to-gene data links in GenesLikeMe. Finally, our tool offers an extensive gene-phenotype evidence portrayal (“MiniCards”) and hyperlinks to the parent databases. CONCLUSIONS: We demonstrate that VarElect compares favorably with several often-used NGS phenotyping tools, thus providing a robust facility for ranking genes, pointing out their likelihood to be related to a patient’s disease. VarElect’s capacity to automatically process numerous NGS cases, either in stand-alone format or in VCF-analyzer mode (TGex and VarAnnot), is indispensable for emerging clinical projects that involve thousands of whole exome/genome NGS analyses. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2722-2) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4928145
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-49281452016-06-30 VarElect: the phenotype-based variation prioritizer of the GeneCards Suite Stelzer, Gil Plaschkes, Inbar Oz-Levi, Danit Alkelai, Anna Olender, Tsviya Zimmerman, Shahar Twik, Michal Belinky, Frida Fishilevich, Simon Nudel, Ron Guan-Golan, Yaron Warshawsky, David Dahary, Dvir Kohn, Asher Mazor, Yaron Kaplan, Sergey Iny Stein, Tsippi Baris, Hagit N. Rappaport, Noa Safran, Marilyn Lancet, Doron BMC Genomics Methodology Article BACKGROUND: Next generation sequencing (NGS) provides a key technology for deciphering the genetic underpinnings of human diseases. Typical NGS analyses of a patient depict tens of thousands non-reference coding variants, but only one or very few are expected to be significant for the relevant disorder. In a filtering stage, one employs family segregation, rarity in the population, predicted protein impact and evolutionary conservation as a means for shortening the variation list. However, narrowing down further towards culprit disease genes usually entails laborious seeking of gene-phenotype relationships, consulting numerous separate databases. Thus, a major challenge is to transition from the few hundred shortlisted genes to the most viable disease-causing candidates. RESULTS: We describe a novel tool, VarElect (http://ve.genecards.org), a comprehensive phenotype-dependent variant/gene prioritizer, based on the widely-used GeneCards, which helps rapidly identify causal mutations with extensive evidence. The GeneCards suite offers an effective and speedy alternative, whereby >120 gene-centric automatically-mined data sources are jointly available for the task. VarElect cashes on this wealth of information, as well as on GeneCards’ powerful free-text Boolean search and scoring capabilities, proficiently matching variant-containing genes to submitted disease/symptom keywords. The tool also leverages the rich disease and pathway information of MalaCards, the human disease database, and PathCards, the unified pathway (SuperPaths) database, both within the GeneCards Suite. The VarElect algorithm infers direct as well as indirect links between genes and phenotypes, the latter benefitting from GeneCards’ diverse gene-to-gene data links in GenesLikeMe. Finally, our tool offers an extensive gene-phenotype evidence portrayal (“MiniCards”) and hyperlinks to the parent databases. CONCLUSIONS: We demonstrate that VarElect compares favorably with several often-used NGS phenotyping tools, thus providing a robust facility for ranking genes, pointing out their likelihood to be related to a patient’s disease. VarElect’s capacity to automatically process numerous NGS cases, either in stand-alone format or in VCF-analyzer mode (TGex and VarAnnot), is indispensable for emerging clinical projects that involve thousands of whole exome/genome NGS analyses. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2722-2) contains supplementary material, which is available to authorized users. BioMed Central 2016-06-23 /pmc/articles/PMC4928145/ /pubmed/27357693 http://dx.doi.org/10.1186/s12864-016-2722-2 Text en © Stelzer et al. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Stelzer, Gil
Plaschkes, Inbar
Oz-Levi, Danit
Alkelai, Anna
Olender, Tsviya
Zimmerman, Shahar
Twik, Michal
Belinky, Frida
Fishilevich, Simon
Nudel, Ron
Guan-Golan, Yaron
Warshawsky, David
Dahary, Dvir
Kohn, Asher
Mazor, Yaron
Kaplan, Sergey
Iny Stein, Tsippi
Baris, Hagit N.
Rappaport, Noa
Safran, Marilyn
Lancet, Doron
VarElect: the phenotype-based variation prioritizer of the GeneCards Suite
title VarElect: the phenotype-based variation prioritizer of the GeneCards Suite
title_full VarElect: the phenotype-based variation prioritizer of the GeneCards Suite
title_fullStr VarElect: the phenotype-based variation prioritizer of the GeneCards Suite
title_full_unstemmed VarElect: the phenotype-based variation prioritizer of the GeneCards Suite
title_short VarElect: the phenotype-based variation prioritizer of the GeneCards Suite
title_sort varelect: the phenotype-based variation prioritizer of the genecards suite
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4928145/
https://www.ncbi.nlm.nih.gov/pubmed/27357693
http://dx.doi.org/10.1186/s12864-016-2722-2
work_keys_str_mv AT stelzergil varelectthephenotypebasedvariationprioritizerofthegenecardssuite
AT plaschkesinbar varelectthephenotypebasedvariationprioritizerofthegenecardssuite
AT ozlevidanit varelectthephenotypebasedvariationprioritizerofthegenecardssuite
AT alkelaianna varelectthephenotypebasedvariationprioritizerofthegenecardssuite
AT olendertsviya varelectthephenotypebasedvariationprioritizerofthegenecardssuite
AT zimmermanshahar varelectthephenotypebasedvariationprioritizerofthegenecardssuite
AT twikmichal varelectthephenotypebasedvariationprioritizerofthegenecardssuite
AT belinkyfrida varelectthephenotypebasedvariationprioritizerofthegenecardssuite
AT fishilevichsimon varelectthephenotypebasedvariationprioritizerofthegenecardssuite
AT nudelron varelectthephenotypebasedvariationprioritizerofthegenecardssuite
AT guangolanyaron varelectthephenotypebasedvariationprioritizerofthegenecardssuite
AT warshawskydavid varelectthephenotypebasedvariationprioritizerofthegenecardssuite
AT daharydvir varelectthephenotypebasedvariationprioritizerofthegenecardssuite
AT kohnasher varelectthephenotypebasedvariationprioritizerofthegenecardssuite
AT mazoryaron varelectthephenotypebasedvariationprioritizerofthegenecardssuite
AT kaplansergey varelectthephenotypebasedvariationprioritizerofthegenecardssuite
AT inysteintsippi varelectthephenotypebasedvariationprioritizerofthegenecardssuite
AT barishagitn varelectthephenotypebasedvariationprioritizerofthegenecardssuite
AT rappaportnoa varelectthephenotypebasedvariationprioritizerofthegenecardssuite
AT safranmarilyn varelectthephenotypebasedvariationprioritizerofthegenecardssuite
AT lancetdoron varelectthephenotypebasedvariationprioritizerofthegenecardssuite