Cargando…

SmartPhase: Accurate and fast phasing of heterozygous variant pairs for genetic diagnosis of rare diseases

There is an increasing need to use genome and transcriptome sequencing to genetically diagnose patients suffering from suspected monogenic rare diseases. The proper detection of compound heterozygous variant combinations as disease-causing candidates is a challenge in diagnostic workflows as haploty...

Descripción completa

Detalles Bibliográficos
Autores principales: Hager, Paul, Mewes, Hans-Werner, Rohlfs, Meino, Klein, Christoph, Jeske, Tim
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7032729/
https://www.ncbi.nlm.nih.gov/pubmed/32032351
http://dx.doi.org/10.1371/journal.pcbi.1007613
_version_ 1783499556971872256
author Hager, Paul
Mewes, Hans-Werner
Rohlfs, Meino
Klein, Christoph
Jeske, Tim
author_facet Hager, Paul
Mewes, Hans-Werner
Rohlfs, Meino
Klein, Christoph
Jeske, Tim
author_sort Hager, Paul
collection PubMed
description There is an increasing need to use genome and transcriptome sequencing to genetically diagnose patients suffering from suspected monogenic rare diseases. The proper detection of compound heterozygous variant combinations as disease-causing candidates is a challenge in diagnostic workflows as haplotype information is lost by currently used next-generation sequencing technologies. Consequently, computational tools are required to phase, or resolve the haplotype of, the high number of heterozygous variants in the exome or genome of each patient. Here we present SmartPhase, a phasing tool designed to efficiently reduce the set of potential compound heterozygous variant pairs in genetic diagnoses pipelines. The phasing algorithm of SmartPhase creates haplotypes using both parental genotype information and reads generated by DNA or RNA sequencing and is thus well suited to resolve the phase of rare variants. To inform the user about the reliability of a phasing prediction, it computes a confidence score which is essential to select error-free predictions. It incorporates existing haplotype information and applies logical rules to determine variants that can be excluded as causing a recessive, monogenic disease. SmartPhase can phase either all possible variant pairs in predefined genetic loci or preselected variant pairs of interest, thus keeping the focus on clinically relevant results. We compared SmartPhase to WhatsHap, one of the leading comparable phasing tools, using simulated data and a real clinical cohort of 921 patients. On both data sets, SmartPhase generated error-free predictions using our derived confidence score threshold. It outperformed WhatsHap with regard to the percentage of resolved pairs when parental genotype information is available. On the cohort data, SmartPhase enabled on average the exclusion of approximately 22% of the input variant pairs in each singleton patient and 44% in each trio patient. SmartPhase is implemented as an open-source Java tool and freely available at http://ibis.helmholtz-muenchen.de/smartphase/.
format Online
Article
Text
id pubmed-7032729
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-70327292020-02-28 SmartPhase: Accurate and fast phasing of heterozygous variant pairs for genetic diagnosis of rare diseases Hager, Paul Mewes, Hans-Werner Rohlfs, Meino Klein, Christoph Jeske, Tim PLoS Comput Biol Research Article There is an increasing need to use genome and transcriptome sequencing to genetically diagnose patients suffering from suspected monogenic rare diseases. The proper detection of compound heterozygous variant combinations as disease-causing candidates is a challenge in diagnostic workflows as haplotype information is lost by currently used next-generation sequencing technologies. Consequently, computational tools are required to phase, or resolve the haplotype of, the high number of heterozygous variants in the exome or genome of each patient. Here we present SmartPhase, a phasing tool designed to efficiently reduce the set of potential compound heterozygous variant pairs in genetic diagnoses pipelines. The phasing algorithm of SmartPhase creates haplotypes using both parental genotype information and reads generated by DNA or RNA sequencing and is thus well suited to resolve the phase of rare variants. To inform the user about the reliability of a phasing prediction, it computes a confidence score which is essential to select error-free predictions. It incorporates existing haplotype information and applies logical rules to determine variants that can be excluded as causing a recessive, monogenic disease. SmartPhase can phase either all possible variant pairs in predefined genetic loci or preselected variant pairs of interest, thus keeping the focus on clinically relevant results. We compared SmartPhase to WhatsHap, one of the leading comparable phasing tools, using simulated data and a real clinical cohort of 921 patients. On both data sets, SmartPhase generated error-free predictions using our derived confidence score threshold. It outperformed WhatsHap with regard to the percentage of resolved pairs when parental genotype information is available. On the cohort data, SmartPhase enabled on average the exclusion of approximately 22% of the input variant pairs in each singleton patient and 44% in each trio patient. SmartPhase is implemented as an open-source Java tool and freely available at http://ibis.helmholtz-muenchen.de/smartphase/. Public Library of Science 2020-02-07 /pmc/articles/PMC7032729/ /pubmed/32032351 http://dx.doi.org/10.1371/journal.pcbi.1007613 Text en © 2020 Hager et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Hager, Paul
Mewes, Hans-Werner
Rohlfs, Meino
Klein, Christoph
Jeske, Tim
SmartPhase: Accurate and fast phasing of heterozygous variant pairs for genetic diagnosis of rare diseases
title SmartPhase: Accurate and fast phasing of heterozygous variant pairs for genetic diagnosis of rare diseases
title_full SmartPhase: Accurate and fast phasing of heterozygous variant pairs for genetic diagnosis of rare diseases
title_fullStr SmartPhase: Accurate and fast phasing of heterozygous variant pairs for genetic diagnosis of rare diseases
title_full_unstemmed SmartPhase: Accurate and fast phasing of heterozygous variant pairs for genetic diagnosis of rare diseases
title_short SmartPhase: Accurate and fast phasing of heterozygous variant pairs for genetic diagnosis of rare diseases
title_sort smartphase: accurate and fast phasing of heterozygous variant pairs for genetic diagnosis of rare diseases
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7032729/
https://www.ncbi.nlm.nih.gov/pubmed/32032351
http://dx.doi.org/10.1371/journal.pcbi.1007613
work_keys_str_mv AT hagerpaul smartphaseaccurateandfastphasingofheterozygousvariantpairsforgeneticdiagnosisofrarediseases
AT meweshanswerner smartphaseaccurateandfastphasingofheterozygousvariantpairsforgeneticdiagnosisofrarediseases
AT rohlfsmeino smartphaseaccurateandfastphasingofheterozygousvariantpairsforgeneticdiagnosisofrarediseases
AT kleinchristoph smartphaseaccurateandfastphasingofheterozygousvariantpairsforgeneticdiagnosisofrarediseases
AT jesketim smartphaseaccurateandfastphasingofheterozygousvariantpairsforgeneticdiagnosisofrarediseases