Cargando…

Xrare: a machine learning method jointly modeling phenotypes and genetic evidence for rare disease diagnosis

PURPOSE: Despite the successful progress next-generation sequencing technologies has achieved in diagnosing the genetic cause of rare Mendelian diseases, the current diagnostic rate is still far from satisfactory because of heterogeneity, imprecision, and noise in disease phenotype descriptions and...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Qigang, Zhao, Keyan, Bustamante, Carlos D., Ma, Xin, Wong, Wing H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group US 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6752318/
https://www.ncbi.nlm.nih.gov/pubmed/30675030
http://dx.doi.org/10.1038/s41436-019-0439-8
_version_ 1783452763000143872
author Li, Qigang
Zhao, Keyan
Bustamante, Carlos D.
Ma, Xin
Wong, Wing H.
author_facet Li, Qigang
Zhao, Keyan
Bustamante, Carlos D.
Ma, Xin
Wong, Wing H.
author_sort Li, Qigang
collection PubMed
description PURPOSE: Despite the successful progress next-generation sequencing technologies has achieved in diagnosing the genetic cause of rare Mendelian diseases, the current diagnostic rate is still far from satisfactory because of heterogeneity, imprecision, and noise in disease phenotype descriptions and insufficient utilization of expert knowledge in clinical genetics. To overcome these difficulties, we present a novel method called Xrare for the prioritization of causative gene variants in rare disease diagnosis. METHODS: We propose a new phenotype similarity scoring method called Emission-Reception Information Content (ERIC), which is highly tolerant of noise and imprecision in clinical phenotypes. We utilize medical genetic domain knowledge by designing genetic features implementing American College of Medical Genetics and Genomics (ACMG) guidelines. RESULTS: ERIC score ranked consistently higher for disease genes than other phenotypic similarity scores in the presence of imprecise and noisy phenotypes. Extensive simulations and real clinical data demonstrated that Xrare outperforms existing alternative methods by 10–40% at various genetic diagnosis scenarios. CONCLUSION: The Xrare model is learned from a large database of clinical variants, and derives its strength from the tight integration of medical genetics features and phenotypic features similarity scores. Xrare provides the clinical community with a robust and powerful tool for variant prioritization.
format Online
Article
Text
id pubmed-6752318
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Nature Publishing Group US
record_format MEDLINE/PubMed
spelling pubmed-67523182019-09-23 Xrare: a machine learning method jointly modeling phenotypes and genetic evidence for rare disease diagnosis Li, Qigang Zhao, Keyan Bustamante, Carlos D. Ma, Xin Wong, Wing H. Genet Med Article PURPOSE: Despite the successful progress next-generation sequencing technologies has achieved in diagnosing the genetic cause of rare Mendelian diseases, the current diagnostic rate is still far from satisfactory because of heterogeneity, imprecision, and noise in disease phenotype descriptions and insufficient utilization of expert knowledge in clinical genetics. To overcome these difficulties, we present a novel method called Xrare for the prioritization of causative gene variants in rare disease diagnosis. METHODS: We propose a new phenotype similarity scoring method called Emission-Reception Information Content (ERIC), which is highly tolerant of noise and imprecision in clinical phenotypes. We utilize medical genetic domain knowledge by designing genetic features implementing American College of Medical Genetics and Genomics (ACMG) guidelines. RESULTS: ERIC score ranked consistently higher for disease genes than other phenotypic similarity scores in the presence of imprecise and noisy phenotypes. Extensive simulations and real clinical data demonstrated that Xrare outperforms existing alternative methods by 10–40% at various genetic diagnosis scenarios. CONCLUSION: The Xrare model is learned from a large database of clinical variants, and derives its strength from the tight integration of medical genetics features and phenotypic features similarity scores. Xrare provides the clinical community with a robust and powerful tool for variant prioritization. Nature Publishing Group US 2019-01-24 2019 /pmc/articles/PMC6752318/ /pubmed/30675030 http://dx.doi.org/10.1038/s41436-019-0439-8 Text en © The Author(s) 2019 Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, and provide a link to the Creative Commons license. You do not have permission under this license to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
spellingShingle Article
Li, Qigang
Zhao, Keyan
Bustamante, Carlos D.
Ma, Xin
Wong, Wing H.
Xrare: a machine learning method jointly modeling phenotypes and genetic evidence for rare disease diagnosis
title Xrare: a machine learning method jointly modeling phenotypes and genetic evidence for rare disease diagnosis
title_full Xrare: a machine learning method jointly modeling phenotypes and genetic evidence for rare disease diagnosis
title_fullStr Xrare: a machine learning method jointly modeling phenotypes and genetic evidence for rare disease diagnosis
title_full_unstemmed Xrare: a machine learning method jointly modeling phenotypes and genetic evidence for rare disease diagnosis
title_short Xrare: a machine learning method jointly modeling phenotypes and genetic evidence for rare disease diagnosis
title_sort xrare: a machine learning method jointly modeling phenotypes and genetic evidence for rare disease diagnosis
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6752318/
https://www.ncbi.nlm.nih.gov/pubmed/30675030
http://dx.doi.org/10.1038/s41436-019-0439-8
work_keys_str_mv AT liqigang xrareamachinelearningmethodjointlymodelingphenotypesandgeneticevidenceforrarediseasediagnosis
AT zhaokeyan xrareamachinelearningmethodjointlymodelingphenotypesandgeneticevidenceforrarediseasediagnosis
AT bustamantecarlosd xrareamachinelearningmethodjointlymodelingphenotypesandgeneticevidenceforrarediseasediagnosis
AT maxin xrareamachinelearningmethodjointlymodelingphenotypesandgeneticevidenceforrarediseasediagnosis
AT wongwingh xrareamachinelearningmethodjointlymodelingphenotypesandgeneticevidenceforrarediseasediagnosis