Cargando…
Xrare: a machine learning method jointly modeling phenotypes and genetic evidence for rare disease diagnosis
PURPOSE: Despite the successful progress next-generation sequencing technologies has achieved in diagnosing the genetic cause of rare Mendelian diseases, the current diagnostic rate is still far from satisfactory because of heterogeneity, imprecision, and noise in disease phenotype descriptions and...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group US
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6752318/ https://www.ncbi.nlm.nih.gov/pubmed/30675030 http://dx.doi.org/10.1038/s41436-019-0439-8 |
_version_ | 1783452763000143872 |
---|---|
author | Li, Qigang Zhao, Keyan Bustamante, Carlos D. Ma, Xin Wong, Wing H. |
author_facet | Li, Qigang Zhao, Keyan Bustamante, Carlos D. Ma, Xin Wong, Wing H. |
author_sort | Li, Qigang |
collection | PubMed |
description | PURPOSE: Despite the successful progress next-generation sequencing technologies has achieved in diagnosing the genetic cause of rare Mendelian diseases, the current diagnostic rate is still far from satisfactory because of heterogeneity, imprecision, and noise in disease phenotype descriptions and insufficient utilization of expert knowledge in clinical genetics. To overcome these difficulties, we present a novel method called Xrare for the prioritization of causative gene variants in rare disease diagnosis. METHODS: We propose a new phenotype similarity scoring method called Emission-Reception Information Content (ERIC), which is highly tolerant of noise and imprecision in clinical phenotypes. We utilize medical genetic domain knowledge by designing genetic features implementing American College of Medical Genetics and Genomics (ACMG) guidelines. RESULTS: ERIC score ranked consistently higher for disease genes than other phenotypic similarity scores in the presence of imprecise and noisy phenotypes. Extensive simulations and real clinical data demonstrated that Xrare outperforms existing alternative methods by 10–40% at various genetic diagnosis scenarios. CONCLUSION: The Xrare model is learned from a large database of clinical variants, and derives its strength from the tight integration of medical genetics features and phenotypic features similarity scores. Xrare provides the clinical community with a robust and powerful tool for variant prioritization. |
format | Online Article Text |
id | pubmed-6752318 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Nature Publishing Group US |
record_format | MEDLINE/PubMed |
spelling | pubmed-67523182019-09-23 Xrare: a machine learning method jointly modeling phenotypes and genetic evidence for rare disease diagnosis Li, Qigang Zhao, Keyan Bustamante, Carlos D. Ma, Xin Wong, Wing H. Genet Med Article PURPOSE: Despite the successful progress next-generation sequencing technologies has achieved in diagnosing the genetic cause of rare Mendelian diseases, the current diagnostic rate is still far from satisfactory because of heterogeneity, imprecision, and noise in disease phenotype descriptions and insufficient utilization of expert knowledge in clinical genetics. To overcome these difficulties, we present a novel method called Xrare for the prioritization of causative gene variants in rare disease diagnosis. METHODS: We propose a new phenotype similarity scoring method called Emission-Reception Information Content (ERIC), which is highly tolerant of noise and imprecision in clinical phenotypes. We utilize medical genetic domain knowledge by designing genetic features implementing American College of Medical Genetics and Genomics (ACMG) guidelines. RESULTS: ERIC score ranked consistently higher for disease genes than other phenotypic similarity scores in the presence of imprecise and noisy phenotypes. Extensive simulations and real clinical data demonstrated that Xrare outperforms existing alternative methods by 10–40% at various genetic diagnosis scenarios. CONCLUSION: The Xrare model is learned from a large database of clinical variants, and derives its strength from the tight integration of medical genetics features and phenotypic features similarity scores. Xrare provides the clinical community with a robust and powerful tool for variant prioritization. Nature Publishing Group US 2019-01-24 2019 /pmc/articles/PMC6752318/ /pubmed/30675030 http://dx.doi.org/10.1038/s41436-019-0439-8 Text en © The Author(s) 2019 Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, and provide a link to the Creative Commons license. You do not have permission under this license to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-nd/4.0/. |
spellingShingle | Article Li, Qigang Zhao, Keyan Bustamante, Carlos D. Ma, Xin Wong, Wing H. Xrare: a machine learning method jointly modeling phenotypes and genetic evidence for rare disease diagnosis |
title | Xrare: a machine learning method jointly modeling phenotypes and genetic evidence for rare disease diagnosis |
title_full | Xrare: a machine learning method jointly modeling phenotypes and genetic evidence for rare disease diagnosis |
title_fullStr | Xrare: a machine learning method jointly modeling phenotypes and genetic evidence for rare disease diagnosis |
title_full_unstemmed | Xrare: a machine learning method jointly modeling phenotypes and genetic evidence for rare disease diagnosis |
title_short | Xrare: a machine learning method jointly modeling phenotypes and genetic evidence for rare disease diagnosis |
title_sort | xrare: a machine learning method jointly modeling phenotypes and genetic evidence for rare disease diagnosis |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6752318/ https://www.ncbi.nlm.nih.gov/pubmed/30675030 http://dx.doi.org/10.1038/s41436-019-0439-8 |
work_keys_str_mv | AT liqigang xrareamachinelearningmethodjointlymodelingphenotypesandgeneticevidenceforrarediseasediagnosis AT zhaokeyan xrareamachinelearningmethodjointlymodelingphenotypesandgeneticevidenceforrarediseasediagnosis AT bustamantecarlosd xrareamachinelearningmethodjointlymodelingphenotypesandgeneticevidenceforrarediseasediagnosis AT maxin xrareamachinelearningmethodjointlymodelingphenotypesandgeneticevidenceforrarediseasediagnosis AT wongwingh xrareamachinelearningmethodjointlymodelingphenotypesandgeneticevidenceforrarediseasediagnosis |