Cargando…

Simulation of undiagnosed patients with novel genetic conditions

Rare Mendelian disorders pose a major diagnostic challenge and collectively affect 300–400 million patients worldwide. Many automated tools aim to uncover causal genes in patients with suspected genetic disorders, but evaluation of these tools is limited due to the lack of comprehensive benchmark da...

Descripción completa

Detalles Bibliográficos
Autores principales: Alsentzer, Emily, Finlayson, Samuel G., Li, Michelle M., Kobren, Shilpa N., Kohane, Isaac S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10570269/
https://www.ncbi.nlm.nih.gov/pubmed/37828001
http://dx.doi.org/10.1038/s41467-023-41980-6
_version_ 1785119726601502720
author Alsentzer, Emily
Finlayson, Samuel G.
Li, Michelle M.
Kobren, Shilpa N.
Kohane, Isaac S.
author_facet Alsentzer, Emily
Finlayson, Samuel G.
Li, Michelle M.
Kobren, Shilpa N.
Kohane, Isaac S.
author_sort Alsentzer, Emily
collection PubMed
description Rare Mendelian disorders pose a major diagnostic challenge and collectively affect 300–400 million patients worldwide. Many automated tools aim to uncover causal genes in patients with suspected genetic disorders, but evaluation of these tools is limited due to the lack of comprehensive benchmark datasets that include previously unpublished conditions. Here, we present a computational pipeline that simulates realistic clinical datasets to address this deficit. Our framework jointly simulates complex phenotypes and challenging candidate genes and produces patients with novel genetic conditions. We demonstrate the similarity of our simulated patients to real patients from the Undiagnosed Diseases Network and evaluate common gene prioritization methods on the simulated cohort. These prioritization methods recover known gene-disease associations but perform poorly on diagnosing patients with novel genetic disorders. Our publicly-available dataset and codebase can be utilized by medical genetics researchers to evaluate, compare, and improve tools that aid in the diagnostic process.
format Online
Article
Text
id pubmed-10570269
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-105702692023-10-14 Simulation of undiagnosed patients with novel genetic conditions Alsentzer, Emily Finlayson, Samuel G. Li, Michelle M. Kobren, Shilpa N. Kohane, Isaac S. Nat Commun Article Rare Mendelian disorders pose a major diagnostic challenge and collectively affect 300–400 million patients worldwide. Many automated tools aim to uncover causal genes in patients with suspected genetic disorders, but evaluation of these tools is limited due to the lack of comprehensive benchmark datasets that include previously unpublished conditions. Here, we present a computational pipeline that simulates realistic clinical datasets to address this deficit. Our framework jointly simulates complex phenotypes and challenging candidate genes and produces patients with novel genetic conditions. We demonstrate the similarity of our simulated patients to real patients from the Undiagnosed Diseases Network and evaluate common gene prioritization methods on the simulated cohort. These prioritization methods recover known gene-disease associations but perform poorly on diagnosing patients with novel genetic disorders. Our publicly-available dataset and codebase can be utilized by medical genetics researchers to evaluate, compare, and improve tools that aid in the diagnostic process. Nature Publishing Group UK 2023-10-12 /pmc/articles/PMC10570269/ /pubmed/37828001 http://dx.doi.org/10.1038/s41467-023-41980-6 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Alsentzer, Emily
Finlayson, Samuel G.
Li, Michelle M.
Kobren, Shilpa N.
Kohane, Isaac S.
Simulation of undiagnosed patients with novel genetic conditions
title Simulation of undiagnosed patients with novel genetic conditions
title_full Simulation of undiagnosed patients with novel genetic conditions
title_fullStr Simulation of undiagnosed patients with novel genetic conditions
title_full_unstemmed Simulation of undiagnosed patients with novel genetic conditions
title_short Simulation of undiagnosed patients with novel genetic conditions
title_sort simulation of undiagnosed patients with novel genetic conditions
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10570269/
https://www.ncbi.nlm.nih.gov/pubmed/37828001
http://dx.doi.org/10.1038/s41467-023-41980-6
work_keys_str_mv AT alsentzeremily simulationofundiagnosedpatientswithnovelgeneticconditions
AT finlaysonsamuelg simulationofundiagnosedpatientswithnovelgeneticconditions
AT limichellem simulationofundiagnosedpatientswithnovelgeneticconditions
AT simulationofundiagnosedpatientswithnovelgeneticconditions
AT kobrenshilpan simulationofundiagnosedpatientswithnovelgeneticconditions
AT kohaneisaacs simulationofundiagnosedpatientswithnovelgeneticconditions