Cargando…

Finding commonalities in rare diseases through the undiagnosed diseases network

OBJECTIVE: When studying any specific rare disease, heterogeneity and scarcity of affected individuals has historically hindered investigators from discerning on what to focus to understand and diagnose a disease. New nongenomic methodologies must be developed that identify similarities in seemingly...

Descripción completa

Detalles Bibliográficos
Autores principales: Yates, Josephine, Gutiérrez-Sacristán, Alba, Jouhet, Vianney, LeBlanc, Kimberly, Esteves, Cecilia, DeSain, Thomas N, Benik, Nick, Stedman, Jason, Palmer, Nathan, Mellon, Guillaume, Kohane, Isaac, Avillach, Paul
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8324228/
https://www.ncbi.nlm.nih.gov/pubmed/34009343
http://dx.doi.org/10.1093/jamia/ocab050
_version_ 1783731365485740032
author Yates, Josephine
Gutiérrez-Sacristán, Alba
Jouhet, Vianney
LeBlanc, Kimberly
Esteves, Cecilia
DeSain, Thomas N
Benik, Nick
Stedman, Jason
Palmer, Nathan
Mellon, Guillaume
Kohane, Isaac
Avillach, Paul
author_facet Yates, Josephine
Gutiérrez-Sacristán, Alba
Jouhet, Vianney
LeBlanc, Kimberly
Esteves, Cecilia
DeSain, Thomas N
Benik, Nick
Stedman, Jason
Palmer, Nathan
Mellon, Guillaume
Kohane, Isaac
Avillach, Paul
author_sort Yates, Josephine
collection PubMed
description OBJECTIVE: When studying any specific rare disease, heterogeneity and scarcity of affected individuals has historically hindered investigators from discerning on what to focus to understand and diagnose a disease. New nongenomic methodologies must be developed that identify similarities in seemingly dissimilar conditions. MATERIALS AND METHODS: This observational study analyzes 1042 patients from the Undiagnosed Diseases Network (2015-2019), a multicenter, nationwide research study using phenotypic data annotated by specialized staff using Human Phenotype Ontology terms. We used Louvain community detection to cluster patients linked by Jaccard pairwise similarity and 2 support vector classifier to assign new cases. We further validated the clusters’ most representative comorbidities using a national claims database (67 million patients). RESULTS: Patients were divided into 2 groups: those with symptom onset before 18 years of age (n = 810) and at 18 years of age or older (n = 232) (average symptom onset age: 10 [interquartile range, 0-14] years). For 810 pediatric patients, we identified 4 statistically significant clusters. Two clusters were characterized by growth disorders, and developmental delay enriched for hypotonia presented a higher likelihood of diagnosis. Support vector classifier showed 0.89 balanced accuracy (0.83 for Human Phenotype Ontology terms only) on test data. DISCUSSIONS: To set the framework for future discovery, we chose as our endpoint the successful grouping of patients by phenotypic similarity and provide a classification tool to assign new patients to those clusters. CONCLUSION: This study shows that despite the scarcity and heterogeneity of patients, we can still find commonalities that can potentially be harnessed to uncover new insights and targets for therapy.
format Online
Article
Text
id pubmed-8324228
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-83242282021-08-02 Finding commonalities in rare diseases through the undiagnosed diseases network Yates, Josephine Gutiérrez-Sacristán, Alba Jouhet, Vianney LeBlanc, Kimberly Esteves, Cecilia DeSain, Thomas N Benik, Nick Stedman, Jason Palmer, Nathan Mellon, Guillaume Kohane, Isaac Avillach, Paul J Am Med Inform Assoc Research and Applications OBJECTIVE: When studying any specific rare disease, heterogeneity and scarcity of affected individuals has historically hindered investigators from discerning on what to focus to understand and diagnose a disease. New nongenomic methodologies must be developed that identify similarities in seemingly dissimilar conditions. MATERIALS AND METHODS: This observational study analyzes 1042 patients from the Undiagnosed Diseases Network (2015-2019), a multicenter, nationwide research study using phenotypic data annotated by specialized staff using Human Phenotype Ontology terms. We used Louvain community detection to cluster patients linked by Jaccard pairwise similarity and 2 support vector classifier to assign new cases. We further validated the clusters’ most representative comorbidities using a national claims database (67 million patients). RESULTS: Patients were divided into 2 groups: those with symptom onset before 18 years of age (n = 810) and at 18 years of age or older (n = 232) (average symptom onset age: 10 [interquartile range, 0-14] years). For 810 pediatric patients, we identified 4 statistically significant clusters. Two clusters were characterized by growth disorders, and developmental delay enriched for hypotonia presented a higher likelihood of diagnosis. Support vector classifier showed 0.89 balanced accuracy (0.83 for Human Phenotype Ontology terms only) on test data. DISCUSSIONS: To set the framework for future discovery, we chose as our endpoint the successful grouping of patients by phenotypic similarity and provide a classification tool to assign new patients to those clusters. CONCLUSION: This study shows that despite the scarcity and heterogeneity of patients, we can still find commonalities that can potentially be harnessed to uncover new insights and targets for therapy. Oxford University Press 2021-05-03 /pmc/articles/PMC8324228/ /pubmed/34009343 http://dx.doi.org/10.1093/jamia/ocab050 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of the American Medical Informatics Association. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research and Applications
Yates, Josephine
Gutiérrez-Sacristán, Alba
Jouhet, Vianney
LeBlanc, Kimberly
Esteves, Cecilia
DeSain, Thomas N
Benik, Nick
Stedman, Jason
Palmer, Nathan
Mellon, Guillaume
Kohane, Isaac
Avillach, Paul
Finding commonalities in rare diseases through the undiagnosed diseases network
title Finding commonalities in rare diseases through the undiagnosed diseases network
title_full Finding commonalities in rare diseases through the undiagnosed diseases network
title_fullStr Finding commonalities in rare diseases through the undiagnosed diseases network
title_full_unstemmed Finding commonalities in rare diseases through the undiagnosed diseases network
title_short Finding commonalities in rare diseases through the undiagnosed diseases network
title_sort finding commonalities in rare diseases through the undiagnosed diseases network
topic Research and Applications
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8324228/
https://www.ncbi.nlm.nih.gov/pubmed/34009343
http://dx.doi.org/10.1093/jamia/ocab050
work_keys_str_mv AT yatesjosephine findingcommonalitiesinrarediseasesthroughtheundiagnoseddiseasesnetwork
AT gutierrezsacristanalba findingcommonalitiesinrarediseasesthroughtheundiagnoseddiseasesnetwork
AT jouhetvianney findingcommonalitiesinrarediseasesthroughtheundiagnoseddiseasesnetwork
AT leblanckimberly findingcommonalitiesinrarediseasesthroughtheundiagnoseddiseasesnetwork
AT estevescecilia findingcommonalitiesinrarediseasesthroughtheundiagnoseddiseasesnetwork
AT findingcommonalitiesinrarediseasesthroughtheundiagnoseddiseasesnetwork
AT desainthomasn findingcommonalitiesinrarediseasesthroughtheundiagnoseddiseasesnetwork
AT beniknick findingcommonalitiesinrarediseasesthroughtheundiagnoseddiseasesnetwork
AT stedmanjason findingcommonalitiesinrarediseasesthroughtheundiagnoseddiseasesnetwork
AT palmernathan findingcommonalitiesinrarediseasesthroughtheundiagnoseddiseasesnetwork
AT mellonguillaume findingcommonalitiesinrarediseasesthroughtheundiagnoseddiseasesnetwork
AT kohaneisaac findingcommonalitiesinrarediseasesthroughtheundiagnoseddiseasesnetwork
AT avillachpaul findingcommonalitiesinrarediseasesthroughtheundiagnoseddiseasesnetwork