Cargando…
Deep phenotyping unstructured data mining in an extensive pediatric database to unravel a common KCNA2 variant in neurodevelopmental syndromes
PURPOSE: Electronic health records are gaining popularity to detect and propose interdisciplinary treatments for patients with similar medical histories, diagnoses, and outcomes. These files are compiled by different nonexperts and expert clinicians. Data mining in these unstructured data is a trans...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group US
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8105164/ https://www.ncbi.nlm.nih.gov/pubmed/33500571 http://dx.doi.org/10.1038/s41436-020-01039-z |
_version_ | 1783689558061219840 |
---|---|
author | Hully, Marie Lo Barco, Tommaso Kaminska, Anna Barcia, Giulia Cances, Claude Mignot, Cyril Desguerre, Isabelle Garcelon, Nicolas Kabashi, Edor Nabbout, Rima |
author_facet | Hully, Marie Lo Barco, Tommaso Kaminska, Anna Barcia, Giulia Cances, Claude Mignot, Cyril Desguerre, Isabelle Garcelon, Nicolas Kabashi, Edor Nabbout, Rima |
author_sort | Hully, Marie |
collection | PubMed |
description | PURPOSE: Electronic health records are gaining popularity to detect and propose interdisciplinary treatments for patients with similar medical histories, diagnoses, and outcomes. These files are compiled by different nonexperts and expert clinicians. Data mining in these unstructured data is a transposable and sustainable methodology to search for patients presenting a high similitude of clinical features. METHODS: Exome and targeted next-generation sequencing bioinformatics analyses were performed at the Imagine Institute. Similarity Index (SI), an algorithm based on a vector space model (VSM) that exploits concepts extracted from clinical narrative reports was used to identify patients with highly similar clinical features. RESULTS: Here we describe a case of “automated diagnosis” indicated by Dr. Warehouse, a biomedical data warehouse oriented toward clinical narrative reports, developed at Necker Children’s Hospital using around 500,000 patients’ records. Through the use of this warehouse, we were able to match and identify two patients sharing very specific clinical neonatal and childhood features harboring the same de novo variant in KCNA2. CONCLUSION: This innovative application of database clustering clinical features could advance identification of patients with rare and common genetic conditions and detect with high accuracy the natural history of patients harboring similar genetic pathogenic variants. |
format | Online Article Text |
id | pubmed-8105164 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Nature Publishing Group US |
record_format | MEDLINE/PubMed |
spelling | pubmed-81051642021-05-24 Deep phenotyping unstructured data mining in an extensive pediatric database to unravel a common KCNA2 variant in neurodevelopmental syndromes Hully, Marie Lo Barco, Tommaso Kaminska, Anna Barcia, Giulia Cances, Claude Mignot, Cyril Desguerre, Isabelle Garcelon, Nicolas Kabashi, Edor Nabbout, Rima Genet Med Brief Communication PURPOSE: Electronic health records are gaining popularity to detect and propose interdisciplinary treatments for patients with similar medical histories, diagnoses, and outcomes. These files are compiled by different nonexperts and expert clinicians. Data mining in these unstructured data is a transposable and sustainable methodology to search for patients presenting a high similitude of clinical features. METHODS: Exome and targeted next-generation sequencing bioinformatics analyses were performed at the Imagine Institute. Similarity Index (SI), an algorithm based on a vector space model (VSM) that exploits concepts extracted from clinical narrative reports was used to identify patients with highly similar clinical features. RESULTS: Here we describe a case of “automated diagnosis” indicated by Dr. Warehouse, a biomedical data warehouse oriented toward clinical narrative reports, developed at Necker Children’s Hospital using around 500,000 patients’ records. Through the use of this warehouse, we were able to match and identify two patients sharing very specific clinical neonatal and childhood features harboring the same de novo variant in KCNA2. CONCLUSION: This innovative application of database clustering clinical features could advance identification of patients with rare and common genetic conditions and detect with high accuracy the natural history of patients harboring similar genetic pathogenic variants. Nature Publishing Group US 2021-01-26 2021 /pmc/articles/PMC8105164/ /pubmed/33500571 http://dx.doi.org/10.1038/s41436-020-01039-z Text en © The Author(s) 2021 https://creativecommons.org/licenses/by-nc-nd/4.0/Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, and provide a link to the Creative Commons license. You do not have permission under this license to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-nd/4.0/ (https://creativecommons.org/licenses/by-nc-nd/4.0/) . |
spellingShingle | Brief Communication Hully, Marie Lo Barco, Tommaso Kaminska, Anna Barcia, Giulia Cances, Claude Mignot, Cyril Desguerre, Isabelle Garcelon, Nicolas Kabashi, Edor Nabbout, Rima Deep phenotyping unstructured data mining in an extensive pediatric database to unravel a common KCNA2 variant in neurodevelopmental syndromes |
title | Deep phenotyping unstructured data mining in an extensive pediatric database to unravel a common KCNA2 variant in neurodevelopmental syndromes |
title_full | Deep phenotyping unstructured data mining in an extensive pediatric database to unravel a common KCNA2 variant in neurodevelopmental syndromes |
title_fullStr | Deep phenotyping unstructured data mining in an extensive pediatric database to unravel a common KCNA2 variant in neurodevelopmental syndromes |
title_full_unstemmed | Deep phenotyping unstructured data mining in an extensive pediatric database to unravel a common KCNA2 variant in neurodevelopmental syndromes |
title_short | Deep phenotyping unstructured data mining in an extensive pediatric database to unravel a common KCNA2 variant in neurodevelopmental syndromes |
title_sort | deep phenotyping unstructured data mining in an extensive pediatric database to unravel a common kcna2 variant in neurodevelopmental syndromes |
topic | Brief Communication |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8105164/ https://www.ncbi.nlm.nih.gov/pubmed/33500571 http://dx.doi.org/10.1038/s41436-020-01039-z |
work_keys_str_mv | AT hullymarie deepphenotypingunstructureddatamininginanextensivepediatricdatabasetounravelacommonkcna2variantinneurodevelopmentalsyndromes AT lobarcotommaso deepphenotypingunstructureddatamininginanextensivepediatricdatabasetounravelacommonkcna2variantinneurodevelopmentalsyndromes AT kaminskaanna deepphenotypingunstructureddatamininginanextensivepediatricdatabasetounravelacommonkcna2variantinneurodevelopmentalsyndromes AT barciagiulia deepphenotypingunstructureddatamininginanextensivepediatricdatabasetounravelacommonkcna2variantinneurodevelopmentalsyndromes AT cancesclaude deepphenotypingunstructureddatamininginanextensivepediatricdatabasetounravelacommonkcna2variantinneurodevelopmentalsyndromes AT mignotcyril deepphenotypingunstructureddatamininginanextensivepediatricdatabasetounravelacommonkcna2variantinneurodevelopmentalsyndromes AT desguerreisabelle deepphenotypingunstructureddatamininginanextensivepediatricdatabasetounravelacommonkcna2variantinneurodevelopmentalsyndromes AT garcelonnicolas deepphenotypingunstructureddatamininginanextensivepediatricdatabasetounravelacommonkcna2variantinneurodevelopmentalsyndromes AT kabashiedor deepphenotypingunstructureddatamininginanextensivepediatricdatabasetounravelacommonkcna2variantinneurodevelopmentalsyndromes AT nabboutrima deepphenotypingunstructureddatamininginanextensivepediatricdatabasetounravelacommonkcna2variantinneurodevelopmentalsyndromes |