Cargando…

Deep phenotyping unstructured data mining in an extensive pediatric database to unravel a common KCNA2 variant in neurodevelopmental syndromes

PURPOSE: Electronic health records are gaining popularity to detect and propose interdisciplinary treatments for patients with similar medical histories, diagnoses, and outcomes. These files are compiled by different nonexperts and expert clinicians. Data mining in these unstructured data is a trans...

Descripción completa

Detalles Bibliográficos
Autores principales: Hully, Marie, Lo Barco, Tommaso, Kaminska, Anna, Barcia, Giulia, Cances, Claude, Mignot, Cyril, Desguerre, Isabelle, Garcelon, Nicolas, Kabashi, Edor, Nabbout, Rima
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group US 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8105164/
https://www.ncbi.nlm.nih.gov/pubmed/33500571
http://dx.doi.org/10.1038/s41436-020-01039-z
_version_ 1783689558061219840
author Hully, Marie
Lo Barco, Tommaso
Kaminska, Anna
Barcia, Giulia
Cances, Claude
Mignot, Cyril
Desguerre, Isabelle
Garcelon, Nicolas
Kabashi, Edor
Nabbout, Rima
author_facet Hully, Marie
Lo Barco, Tommaso
Kaminska, Anna
Barcia, Giulia
Cances, Claude
Mignot, Cyril
Desguerre, Isabelle
Garcelon, Nicolas
Kabashi, Edor
Nabbout, Rima
author_sort Hully, Marie
collection PubMed
description PURPOSE: Electronic health records are gaining popularity to detect and propose interdisciplinary treatments for patients with similar medical histories, diagnoses, and outcomes. These files are compiled by different nonexperts and expert clinicians. Data mining in these unstructured data is a transposable and sustainable methodology to search for patients presenting a high similitude of clinical features. METHODS: Exome and targeted next-generation sequencing bioinformatics analyses were performed at the Imagine Institute. Similarity Index (SI), an algorithm based on a vector space model (VSM) that exploits concepts extracted from clinical narrative reports was used to identify patients with highly similar clinical features. RESULTS: Here we describe a case of “automated diagnosis” indicated by Dr. Warehouse, a biomedical data warehouse oriented toward clinical narrative reports, developed at Necker Children’s Hospital using around 500,000 patients’ records. Through the use of this warehouse, we were able to match and identify two patients sharing very specific clinical neonatal and childhood features harboring the same de novo variant in KCNA2. CONCLUSION: This innovative application of database clustering clinical features could advance identification of patients with rare and common genetic conditions and detect with high accuracy the natural history of patients harboring similar genetic pathogenic variants.
format Online
Article
Text
id pubmed-8105164
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group US
record_format MEDLINE/PubMed
spelling pubmed-81051642021-05-24 Deep phenotyping unstructured data mining in an extensive pediatric database to unravel a common KCNA2 variant in neurodevelopmental syndromes Hully, Marie Lo Barco, Tommaso Kaminska, Anna Barcia, Giulia Cances, Claude Mignot, Cyril Desguerre, Isabelle Garcelon, Nicolas Kabashi, Edor Nabbout, Rima Genet Med Brief Communication PURPOSE: Electronic health records are gaining popularity to detect and propose interdisciplinary treatments for patients with similar medical histories, diagnoses, and outcomes. These files are compiled by different nonexperts and expert clinicians. Data mining in these unstructured data is a transposable and sustainable methodology to search for patients presenting a high similitude of clinical features. METHODS: Exome and targeted next-generation sequencing bioinformatics analyses were performed at the Imagine Institute. Similarity Index (SI), an algorithm based on a vector space model (VSM) that exploits concepts extracted from clinical narrative reports was used to identify patients with highly similar clinical features. RESULTS: Here we describe a case of “automated diagnosis” indicated by Dr. Warehouse, a biomedical data warehouse oriented toward clinical narrative reports, developed at Necker Children’s Hospital using around 500,000 patients’ records. Through the use of this warehouse, we were able to match and identify two patients sharing very specific clinical neonatal and childhood features harboring the same de novo variant in KCNA2. CONCLUSION: This innovative application of database clustering clinical features could advance identification of patients with rare and common genetic conditions and detect with high accuracy the natural history of patients harboring similar genetic pathogenic variants. Nature Publishing Group US 2021-01-26 2021 /pmc/articles/PMC8105164/ /pubmed/33500571 http://dx.doi.org/10.1038/s41436-020-01039-z Text en © The Author(s) 2021 https://creativecommons.org/licenses/by-nc-nd/4.0/Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, and provide a link to the Creative Commons license. You do not have permission under this license to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-nd/4.0/ (https://creativecommons.org/licenses/by-nc-nd/4.0/) .
spellingShingle Brief Communication
Hully, Marie
Lo Barco, Tommaso
Kaminska, Anna
Barcia, Giulia
Cances, Claude
Mignot, Cyril
Desguerre, Isabelle
Garcelon, Nicolas
Kabashi, Edor
Nabbout, Rima
Deep phenotyping unstructured data mining in an extensive pediatric database to unravel a common KCNA2 variant in neurodevelopmental syndromes
title Deep phenotyping unstructured data mining in an extensive pediatric database to unravel a common KCNA2 variant in neurodevelopmental syndromes
title_full Deep phenotyping unstructured data mining in an extensive pediatric database to unravel a common KCNA2 variant in neurodevelopmental syndromes
title_fullStr Deep phenotyping unstructured data mining in an extensive pediatric database to unravel a common KCNA2 variant in neurodevelopmental syndromes
title_full_unstemmed Deep phenotyping unstructured data mining in an extensive pediatric database to unravel a common KCNA2 variant in neurodevelopmental syndromes
title_short Deep phenotyping unstructured data mining in an extensive pediatric database to unravel a common KCNA2 variant in neurodevelopmental syndromes
title_sort deep phenotyping unstructured data mining in an extensive pediatric database to unravel a common kcna2 variant in neurodevelopmental syndromes
topic Brief Communication
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8105164/
https://www.ncbi.nlm.nih.gov/pubmed/33500571
http://dx.doi.org/10.1038/s41436-020-01039-z
work_keys_str_mv AT hullymarie deepphenotypingunstructureddatamininginanextensivepediatricdatabasetounravelacommonkcna2variantinneurodevelopmentalsyndromes
AT lobarcotommaso deepphenotypingunstructureddatamininginanextensivepediatricdatabasetounravelacommonkcna2variantinneurodevelopmentalsyndromes
AT kaminskaanna deepphenotypingunstructureddatamininginanextensivepediatricdatabasetounravelacommonkcna2variantinneurodevelopmentalsyndromes
AT barciagiulia deepphenotypingunstructureddatamininginanextensivepediatricdatabasetounravelacommonkcna2variantinneurodevelopmentalsyndromes
AT cancesclaude deepphenotypingunstructureddatamininginanextensivepediatricdatabasetounravelacommonkcna2variantinneurodevelopmentalsyndromes
AT mignotcyril deepphenotypingunstructureddatamininginanextensivepediatricdatabasetounravelacommonkcna2variantinneurodevelopmentalsyndromes
AT desguerreisabelle deepphenotypingunstructureddatamininginanextensivepediatricdatabasetounravelacommonkcna2variantinneurodevelopmentalsyndromes
AT garcelonnicolas deepphenotypingunstructureddatamininginanextensivepediatricdatabasetounravelacommonkcna2variantinneurodevelopmentalsyndromes
AT kabashiedor deepphenotypingunstructureddatamininginanextensivepediatricdatabasetounravelacommonkcna2variantinneurodevelopmentalsyndromes
AT nabboutrima deepphenotypingunstructureddatamininginanextensivepediatricdatabasetounravelacommonkcna2variantinneurodevelopmentalsyndromes