Cargando…

Topology and redescriptions detect multiple alternative biological pathways from clinical phenotypes

Biological pathways play a crucial role in the properties of diseases and are important in drug discovery. Identifying the logical relationships among distinctive phenotypic clusters could reveal possible connections to the underlying pathways. However, this process is challenging since clinical phe...

Descripción completa

Detalles Bibliográficos
Autores principales: Karisani, Negin, Platt, Daniel E, Basu, Saugata, Parida, Laxmi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: SAGE Publications 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9679317/
https://www.ncbi.nlm.nih.gov/pubmed/36398440
http://dx.doi.org/10.1177/15353702221126671
_version_ 1784834165275885568
author Karisani, Negin
Platt, Daniel E
Basu, Saugata
Parida, Laxmi
author_facet Karisani, Negin
Platt, Daniel E
Basu, Saugata
Parida, Laxmi
author_sort Karisani, Negin
collection PubMed
description Biological pathways play a crucial role in the properties of diseases and are important in drug discovery. Identifying the logical relationships among distinctive phenotypic clusters could reveal possible connections to the underlying pathways. However, this process is challenging since clinical phenotypes are often available through unstructured electronic health records. Moreover, in the absence of a standardized questionnaire, there could be bias among physicians toward selecting certain medical terms. In this article, we develop an efficient pipeline to address these challenges and help practitioners to reveal the pathways associated with the disease. We use topological data analysis and redescriptions and propose a pipeline of four phases: (1) pre-processing the clinical notes to extract the salient concepts, (2) constructing a feature space of the patients to characterize the extracted concepts, (3) leveraging the topological properties to distill the available knowledge and visualize the extracted features, and finally, (4) investigating the bias in the clinical notes of the selected features and identify possible pathways. Our experiments on a publicly available dataset of COVID-19 clinical notes testify that our pipeline can indeed extract meaningful pathways.
format Online
Article
Text
id pubmed-9679317
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher SAGE Publications
record_format MEDLINE/PubMed
spelling pubmed-96793172022-11-22 Topology and redescriptions detect multiple alternative biological pathways from clinical phenotypes Karisani, Negin Platt, Daniel E Basu, Saugata Parida, Laxmi Exp Biol Med (Maywood) Original Research Biological pathways play a crucial role in the properties of diseases and are important in drug discovery. Identifying the logical relationships among distinctive phenotypic clusters could reveal possible connections to the underlying pathways. However, this process is challenging since clinical phenotypes are often available through unstructured electronic health records. Moreover, in the absence of a standardized questionnaire, there could be bias among physicians toward selecting certain medical terms. In this article, we develop an efficient pipeline to address these challenges and help practitioners to reveal the pathways associated with the disease. We use topological data analysis and redescriptions and propose a pipeline of four phases: (1) pre-processing the clinical notes to extract the salient concepts, (2) constructing a feature space of the patients to characterize the extracted concepts, (3) leveraging the topological properties to distill the available knowledge and visualize the extracted features, and finally, (4) investigating the bias in the clinical notes of the selected features and identify possible pathways. Our experiments on a publicly available dataset of COVID-19 clinical notes testify that our pipeline can indeed extract meaningful pathways. SAGE Publications 2022-11-18 2022-11 /pmc/articles/PMC9679317/ /pubmed/36398440 http://dx.doi.org/10.1177/15353702221126671 Text en © 2022 by the Society for Experimental Biology and Medicine
spellingShingle Original Research
Karisani, Negin
Platt, Daniel E
Basu, Saugata
Parida, Laxmi
Topology and redescriptions detect multiple alternative biological pathways from clinical phenotypes
title Topology and redescriptions detect multiple alternative biological pathways from clinical phenotypes
title_full Topology and redescriptions detect multiple alternative biological pathways from clinical phenotypes
title_fullStr Topology and redescriptions detect multiple alternative biological pathways from clinical phenotypes
title_full_unstemmed Topology and redescriptions detect multiple alternative biological pathways from clinical phenotypes
title_short Topology and redescriptions detect multiple alternative biological pathways from clinical phenotypes
title_sort topology and redescriptions detect multiple alternative biological pathways from clinical phenotypes
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9679317/
https://www.ncbi.nlm.nih.gov/pubmed/36398440
http://dx.doi.org/10.1177/15353702221126671
work_keys_str_mv AT karisaninegin topologyandredescriptionsdetectmultiplealternativebiologicalpathwaysfromclinicalphenotypes
AT plattdaniele topologyandredescriptionsdetectmultiplealternativebiologicalpathwaysfromclinicalphenotypes
AT basusaugata topologyandredescriptionsdetectmultiplealternativebiologicalpathwaysfromclinicalphenotypes
AT paridalaxmi topologyandredescriptionsdetectmultiplealternativebiologicalpathwaysfromclinicalphenotypes