Cargando…
Topology and redescriptions detect multiple alternative biological pathways from clinical phenotypes
Biological pathways play a crucial role in the properties of diseases and are important in drug discovery. Identifying the logical relationships among distinctive phenotypic clusters could reveal possible connections to the underlying pathways. However, this process is challenging since clinical phe...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
SAGE Publications
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9679317/ https://www.ncbi.nlm.nih.gov/pubmed/36398440 http://dx.doi.org/10.1177/15353702221126671 |
_version_ | 1784834165275885568 |
---|---|
author | Karisani, Negin Platt, Daniel E Basu, Saugata Parida, Laxmi |
author_facet | Karisani, Negin Platt, Daniel E Basu, Saugata Parida, Laxmi |
author_sort | Karisani, Negin |
collection | PubMed |
description | Biological pathways play a crucial role in the properties of diseases and are important in drug discovery. Identifying the logical relationships among distinctive phenotypic clusters could reveal possible connections to the underlying pathways. However, this process is challenging since clinical phenotypes are often available through unstructured electronic health records. Moreover, in the absence of a standardized questionnaire, there could be bias among physicians toward selecting certain medical terms. In this article, we develop an efficient pipeline to address these challenges and help practitioners to reveal the pathways associated with the disease. We use topological data analysis and redescriptions and propose a pipeline of four phases: (1) pre-processing the clinical notes to extract the salient concepts, (2) constructing a feature space of the patients to characterize the extracted concepts, (3) leveraging the topological properties to distill the available knowledge and visualize the extracted features, and finally, (4) investigating the bias in the clinical notes of the selected features and identify possible pathways. Our experiments on a publicly available dataset of COVID-19 clinical notes testify that our pipeline can indeed extract meaningful pathways. |
format | Online Article Text |
id | pubmed-9679317 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | SAGE Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-96793172022-11-22 Topology and redescriptions detect multiple alternative biological pathways from clinical phenotypes Karisani, Negin Platt, Daniel E Basu, Saugata Parida, Laxmi Exp Biol Med (Maywood) Original Research Biological pathways play a crucial role in the properties of diseases and are important in drug discovery. Identifying the logical relationships among distinctive phenotypic clusters could reveal possible connections to the underlying pathways. However, this process is challenging since clinical phenotypes are often available through unstructured electronic health records. Moreover, in the absence of a standardized questionnaire, there could be bias among physicians toward selecting certain medical terms. In this article, we develop an efficient pipeline to address these challenges and help practitioners to reveal the pathways associated with the disease. We use topological data analysis and redescriptions and propose a pipeline of four phases: (1) pre-processing the clinical notes to extract the salient concepts, (2) constructing a feature space of the patients to characterize the extracted concepts, (3) leveraging the topological properties to distill the available knowledge and visualize the extracted features, and finally, (4) investigating the bias in the clinical notes of the selected features and identify possible pathways. Our experiments on a publicly available dataset of COVID-19 clinical notes testify that our pipeline can indeed extract meaningful pathways. SAGE Publications 2022-11-18 2022-11 /pmc/articles/PMC9679317/ /pubmed/36398440 http://dx.doi.org/10.1177/15353702221126671 Text en © 2022 by the Society for Experimental Biology and Medicine |
spellingShingle | Original Research Karisani, Negin Platt, Daniel E Basu, Saugata Parida, Laxmi Topology and redescriptions detect multiple alternative biological pathways from clinical phenotypes |
title | Topology and redescriptions detect multiple alternative biological pathways from clinical phenotypes |
title_full | Topology and redescriptions detect multiple alternative biological pathways from clinical phenotypes |
title_fullStr | Topology and redescriptions detect multiple alternative biological pathways from clinical phenotypes |
title_full_unstemmed | Topology and redescriptions detect multiple alternative biological pathways from clinical phenotypes |
title_short | Topology and redescriptions detect multiple alternative biological pathways from clinical phenotypes |
title_sort | topology and redescriptions detect multiple alternative biological pathways from clinical phenotypes |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9679317/ https://www.ncbi.nlm.nih.gov/pubmed/36398440 http://dx.doi.org/10.1177/15353702221126671 |
work_keys_str_mv | AT karisaninegin topologyandredescriptionsdetectmultiplealternativebiologicalpathwaysfromclinicalphenotypes AT plattdaniele topologyandredescriptionsdetectmultiplealternativebiologicalpathwaysfromclinicalphenotypes AT basusaugata topologyandredescriptionsdetectmultiplealternativebiologicalpathwaysfromclinicalphenotypes AT paridalaxmi topologyandredescriptionsdetectmultiplealternativebiologicalpathwaysfromclinicalphenotypes |