Cargando…

541. In Silico Modeling by Causal Inference for Identifying Biomarkers of Sepsis

BACKGROUND: “Every 3 seconds, someone in the world dies of sepsis” (https://sepsistrust.org/about/). We used causal inference theory as a in silico method to identify biomarkers of sepsis. Causal Inference is a theory in Machine Learning that seeks for the root causes of an event. METHODS: In the st...

Descripción completa

Detalles Bibliográficos
Autores principales: Carvalho, Walisson, Silvério-Machado, Rita, Couto, Bráulio R G M, Zarate, Luis Enrique
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9752052/
http://dx.doi.org/10.1093/ofid/ofac492.594
_version_ 1784850624295206912
author Carvalho, Walisson
Silvério-Machado, Rita
Couto, Bráulio R G M
Zarate, Luis Enrique
author_facet Carvalho, Walisson
Silvério-Machado, Rita
Couto, Bráulio R G M
Zarate, Luis Enrique
author_sort Carvalho, Walisson
collection PubMed
description BACKGROUND: “Every 3 seconds, someone in the world dies of sepsis” (https://sepsistrust.org/about/). We used causal inference theory as a in silico method to identify biomarkers of sepsis. Causal Inference is a theory in Machine Learning that seeks for the root causes of an event. METHODS: In the study it was used transcription profile data downloaded from http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE12624. This dataset has 70 samples, being 34 sepsis and 36 non-sepsis samples. Data set contains 8,519 attributes: 7,672 genes obtained after preprocessing of the mRNA expression profile data. The method applied in the dataset was a modification in the HEISA, a local learner of two stages algorithm (Figure 1). In the first stage HEISA identifies variables that compounds the set of parents, children, parents of parents and children of children of a target. During the second stage is calculated the causal effect using do-calculus method, of the selected variables of the first stage in the target. At end of the second stage, features with causal effect greater than 0.2 is selected. After selecting the features mRNA expression, it was applied two algorithms of classification, Random Forest and K-means, in order to evaluate the ability of the selected variables of identifying the occurrence of sepsis. [Figure: see text] RESULTS: As shown in Table I, after applying the algorithm, it was select six mRNA expressions that better explains whether sepsis occurs, or not. Analyzing these genes it was possible to observe that three of them are known biomarkers, they are related to sepsis: NM_017526, NM_004649 and NM_006099. It is important to stress that the second one, Homo sapiens leptin receptor overlapping transcript (LEPROT) transcript variant 1 mRNA, is known to be inversely proportional to sepsis (its causal effect is negative). The other three biomarkers, NM_017526, NM_001274, and NM_001071 also are inversely proportional to sepsis. Their relationship with sepsis is still unknown. Regarding the classification task, using only those six mRNA expressions, the accuracy of the task of classification was 100%. [Figure: see text] CONCLUSION: In a big set of 7,672 genes, only six were returned as sepsis biomarkers candidates. This is a very promising in silico discovery, made by a novel mathematical method, the Causal Inference theory. DISCLOSURES: All Authors: No reported disclosures.
format Online
Article
Text
id pubmed-9752052
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-97520522022-12-16 541. In Silico Modeling by Causal Inference for Identifying Biomarkers of Sepsis Carvalho, Walisson Silvério-Machado, Rita Couto, Bráulio R G M Zarate, Luis Enrique Open Forum Infect Dis Abstracts BACKGROUND: “Every 3 seconds, someone in the world dies of sepsis” (https://sepsistrust.org/about/). We used causal inference theory as a in silico method to identify biomarkers of sepsis. Causal Inference is a theory in Machine Learning that seeks for the root causes of an event. METHODS: In the study it was used transcription profile data downloaded from http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE12624. This dataset has 70 samples, being 34 sepsis and 36 non-sepsis samples. Data set contains 8,519 attributes: 7,672 genes obtained after preprocessing of the mRNA expression profile data. The method applied in the dataset was a modification in the HEISA, a local learner of two stages algorithm (Figure 1). In the first stage HEISA identifies variables that compounds the set of parents, children, parents of parents and children of children of a target. During the second stage is calculated the causal effect using do-calculus method, of the selected variables of the first stage in the target. At end of the second stage, features with causal effect greater than 0.2 is selected. After selecting the features mRNA expression, it was applied two algorithms of classification, Random Forest and K-means, in order to evaluate the ability of the selected variables of identifying the occurrence of sepsis. [Figure: see text] RESULTS: As shown in Table I, after applying the algorithm, it was select six mRNA expressions that better explains whether sepsis occurs, or not. Analyzing these genes it was possible to observe that three of them are known biomarkers, they are related to sepsis: NM_017526, NM_004649 and NM_006099. It is important to stress that the second one, Homo sapiens leptin receptor overlapping transcript (LEPROT) transcript variant 1 mRNA, is known to be inversely proportional to sepsis (its causal effect is negative). The other three biomarkers, NM_017526, NM_001274, and NM_001071 also are inversely proportional to sepsis. Their relationship with sepsis is still unknown. Regarding the classification task, using only those six mRNA expressions, the accuracy of the task of classification was 100%. [Figure: see text] CONCLUSION: In a big set of 7,672 genes, only six were returned as sepsis biomarkers candidates. This is a very promising in silico discovery, made by a novel mathematical method, the Causal Inference theory. DISCLOSURES: All Authors: No reported disclosures. Oxford University Press 2022-12-15 /pmc/articles/PMC9752052/ http://dx.doi.org/10.1093/ofid/ofac492.594 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of Infectious Diseases Society of America. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Abstracts
Carvalho, Walisson
Silvério-Machado, Rita
Couto, Bráulio R G M
Zarate, Luis Enrique
541. In Silico Modeling by Causal Inference for Identifying Biomarkers of Sepsis
title 541. In Silico Modeling by Causal Inference for Identifying Biomarkers of Sepsis
title_full 541. In Silico Modeling by Causal Inference for Identifying Biomarkers of Sepsis
title_fullStr 541. In Silico Modeling by Causal Inference for Identifying Biomarkers of Sepsis
title_full_unstemmed 541. In Silico Modeling by Causal Inference for Identifying Biomarkers of Sepsis
title_short 541. In Silico Modeling by Causal Inference for Identifying Biomarkers of Sepsis
title_sort 541. in silico modeling by causal inference for identifying biomarkers of sepsis
topic Abstracts
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9752052/
http://dx.doi.org/10.1093/ofid/ofac492.594
work_keys_str_mv AT carvalhowalisson 541insilicomodelingbycausalinferenceforidentifyingbiomarkersofsepsis
AT silveriomachadorita 541insilicomodelingbycausalinferenceforidentifyingbiomarkersofsepsis
AT coutobrauliorgm 541insilicomodelingbycausalinferenceforidentifyingbiomarkersofsepsis
AT zarateluisenrique 541insilicomodelingbycausalinferenceforidentifyingbiomarkersofsepsis