Cargando…

eXplainable Artificial Intelligence (XAI) for the identification of biologically relevant gene expression patterns in longitudinal human studies, insights from obesity research

Until date, several machine learning approaches have been proposed for the dynamic modeling of temporal omics data. Although they have yielded impressive results in terms of model accuracy and predictive ability, most of these applications are based on “Black-box” algorithms and more interpretable m...

Descripción completa

Detalles Bibliográficos
Autores principales: Anguita-Ruiz, Augusto, Segura-Delgado, Alberto, Alcalá, Rafael, Aguilera, Concepción M., Alcalá-Fdez, Jesús
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7176286/
https://www.ncbi.nlm.nih.gov/pubmed/32275707
http://dx.doi.org/10.1371/journal.pcbi.1007792
_version_ 1783524991906611200
author Anguita-Ruiz, Augusto
Segura-Delgado, Alberto
Alcalá, Rafael
Aguilera, Concepción M.
Alcalá-Fdez, Jesús
author_facet Anguita-Ruiz, Augusto
Segura-Delgado, Alberto
Alcalá, Rafael
Aguilera, Concepción M.
Alcalá-Fdez, Jesús
author_sort Anguita-Ruiz, Augusto
collection PubMed
description Until date, several machine learning approaches have been proposed for the dynamic modeling of temporal omics data. Although they have yielded impressive results in terms of model accuracy and predictive ability, most of these applications are based on “Black-box” algorithms and more interpretable models have been claimed by the research community. The recent eXplainable Artificial Intelligence (XAI) revolution offers a solution for this issue, were rule-based approaches are highly suitable for explanatory purposes. The further integration of the data mining process along with functional-annotation and pathway analyses is an additional way towards more explanatory and biologically soundness models. In this paper, we present a novel rule-based XAI strategy (including pre-processing, knowledge-extraction and functional validation) for finding biologically relevant sequential patterns from longitudinal human gene expression data (GED). To illustrate the performance of our pipeline, we work on in vivo temporal GED collected within the course of a long-term dietary intervention in 57 subjects with obesity (GSE77962). As validation populations, we employ three independent datasets following the same experimental design. As a result, we validate primarily extracted gene patterns and prove the goodness of our strategy for the mining of biologically relevant gene-gene temporal relations. Our whole pipeline has been gathered under open-source software and could be easily extended to other human temporal GED applications.
format Online
Article
Text
id pubmed-7176286
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-71762862020-04-29 eXplainable Artificial Intelligence (XAI) for the identification of biologically relevant gene expression patterns in longitudinal human studies, insights from obesity research Anguita-Ruiz, Augusto Segura-Delgado, Alberto Alcalá, Rafael Aguilera, Concepción M. Alcalá-Fdez, Jesús PLoS Comput Biol Research Article Until date, several machine learning approaches have been proposed for the dynamic modeling of temporal omics data. Although they have yielded impressive results in terms of model accuracy and predictive ability, most of these applications are based on “Black-box” algorithms and more interpretable models have been claimed by the research community. The recent eXplainable Artificial Intelligence (XAI) revolution offers a solution for this issue, were rule-based approaches are highly suitable for explanatory purposes. The further integration of the data mining process along with functional-annotation and pathway analyses is an additional way towards more explanatory and biologically soundness models. In this paper, we present a novel rule-based XAI strategy (including pre-processing, knowledge-extraction and functional validation) for finding biologically relevant sequential patterns from longitudinal human gene expression data (GED). To illustrate the performance of our pipeline, we work on in vivo temporal GED collected within the course of a long-term dietary intervention in 57 subjects with obesity (GSE77962). As validation populations, we employ three independent datasets following the same experimental design. As a result, we validate primarily extracted gene patterns and prove the goodness of our strategy for the mining of biologically relevant gene-gene temporal relations. Our whole pipeline has been gathered under open-source software and could be easily extended to other human temporal GED applications. Public Library of Science 2020-04-10 /pmc/articles/PMC7176286/ /pubmed/32275707 http://dx.doi.org/10.1371/journal.pcbi.1007792 Text en © 2020 Anguita-Ruiz et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Anguita-Ruiz, Augusto
Segura-Delgado, Alberto
Alcalá, Rafael
Aguilera, Concepción M.
Alcalá-Fdez, Jesús
eXplainable Artificial Intelligence (XAI) for the identification of biologically relevant gene expression patterns in longitudinal human studies, insights from obesity research
title eXplainable Artificial Intelligence (XAI) for the identification of biologically relevant gene expression patterns in longitudinal human studies, insights from obesity research
title_full eXplainable Artificial Intelligence (XAI) for the identification of biologically relevant gene expression patterns in longitudinal human studies, insights from obesity research
title_fullStr eXplainable Artificial Intelligence (XAI) for the identification of biologically relevant gene expression patterns in longitudinal human studies, insights from obesity research
title_full_unstemmed eXplainable Artificial Intelligence (XAI) for the identification of biologically relevant gene expression patterns in longitudinal human studies, insights from obesity research
title_short eXplainable Artificial Intelligence (XAI) for the identification of biologically relevant gene expression patterns in longitudinal human studies, insights from obesity research
title_sort explainable artificial intelligence (xai) for the identification of biologically relevant gene expression patterns in longitudinal human studies, insights from obesity research
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7176286/
https://www.ncbi.nlm.nih.gov/pubmed/32275707
http://dx.doi.org/10.1371/journal.pcbi.1007792
work_keys_str_mv AT anguitaruizaugusto explainableartificialintelligencexaifortheidentificationofbiologicallyrelevantgeneexpressionpatternsinlongitudinalhumanstudiesinsightsfromobesityresearch
AT seguradelgadoalberto explainableartificialintelligencexaifortheidentificationofbiologicallyrelevantgeneexpressionpatternsinlongitudinalhumanstudiesinsightsfromobesityresearch
AT alcalarafael explainableartificialintelligencexaifortheidentificationofbiologicallyrelevantgeneexpressionpatternsinlongitudinalhumanstudiesinsightsfromobesityresearch
AT aguileraconcepcionm explainableartificialintelligencexaifortheidentificationofbiologicallyrelevantgeneexpressionpatternsinlongitudinalhumanstudiesinsightsfromobesityresearch
AT alcalafdezjesus explainableartificialintelligencexaifortheidentificationofbiologicallyrelevantgeneexpressionpatternsinlongitudinalhumanstudiesinsightsfromobesityresearch