Cargando…
eXplainable Artificial Intelligence (XAI) for the identification of biologically relevant gene expression patterns in longitudinal human studies, insights from obesity research
Until date, several machine learning approaches have been proposed for the dynamic modeling of temporal omics data. Although they have yielded impressive results in terms of model accuracy and predictive ability, most of these applications are based on “Black-box” algorithms and more interpretable m...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7176286/ https://www.ncbi.nlm.nih.gov/pubmed/32275707 http://dx.doi.org/10.1371/journal.pcbi.1007792 |
_version_ | 1783524991906611200 |
---|---|
author | Anguita-Ruiz, Augusto Segura-Delgado, Alberto Alcalá, Rafael Aguilera, Concepción M. Alcalá-Fdez, Jesús |
author_facet | Anguita-Ruiz, Augusto Segura-Delgado, Alberto Alcalá, Rafael Aguilera, Concepción M. Alcalá-Fdez, Jesús |
author_sort | Anguita-Ruiz, Augusto |
collection | PubMed |
description | Until date, several machine learning approaches have been proposed for the dynamic modeling of temporal omics data. Although they have yielded impressive results in terms of model accuracy and predictive ability, most of these applications are based on “Black-box” algorithms and more interpretable models have been claimed by the research community. The recent eXplainable Artificial Intelligence (XAI) revolution offers a solution for this issue, were rule-based approaches are highly suitable for explanatory purposes. The further integration of the data mining process along with functional-annotation and pathway analyses is an additional way towards more explanatory and biologically soundness models. In this paper, we present a novel rule-based XAI strategy (including pre-processing, knowledge-extraction and functional validation) for finding biologically relevant sequential patterns from longitudinal human gene expression data (GED). To illustrate the performance of our pipeline, we work on in vivo temporal GED collected within the course of a long-term dietary intervention in 57 subjects with obesity (GSE77962). As validation populations, we employ three independent datasets following the same experimental design. As a result, we validate primarily extracted gene patterns and prove the goodness of our strategy for the mining of biologically relevant gene-gene temporal relations. Our whole pipeline has been gathered under open-source software and could be easily extended to other human temporal GED applications. |
format | Online Article Text |
id | pubmed-7176286 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-71762862020-04-29 eXplainable Artificial Intelligence (XAI) for the identification of biologically relevant gene expression patterns in longitudinal human studies, insights from obesity research Anguita-Ruiz, Augusto Segura-Delgado, Alberto Alcalá, Rafael Aguilera, Concepción M. Alcalá-Fdez, Jesús PLoS Comput Biol Research Article Until date, several machine learning approaches have been proposed for the dynamic modeling of temporal omics data. Although they have yielded impressive results in terms of model accuracy and predictive ability, most of these applications are based on “Black-box” algorithms and more interpretable models have been claimed by the research community. The recent eXplainable Artificial Intelligence (XAI) revolution offers a solution for this issue, were rule-based approaches are highly suitable for explanatory purposes. The further integration of the data mining process along with functional-annotation and pathway analyses is an additional way towards more explanatory and biologically soundness models. In this paper, we present a novel rule-based XAI strategy (including pre-processing, knowledge-extraction and functional validation) for finding biologically relevant sequential patterns from longitudinal human gene expression data (GED). To illustrate the performance of our pipeline, we work on in vivo temporal GED collected within the course of a long-term dietary intervention in 57 subjects with obesity (GSE77962). As validation populations, we employ three independent datasets following the same experimental design. As a result, we validate primarily extracted gene patterns and prove the goodness of our strategy for the mining of biologically relevant gene-gene temporal relations. Our whole pipeline has been gathered under open-source software and could be easily extended to other human temporal GED applications. Public Library of Science 2020-04-10 /pmc/articles/PMC7176286/ /pubmed/32275707 http://dx.doi.org/10.1371/journal.pcbi.1007792 Text en © 2020 Anguita-Ruiz et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Anguita-Ruiz, Augusto Segura-Delgado, Alberto Alcalá, Rafael Aguilera, Concepción M. Alcalá-Fdez, Jesús eXplainable Artificial Intelligence (XAI) for the identification of biologically relevant gene expression patterns in longitudinal human studies, insights from obesity research |
title | eXplainable Artificial Intelligence (XAI) for the identification of biologically relevant gene expression patterns in longitudinal human studies, insights from obesity research |
title_full | eXplainable Artificial Intelligence (XAI) for the identification of biologically relevant gene expression patterns in longitudinal human studies, insights from obesity research |
title_fullStr | eXplainable Artificial Intelligence (XAI) for the identification of biologically relevant gene expression patterns in longitudinal human studies, insights from obesity research |
title_full_unstemmed | eXplainable Artificial Intelligence (XAI) for the identification of biologically relevant gene expression patterns in longitudinal human studies, insights from obesity research |
title_short | eXplainable Artificial Intelligence (XAI) for the identification of biologically relevant gene expression patterns in longitudinal human studies, insights from obesity research |
title_sort | explainable artificial intelligence (xai) for the identification of biologically relevant gene expression patterns in longitudinal human studies, insights from obesity research |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7176286/ https://www.ncbi.nlm.nih.gov/pubmed/32275707 http://dx.doi.org/10.1371/journal.pcbi.1007792 |
work_keys_str_mv | AT anguitaruizaugusto explainableartificialintelligencexaifortheidentificationofbiologicallyrelevantgeneexpressionpatternsinlongitudinalhumanstudiesinsightsfromobesityresearch AT seguradelgadoalberto explainableartificialintelligencexaifortheidentificationofbiologicallyrelevantgeneexpressionpatternsinlongitudinalhumanstudiesinsightsfromobesityresearch AT alcalarafael explainableartificialintelligencexaifortheidentificationofbiologicallyrelevantgeneexpressionpatternsinlongitudinalhumanstudiesinsightsfromobesityresearch AT aguileraconcepcionm explainableartificialintelligencexaifortheidentificationofbiologicallyrelevantgeneexpressionpatternsinlongitudinalhumanstudiesinsightsfromobesityresearch AT alcalafdezjesus explainableartificialintelligencexaifortheidentificationofbiologicallyrelevantgeneexpressionpatternsinlongitudinalhumanstudiesinsightsfromobesityresearch |