Cargando…

Using knowledge graphs to infer gene expression in plants

INTRODUCTION: Climate change is already affecting ecosystems around the world and forcing us to adapt to meet societal needs. The speed with which climate change is progressing necessitates a massive scaling up of the number of species with understood genotype-environment-phenotype (G×E×P) dynamics...

Descripción completa

Detalles Bibliográficos
Autores principales: Thessen, Anne E., Cooper, Laurel, Swetnam, Tyson L., Hegde, Harshad, Reese, Justin, Elser, Justin, Jaiswal, Pankaj
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10298150/
https://www.ncbi.nlm.nih.gov/pubmed/37384147
http://dx.doi.org/10.3389/frai.2023.1201002
_version_ 1785064044730777600
author Thessen, Anne E.
Cooper, Laurel
Swetnam, Tyson L.
Hegde, Harshad
Reese, Justin
Elser, Justin
Jaiswal, Pankaj
author_facet Thessen, Anne E.
Cooper, Laurel
Swetnam, Tyson L.
Hegde, Harshad
Reese, Justin
Elser, Justin
Jaiswal, Pankaj
author_sort Thessen, Anne E.
collection PubMed
description INTRODUCTION: Climate change is already affecting ecosystems around the world and forcing us to adapt to meet societal needs. The speed with which climate change is progressing necessitates a massive scaling up of the number of species with understood genotype-environment-phenotype (G×E×P) dynamics in order to increase ecosystem and agriculture resilience. An important part of predicting phenotype is understanding the complex gene regulatory networks present in organisms. Previous work has demonstrated that knowledge about one species can be applied to another using ontologically-supported knowledge bases that exploit homologous structures and homologous genes. These types of structures that can apply knowledge about one species to another have the potential to enable the massive scaling up that is needed through in silico experimentation. METHODS: We developed one such structure, a knowledge graph (KG) using information from Planteome and the EMBL-EBI Expression Atlas that connects gene expression, molecular interactions, functions, and pathways to homology-based gene annotations. Our preliminary analysis uses data from gene expression studies in Arabidopsis thaliana and Populus trichocarpa plants exposed to drought conditions. RESULTS: A graph query identified 16 pairs of homologous genes in these two taxa, some of which show opposite patterns of gene expression in response to drought. As expected, analysis of the upstream cis-regulatory region of these genes revealed that homologs with similar expression behavior had conserved cis-regulatory regions and potential interaction with similar trans-elements, unlike homologs that changed their expression in opposite ways. DISCUSSION: This suggests that even though the homologous pairs share common ancestry and functional roles, predicting expression and phenotype through homology inference needs careful consideration of integrating cis and trans-regulatory components in the curated and inferred knowledge graph.
format Online
Article
Text
id pubmed-10298150
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-102981502023-06-28 Using knowledge graphs to infer gene expression in plants Thessen, Anne E. Cooper, Laurel Swetnam, Tyson L. Hegde, Harshad Reese, Justin Elser, Justin Jaiswal, Pankaj Front Artif Intell Artificial Intelligence INTRODUCTION: Climate change is already affecting ecosystems around the world and forcing us to adapt to meet societal needs. The speed with which climate change is progressing necessitates a massive scaling up of the number of species with understood genotype-environment-phenotype (G×E×P) dynamics in order to increase ecosystem and agriculture resilience. An important part of predicting phenotype is understanding the complex gene regulatory networks present in organisms. Previous work has demonstrated that knowledge about one species can be applied to another using ontologically-supported knowledge bases that exploit homologous structures and homologous genes. These types of structures that can apply knowledge about one species to another have the potential to enable the massive scaling up that is needed through in silico experimentation. METHODS: We developed one such structure, a knowledge graph (KG) using information from Planteome and the EMBL-EBI Expression Atlas that connects gene expression, molecular interactions, functions, and pathways to homology-based gene annotations. Our preliminary analysis uses data from gene expression studies in Arabidopsis thaliana and Populus trichocarpa plants exposed to drought conditions. RESULTS: A graph query identified 16 pairs of homologous genes in these two taxa, some of which show opposite patterns of gene expression in response to drought. As expected, analysis of the upstream cis-regulatory region of these genes revealed that homologs with similar expression behavior had conserved cis-regulatory regions and potential interaction with similar trans-elements, unlike homologs that changed their expression in opposite ways. DISCUSSION: This suggests that even though the homologous pairs share common ancestry and functional roles, predicting expression and phenotype through homology inference needs careful consideration of integrating cis and trans-regulatory components in the curated and inferred knowledge graph. Frontiers Media S.A. 2023-06-13 /pmc/articles/PMC10298150/ /pubmed/37384147 http://dx.doi.org/10.3389/frai.2023.1201002 Text en Copyright © 2023 Thessen, Cooper, Swetnam, Hegde, Reese, Elser and Jaiswal. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Artificial Intelligence
Thessen, Anne E.
Cooper, Laurel
Swetnam, Tyson L.
Hegde, Harshad
Reese, Justin
Elser, Justin
Jaiswal, Pankaj
Using knowledge graphs to infer gene expression in plants
title Using knowledge graphs to infer gene expression in plants
title_full Using knowledge graphs to infer gene expression in plants
title_fullStr Using knowledge graphs to infer gene expression in plants
title_full_unstemmed Using knowledge graphs to infer gene expression in plants
title_short Using knowledge graphs to infer gene expression in plants
title_sort using knowledge graphs to infer gene expression in plants
topic Artificial Intelligence
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10298150/
https://www.ncbi.nlm.nih.gov/pubmed/37384147
http://dx.doi.org/10.3389/frai.2023.1201002
work_keys_str_mv AT thessenannee usingknowledgegraphstoinfergeneexpressioninplants
AT cooperlaurel usingknowledgegraphstoinfergeneexpressioninplants
AT swetnamtysonl usingknowledgegraphstoinfergeneexpressioninplants
AT hegdeharshad usingknowledgegraphstoinfergeneexpressioninplants
AT reesejustin usingknowledgegraphstoinfergeneexpressioninplants
AT elserjustin usingknowledgegraphstoinfergeneexpressioninplants
AT jaiswalpankaj usingknowledgegraphstoinfergeneexpressioninplants