Cargando…
Using knowledge graphs to infer gene expression in plants
INTRODUCTION: Climate change is already affecting ecosystems around the world and forcing us to adapt to meet societal needs. The speed with which climate change is progressing necessitates a massive scaling up of the number of species with understood genotype-environment-phenotype (G×E×P) dynamics...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10298150/ https://www.ncbi.nlm.nih.gov/pubmed/37384147 http://dx.doi.org/10.3389/frai.2023.1201002 |
_version_ | 1785064044730777600 |
---|---|
author | Thessen, Anne E. Cooper, Laurel Swetnam, Tyson L. Hegde, Harshad Reese, Justin Elser, Justin Jaiswal, Pankaj |
author_facet | Thessen, Anne E. Cooper, Laurel Swetnam, Tyson L. Hegde, Harshad Reese, Justin Elser, Justin Jaiswal, Pankaj |
author_sort | Thessen, Anne E. |
collection | PubMed |
description | INTRODUCTION: Climate change is already affecting ecosystems around the world and forcing us to adapt to meet societal needs. The speed with which climate change is progressing necessitates a massive scaling up of the number of species with understood genotype-environment-phenotype (G×E×P) dynamics in order to increase ecosystem and agriculture resilience. An important part of predicting phenotype is understanding the complex gene regulatory networks present in organisms. Previous work has demonstrated that knowledge about one species can be applied to another using ontologically-supported knowledge bases that exploit homologous structures and homologous genes. These types of structures that can apply knowledge about one species to another have the potential to enable the massive scaling up that is needed through in silico experimentation. METHODS: We developed one such structure, a knowledge graph (KG) using information from Planteome and the EMBL-EBI Expression Atlas that connects gene expression, molecular interactions, functions, and pathways to homology-based gene annotations. Our preliminary analysis uses data from gene expression studies in Arabidopsis thaliana and Populus trichocarpa plants exposed to drought conditions. RESULTS: A graph query identified 16 pairs of homologous genes in these two taxa, some of which show opposite patterns of gene expression in response to drought. As expected, analysis of the upstream cis-regulatory region of these genes revealed that homologs with similar expression behavior had conserved cis-regulatory regions and potential interaction with similar trans-elements, unlike homologs that changed their expression in opposite ways. DISCUSSION: This suggests that even though the homologous pairs share common ancestry and functional roles, predicting expression and phenotype through homology inference needs careful consideration of integrating cis and trans-regulatory components in the curated and inferred knowledge graph. |
format | Online Article Text |
id | pubmed-10298150 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-102981502023-06-28 Using knowledge graphs to infer gene expression in plants Thessen, Anne E. Cooper, Laurel Swetnam, Tyson L. Hegde, Harshad Reese, Justin Elser, Justin Jaiswal, Pankaj Front Artif Intell Artificial Intelligence INTRODUCTION: Climate change is already affecting ecosystems around the world and forcing us to adapt to meet societal needs. The speed with which climate change is progressing necessitates a massive scaling up of the number of species with understood genotype-environment-phenotype (G×E×P) dynamics in order to increase ecosystem and agriculture resilience. An important part of predicting phenotype is understanding the complex gene regulatory networks present in organisms. Previous work has demonstrated that knowledge about one species can be applied to another using ontologically-supported knowledge bases that exploit homologous structures and homologous genes. These types of structures that can apply knowledge about one species to another have the potential to enable the massive scaling up that is needed through in silico experimentation. METHODS: We developed one such structure, a knowledge graph (KG) using information from Planteome and the EMBL-EBI Expression Atlas that connects gene expression, molecular interactions, functions, and pathways to homology-based gene annotations. Our preliminary analysis uses data from gene expression studies in Arabidopsis thaliana and Populus trichocarpa plants exposed to drought conditions. RESULTS: A graph query identified 16 pairs of homologous genes in these two taxa, some of which show opposite patterns of gene expression in response to drought. As expected, analysis of the upstream cis-regulatory region of these genes revealed that homologs with similar expression behavior had conserved cis-regulatory regions and potential interaction with similar trans-elements, unlike homologs that changed their expression in opposite ways. DISCUSSION: This suggests that even though the homologous pairs share common ancestry and functional roles, predicting expression and phenotype through homology inference needs careful consideration of integrating cis and trans-regulatory components in the curated and inferred knowledge graph. Frontiers Media S.A. 2023-06-13 /pmc/articles/PMC10298150/ /pubmed/37384147 http://dx.doi.org/10.3389/frai.2023.1201002 Text en Copyright © 2023 Thessen, Cooper, Swetnam, Hegde, Reese, Elser and Jaiswal. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Artificial Intelligence Thessen, Anne E. Cooper, Laurel Swetnam, Tyson L. Hegde, Harshad Reese, Justin Elser, Justin Jaiswal, Pankaj Using knowledge graphs to infer gene expression in plants |
title | Using knowledge graphs to infer gene expression in plants |
title_full | Using knowledge graphs to infer gene expression in plants |
title_fullStr | Using knowledge graphs to infer gene expression in plants |
title_full_unstemmed | Using knowledge graphs to infer gene expression in plants |
title_short | Using knowledge graphs to infer gene expression in plants |
title_sort | using knowledge graphs to infer gene expression in plants |
topic | Artificial Intelligence |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10298150/ https://www.ncbi.nlm.nih.gov/pubmed/37384147 http://dx.doi.org/10.3389/frai.2023.1201002 |
work_keys_str_mv | AT thessenannee usingknowledgegraphstoinfergeneexpressioninplants AT cooperlaurel usingknowledgegraphstoinfergeneexpressioninplants AT swetnamtysonl usingknowledgegraphstoinfergeneexpressioninplants AT hegdeharshad usingknowledgegraphstoinfergeneexpressioninplants AT reesejustin usingknowledgegraphstoinfergeneexpressioninplants AT elserjustin usingknowledgegraphstoinfergeneexpressioninplants AT jaiswalpankaj usingknowledgegraphstoinfergeneexpressioninplants |