Cargando…

Predicting cross-tissue hormone–gene relations using balanced word embeddings

MOTIVATION: Inter-organ/inter-tissue communication is central to multi-cellular organisms including humans, and mapping inter-tissue interactions can advance system-level whole-body modeling efforts. Large volumes of biomedical literature have fostered studies that map within-tissue or tissue-agnost...

Descripción completa

Detalles Bibliográficos
Autores principales: Jadhav, Aditya, Kumar, Tarun, Raghavendra, Mohit, Loganathan, Tamizhini, Narayanan, Manikandan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9563690/
https://www.ncbi.nlm.nih.gov/pubmed/36000859
http://dx.doi.org/10.1093/bioinformatics/btac578
_version_ 1784808463980822528
author Jadhav, Aditya
Kumar, Tarun
Raghavendra, Mohit
Loganathan, Tamizhini
Narayanan, Manikandan
author_facet Jadhav, Aditya
Kumar, Tarun
Raghavendra, Mohit
Loganathan, Tamizhini
Narayanan, Manikandan
author_sort Jadhav, Aditya
collection PubMed
description MOTIVATION: Inter-organ/inter-tissue communication is central to multi-cellular organisms including humans, and mapping inter-tissue interactions can advance system-level whole-body modeling efforts. Large volumes of biomedical literature have fostered studies that map within-tissue or tissue-agnostic interactions, but literature-mining studies that infer inter-tissue relations, such as between hormones and genes are solely missing. RESULTS: We present a first study to predict from biomedical literature the hormone–gene associations mediating inter-tissue signaling in the human body. Our BioEmbedS* models use neural network-based Biomedical word Embeddings with a Support Vector Machine classifier to predict if a hormone–gene pair is associated or not, and whether an associated gene is involved in the hormone’s production or response. Model training relies on our unified dataset Hormone-Gene version 1 of ground-truth associations between genes and endocrine hormones, which we compiled and carefully balanced in the embedded space to handle data disparities, such as between poorly- versus well-studied hormones. Our BioEmbedS model recapitulates known gene mediators of tissue–tissue signaling with 70.4% accuracy; predicts novel inter-tissue communication genes in humans, which are enriched for hormone-related disorders; and generalizes well to mouse, thereby holding promise for its extension to other multi-cellular organisms as well. AVAILABILITY AND IMPLEMENTATION: Freely available at https://cross-tissue-signaling.herokuapp.com are our model predictions & datasets; https://github.com/BIRDSgroup/BioEmbedS has all relevant code. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-9563690
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-95636902022-10-18 Predicting cross-tissue hormone–gene relations using balanced word embeddings Jadhav, Aditya Kumar, Tarun Raghavendra, Mohit Loganathan, Tamizhini Narayanan, Manikandan Bioinformatics Original Papers MOTIVATION: Inter-organ/inter-tissue communication is central to multi-cellular organisms including humans, and mapping inter-tissue interactions can advance system-level whole-body modeling efforts. Large volumes of biomedical literature have fostered studies that map within-tissue or tissue-agnostic interactions, but literature-mining studies that infer inter-tissue relations, such as between hormones and genes are solely missing. RESULTS: We present a first study to predict from biomedical literature the hormone–gene associations mediating inter-tissue signaling in the human body. Our BioEmbedS* models use neural network-based Biomedical word Embeddings with a Support Vector Machine classifier to predict if a hormone–gene pair is associated or not, and whether an associated gene is involved in the hormone’s production or response. Model training relies on our unified dataset Hormone-Gene version 1 of ground-truth associations between genes and endocrine hormones, which we compiled and carefully balanced in the embedded space to handle data disparities, such as between poorly- versus well-studied hormones. Our BioEmbedS model recapitulates known gene mediators of tissue–tissue signaling with 70.4% accuracy; predicts novel inter-tissue communication genes in humans, which are enriched for hormone-related disorders; and generalizes well to mouse, thereby holding promise for its extension to other multi-cellular organisms as well. AVAILABILITY AND IMPLEMENTATION: Freely available at https://cross-tissue-signaling.herokuapp.com are our model predictions & datasets; https://github.com/BIRDSgroup/BioEmbedS has all relevant code. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2022-08-24 /pmc/articles/PMC9563690/ /pubmed/36000859 http://dx.doi.org/10.1093/bioinformatics/btac578 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Jadhav, Aditya
Kumar, Tarun
Raghavendra, Mohit
Loganathan, Tamizhini
Narayanan, Manikandan
Predicting cross-tissue hormone–gene relations using balanced word embeddings
title Predicting cross-tissue hormone–gene relations using balanced word embeddings
title_full Predicting cross-tissue hormone–gene relations using balanced word embeddings
title_fullStr Predicting cross-tissue hormone–gene relations using balanced word embeddings
title_full_unstemmed Predicting cross-tissue hormone–gene relations using balanced word embeddings
title_short Predicting cross-tissue hormone–gene relations using balanced word embeddings
title_sort predicting cross-tissue hormone–gene relations using balanced word embeddings
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9563690/
https://www.ncbi.nlm.nih.gov/pubmed/36000859
http://dx.doi.org/10.1093/bioinformatics/btac578
work_keys_str_mv AT jadhavaditya predictingcrosstissuehormonegenerelationsusingbalancedwordembeddings
AT kumartarun predictingcrosstissuehormonegenerelationsusingbalancedwordembeddings
AT raghavendramohit predictingcrosstissuehormonegenerelationsusingbalancedwordembeddings
AT loganathantamizhini predictingcrosstissuehormonegenerelationsusingbalancedwordembeddings
AT narayananmanikandan predictingcrosstissuehormonegenerelationsusingbalancedwordembeddings