Cargando…
Predicting cross-tissue hormone–gene relations using balanced word embeddings
MOTIVATION: Inter-organ/inter-tissue communication is central to multi-cellular organisms including humans, and mapping inter-tissue interactions can advance system-level whole-body modeling efforts. Large volumes of biomedical literature have fostered studies that map within-tissue or tissue-agnost...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9563690/ https://www.ncbi.nlm.nih.gov/pubmed/36000859 http://dx.doi.org/10.1093/bioinformatics/btac578 |
_version_ | 1784808463980822528 |
---|---|
author | Jadhav, Aditya Kumar, Tarun Raghavendra, Mohit Loganathan, Tamizhini Narayanan, Manikandan |
author_facet | Jadhav, Aditya Kumar, Tarun Raghavendra, Mohit Loganathan, Tamizhini Narayanan, Manikandan |
author_sort | Jadhav, Aditya |
collection | PubMed |
description | MOTIVATION: Inter-organ/inter-tissue communication is central to multi-cellular organisms including humans, and mapping inter-tissue interactions can advance system-level whole-body modeling efforts. Large volumes of biomedical literature have fostered studies that map within-tissue or tissue-agnostic interactions, but literature-mining studies that infer inter-tissue relations, such as between hormones and genes are solely missing. RESULTS: We present a first study to predict from biomedical literature the hormone–gene associations mediating inter-tissue signaling in the human body. Our BioEmbedS* models use neural network-based Biomedical word Embeddings with a Support Vector Machine classifier to predict if a hormone–gene pair is associated or not, and whether an associated gene is involved in the hormone’s production or response. Model training relies on our unified dataset Hormone-Gene version 1 of ground-truth associations between genes and endocrine hormones, which we compiled and carefully balanced in the embedded space to handle data disparities, such as between poorly- versus well-studied hormones. Our BioEmbedS model recapitulates known gene mediators of tissue–tissue signaling with 70.4% accuracy; predicts novel inter-tissue communication genes in humans, which are enriched for hormone-related disorders; and generalizes well to mouse, thereby holding promise for its extension to other multi-cellular organisms as well. AVAILABILITY AND IMPLEMENTATION: Freely available at https://cross-tissue-signaling.herokuapp.com are our model predictions & datasets; https://github.com/BIRDSgroup/BioEmbedS has all relevant code. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-9563690 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-95636902022-10-18 Predicting cross-tissue hormone–gene relations using balanced word embeddings Jadhav, Aditya Kumar, Tarun Raghavendra, Mohit Loganathan, Tamizhini Narayanan, Manikandan Bioinformatics Original Papers MOTIVATION: Inter-organ/inter-tissue communication is central to multi-cellular organisms including humans, and mapping inter-tissue interactions can advance system-level whole-body modeling efforts. Large volumes of biomedical literature have fostered studies that map within-tissue or tissue-agnostic interactions, but literature-mining studies that infer inter-tissue relations, such as between hormones and genes are solely missing. RESULTS: We present a first study to predict from biomedical literature the hormone–gene associations mediating inter-tissue signaling in the human body. Our BioEmbedS* models use neural network-based Biomedical word Embeddings with a Support Vector Machine classifier to predict if a hormone–gene pair is associated or not, and whether an associated gene is involved in the hormone’s production or response. Model training relies on our unified dataset Hormone-Gene version 1 of ground-truth associations between genes and endocrine hormones, which we compiled and carefully balanced in the embedded space to handle data disparities, such as between poorly- versus well-studied hormones. Our BioEmbedS model recapitulates known gene mediators of tissue–tissue signaling with 70.4% accuracy; predicts novel inter-tissue communication genes in humans, which are enriched for hormone-related disorders; and generalizes well to mouse, thereby holding promise for its extension to other multi-cellular organisms as well. AVAILABILITY AND IMPLEMENTATION: Freely available at https://cross-tissue-signaling.herokuapp.com are our model predictions & datasets; https://github.com/BIRDSgroup/BioEmbedS has all relevant code. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2022-08-24 /pmc/articles/PMC9563690/ /pubmed/36000859 http://dx.doi.org/10.1093/bioinformatics/btac578 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Papers Jadhav, Aditya Kumar, Tarun Raghavendra, Mohit Loganathan, Tamizhini Narayanan, Manikandan Predicting cross-tissue hormone–gene relations using balanced word embeddings |
title | Predicting cross-tissue hormone–gene relations using balanced word embeddings |
title_full | Predicting cross-tissue hormone–gene relations using balanced word embeddings |
title_fullStr | Predicting cross-tissue hormone–gene relations using balanced word embeddings |
title_full_unstemmed | Predicting cross-tissue hormone–gene relations using balanced word embeddings |
title_short | Predicting cross-tissue hormone–gene relations using balanced word embeddings |
title_sort | predicting cross-tissue hormone–gene relations using balanced word embeddings |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9563690/ https://www.ncbi.nlm.nih.gov/pubmed/36000859 http://dx.doi.org/10.1093/bioinformatics/btac578 |
work_keys_str_mv | AT jadhavaditya predictingcrosstissuehormonegenerelationsusingbalancedwordembeddings AT kumartarun predictingcrosstissuehormonegenerelationsusingbalancedwordembeddings AT raghavendramohit predictingcrosstissuehormonegenerelationsusingbalancedwordembeddings AT loganathantamizhini predictingcrosstissuehormonegenerelationsusingbalancedwordembeddings AT narayananmanikandan predictingcrosstissuehormonegenerelationsusingbalancedwordembeddings |