Cargando…

pathCLIP: Detection of Genes and Gene Relations from Biological Pathway Figures through Image-Text Contrastive Learning

In biomedical literature, biological pathways are commonly described through a combination of images and text. These pathways contain valuable information, including genes and their relationships, which provide insight into biological mechanisms and precision medicine. Curating pathway information a...

Descripción completa

Detalles Bibliográficos
Autores principales: He, Fei, Liu, Kai, Yang, Zhiyuan, Chen, Yibo, Hammer, Richard D., Xu, Dong, Popescu, Mihail
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10635012/
https://www.ncbi.nlm.nih.gov/pubmed/37961680
http://dx.doi.org/10.1101/2023.10.31.564859
_version_ 1785146275248734208
author He, Fei
Liu, Kai
Yang, Zhiyuan
Chen, Yibo
Hammer, Richard D.
Xu, Dong
Popescu, Mihail
author_facet He, Fei
Liu, Kai
Yang, Zhiyuan
Chen, Yibo
Hammer, Richard D.
Xu, Dong
Popescu, Mihail
author_sort He, Fei
collection PubMed
description In biomedical literature, biological pathways are commonly described through a combination of images and text. These pathways contain valuable information, including genes and their relationships, which provide insight into biological mechanisms and precision medicine. Curating pathway information across the literature enables the integration of this information to build a comprehensive knowledge base. While some studies have extracted pathway information from images and text independently, they often overlook the correspondence between the two modalities. In this paper, we present a pathway figure curation system named pathCLIP for identifying genes and gene relations from pathway figures. Our key innovation is the use of an image-text contrastive learning model to learn coordinated embeddings of image snippets and text descriptions of genes and gene relations, thereby improving curation. Our validation results, using pathway figures from PubMed, showed that our multimodal model outperforms models using only a single modality. Additionally, our system effectively curates genes and gene relations from multiple literature sources. A case study on extracting pathway information from non-small cell lung cancer literature further demonstrates the usefulness of our curated pathway information in enhancing related pathways in the KEGG database.
format Online
Article
Text
id pubmed-10635012
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-106350122023-11-13 pathCLIP: Detection of Genes and Gene Relations from Biological Pathway Figures through Image-Text Contrastive Learning He, Fei Liu, Kai Yang, Zhiyuan Chen, Yibo Hammer, Richard D. Xu, Dong Popescu, Mihail bioRxiv Article In biomedical literature, biological pathways are commonly described through a combination of images and text. These pathways contain valuable information, including genes and their relationships, which provide insight into biological mechanisms and precision medicine. Curating pathway information across the literature enables the integration of this information to build a comprehensive knowledge base. While some studies have extracted pathway information from images and text independently, they often overlook the correspondence between the two modalities. In this paper, we present a pathway figure curation system named pathCLIP for identifying genes and gene relations from pathway figures. Our key innovation is the use of an image-text contrastive learning model to learn coordinated embeddings of image snippets and text descriptions of genes and gene relations, thereby improving curation. Our validation results, using pathway figures from PubMed, showed that our multimodal model outperforms models using only a single modality. Additionally, our system effectively curates genes and gene relations from multiple literature sources. A case study on extracting pathway information from non-small cell lung cancer literature further demonstrates the usefulness of our curated pathway information in enhancing related pathways in the KEGG database. Cold Spring Harbor Laboratory 2023-11-02 /pmc/articles/PMC10635012/ /pubmed/37961680 http://dx.doi.org/10.1101/2023.10.31.564859 Text en https://creativecommons.org/licenses/by-nd/4.0/This work is licensed under a Creative Commons Attribution-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, and only so long as attribution is given to the creator. The license allows for commercial use.
spellingShingle Article
He, Fei
Liu, Kai
Yang, Zhiyuan
Chen, Yibo
Hammer, Richard D.
Xu, Dong
Popescu, Mihail
pathCLIP: Detection of Genes and Gene Relations from Biological Pathway Figures through Image-Text Contrastive Learning
title pathCLIP: Detection of Genes and Gene Relations from Biological Pathway Figures through Image-Text Contrastive Learning
title_full pathCLIP: Detection of Genes and Gene Relations from Biological Pathway Figures through Image-Text Contrastive Learning
title_fullStr pathCLIP: Detection of Genes and Gene Relations from Biological Pathway Figures through Image-Text Contrastive Learning
title_full_unstemmed pathCLIP: Detection of Genes and Gene Relations from Biological Pathway Figures through Image-Text Contrastive Learning
title_short pathCLIP: Detection of Genes and Gene Relations from Biological Pathway Figures through Image-Text Contrastive Learning
title_sort pathclip: detection of genes and gene relations from biological pathway figures through image-text contrastive learning
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10635012/
https://www.ncbi.nlm.nih.gov/pubmed/37961680
http://dx.doi.org/10.1101/2023.10.31.564859
work_keys_str_mv AT hefei pathclipdetectionofgenesandgenerelationsfrombiologicalpathwayfiguresthroughimagetextcontrastivelearning
AT liukai pathclipdetectionofgenesandgenerelationsfrombiologicalpathwayfiguresthroughimagetextcontrastivelearning
AT yangzhiyuan pathclipdetectionofgenesandgenerelationsfrombiologicalpathwayfiguresthroughimagetextcontrastivelearning
AT chenyibo pathclipdetectionofgenesandgenerelationsfrombiologicalpathwayfiguresthroughimagetextcontrastivelearning
AT hammerrichardd pathclipdetectionofgenesandgenerelationsfrombiologicalpathwayfiguresthroughimagetextcontrastivelearning
AT xudong pathclipdetectionofgenesandgenerelationsfrombiologicalpathwayfiguresthroughimagetextcontrastivelearning
AT popescumihail pathclipdetectionofgenesandgenerelationsfrombiologicalpathwayfiguresthroughimagetextcontrastivelearning