Cargando…
pathCLIP: Detection of Genes and Gene Relations from Biological Pathway Figures through Image-Text Contrastive Learning
In biomedical literature, biological pathways are commonly described through a combination of images and text. These pathways contain valuable information, including genes and their relationships, which provide insight into biological mechanisms and precision medicine. Curating pathway information a...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10635012/ https://www.ncbi.nlm.nih.gov/pubmed/37961680 http://dx.doi.org/10.1101/2023.10.31.564859 |
_version_ | 1785146275248734208 |
---|---|
author | He, Fei Liu, Kai Yang, Zhiyuan Chen, Yibo Hammer, Richard D. Xu, Dong Popescu, Mihail |
author_facet | He, Fei Liu, Kai Yang, Zhiyuan Chen, Yibo Hammer, Richard D. Xu, Dong Popescu, Mihail |
author_sort | He, Fei |
collection | PubMed |
description | In biomedical literature, biological pathways are commonly described through a combination of images and text. These pathways contain valuable information, including genes and their relationships, which provide insight into biological mechanisms and precision medicine. Curating pathway information across the literature enables the integration of this information to build a comprehensive knowledge base. While some studies have extracted pathway information from images and text independently, they often overlook the correspondence between the two modalities. In this paper, we present a pathway figure curation system named pathCLIP for identifying genes and gene relations from pathway figures. Our key innovation is the use of an image-text contrastive learning model to learn coordinated embeddings of image snippets and text descriptions of genes and gene relations, thereby improving curation. Our validation results, using pathway figures from PubMed, showed that our multimodal model outperforms models using only a single modality. Additionally, our system effectively curates genes and gene relations from multiple literature sources. A case study on extracting pathway information from non-small cell lung cancer literature further demonstrates the usefulness of our curated pathway information in enhancing related pathways in the KEGG database. |
format | Online Article Text |
id | pubmed-10635012 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Cold Spring Harbor Laboratory |
record_format | MEDLINE/PubMed |
spelling | pubmed-106350122023-11-13 pathCLIP: Detection of Genes and Gene Relations from Biological Pathway Figures through Image-Text Contrastive Learning He, Fei Liu, Kai Yang, Zhiyuan Chen, Yibo Hammer, Richard D. Xu, Dong Popescu, Mihail bioRxiv Article In biomedical literature, biological pathways are commonly described through a combination of images and text. These pathways contain valuable information, including genes and their relationships, which provide insight into biological mechanisms and precision medicine. Curating pathway information across the literature enables the integration of this information to build a comprehensive knowledge base. While some studies have extracted pathway information from images and text independently, they often overlook the correspondence between the two modalities. In this paper, we present a pathway figure curation system named pathCLIP for identifying genes and gene relations from pathway figures. Our key innovation is the use of an image-text contrastive learning model to learn coordinated embeddings of image snippets and text descriptions of genes and gene relations, thereby improving curation. Our validation results, using pathway figures from PubMed, showed that our multimodal model outperforms models using only a single modality. Additionally, our system effectively curates genes and gene relations from multiple literature sources. A case study on extracting pathway information from non-small cell lung cancer literature further demonstrates the usefulness of our curated pathway information in enhancing related pathways in the KEGG database. Cold Spring Harbor Laboratory 2023-11-02 /pmc/articles/PMC10635012/ /pubmed/37961680 http://dx.doi.org/10.1101/2023.10.31.564859 Text en https://creativecommons.org/licenses/by-nd/4.0/This work is licensed under a Creative Commons Attribution-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, and only so long as attribution is given to the creator. The license allows for commercial use. |
spellingShingle | Article He, Fei Liu, Kai Yang, Zhiyuan Chen, Yibo Hammer, Richard D. Xu, Dong Popescu, Mihail pathCLIP: Detection of Genes and Gene Relations from Biological Pathway Figures through Image-Text Contrastive Learning |
title | pathCLIP: Detection of Genes and Gene Relations from Biological Pathway Figures through Image-Text Contrastive Learning |
title_full | pathCLIP: Detection of Genes and Gene Relations from Biological Pathway Figures through Image-Text Contrastive Learning |
title_fullStr | pathCLIP: Detection of Genes and Gene Relations from Biological Pathway Figures through Image-Text Contrastive Learning |
title_full_unstemmed | pathCLIP: Detection of Genes and Gene Relations from Biological Pathway Figures through Image-Text Contrastive Learning |
title_short | pathCLIP: Detection of Genes and Gene Relations from Biological Pathway Figures through Image-Text Contrastive Learning |
title_sort | pathclip: detection of genes and gene relations from biological pathway figures through image-text contrastive learning |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10635012/ https://www.ncbi.nlm.nih.gov/pubmed/37961680 http://dx.doi.org/10.1101/2023.10.31.564859 |
work_keys_str_mv | AT hefei pathclipdetectionofgenesandgenerelationsfrombiologicalpathwayfiguresthroughimagetextcontrastivelearning AT liukai pathclipdetectionofgenesandgenerelationsfrombiologicalpathwayfiguresthroughimagetextcontrastivelearning AT yangzhiyuan pathclipdetectionofgenesandgenerelationsfrombiologicalpathwayfiguresthroughimagetextcontrastivelearning AT chenyibo pathclipdetectionofgenesandgenerelationsfrombiologicalpathwayfiguresthroughimagetextcontrastivelearning AT hammerrichardd pathclipdetectionofgenesandgenerelationsfrombiologicalpathwayfiguresthroughimagetextcontrastivelearning AT xudong pathclipdetectionofgenesandgenerelationsfrombiologicalpathwayfiguresthroughimagetextcontrastivelearning AT popescumihail pathclipdetectionofgenesandgenerelationsfrombiologicalpathwayfiguresthroughimagetextcontrastivelearning |