Cargando…
PHILM2Web: A high-throughput database of macromolecular host–pathogen interactions on the Web
During infection, the pathogen’s entry into the host organism, breaching the host immune defense, spread and multiplication are frequently mediated by multiple interactions between the host and pathogen proteins. Systematic studying of host–pathogen interactions (HPIs) is a challenging task for both...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9248916/ https://www.ncbi.nlm.nih.gov/pubmed/35776535 http://dx.doi.org/10.1093/database/baac042 |
_version_ | 1784739458263810048 |
---|---|
author | Le, Tuan-Dung Nguyen, Phuong D Korkin, Dmitry Thieu, Thanh |
author_facet | Le, Tuan-Dung Nguyen, Phuong D Korkin, Dmitry Thieu, Thanh |
author_sort | Le, Tuan-Dung |
collection | PubMed |
description | During infection, the pathogen’s entry into the host organism, breaching the host immune defense, spread and multiplication are frequently mediated by multiple interactions between the host and pathogen proteins. Systematic studying of host–pathogen interactions (HPIs) is a challenging task for both experimental and computational approaches and is critically dependent on the previously obtained knowledge about these interactions found in the biomedical literature. While several HPI databases exist that manually filter HPI protein–protein interactions from the generic databases and curated experimental interactomic studies, no comprehensive database on HPIs obtained from the biomedical literature is currently available. Here, we introduce a high-throughput literature-mining platform for extracting HPI data that includes the most comprehensive to date collection of HPIs obtained from the PubMed abstracts. Our HPI data portal, PHILM2Web (Pathogen–Host Interactions by Literature Mining on the Web), integrates an automatically generated database of interactions extracted by PHILM, our high-precision HPI literature-mining algorithm. Currently, the database contains 23 581 generic HPIs between 157 host and 403 pathogen organisms from 11 609 abstracts. The interactions were obtained from processing 608 972 PubMed abstracts, each containing mentions of at least one host and one pathogen organisms. In response to the coronavirus disease 2019 (COVID-19) pandemic, we also utilized PHILM to process 25 796 PubMed abstracts obtained by the same query as the COVID-19 Open Research Dataset. This COVID-19 processing batch resulted in 257 HPIs between 19 host and 31 pathogen organisms from 167 abstracts. The access to the entire HPI dataset is available via a searchable PHILM2Web interface; scientists can also download the entire database in bulk for offline processing. Database URL: http://philm2web.live |
format | Online Article Text |
id | pubmed-9248916 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-92489162022-07-05 PHILM2Web: A high-throughput database of macromolecular host–pathogen interactions on the Web Le, Tuan-Dung Nguyen, Phuong D Korkin, Dmitry Thieu, Thanh Database (Oxford) Original Article During infection, the pathogen’s entry into the host organism, breaching the host immune defense, spread and multiplication are frequently mediated by multiple interactions between the host and pathogen proteins. Systematic studying of host–pathogen interactions (HPIs) is a challenging task for both experimental and computational approaches and is critically dependent on the previously obtained knowledge about these interactions found in the biomedical literature. While several HPI databases exist that manually filter HPI protein–protein interactions from the generic databases and curated experimental interactomic studies, no comprehensive database on HPIs obtained from the biomedical literature is currently available. Here, we introduce a high-throughput literature-mining platform for extracting HPI data that includes the most comprehensive to date collection of HPIs obtained from the PubMed abstracts. Our HPI data portal, PHILM2Web (Pathogen–Host Interactions by Literature Mining on the Web), integrates an automatically generated database of interactions extracted by PHILM, our high-precision HPI literature-mining algorithm. Currently, the database contains 23 581 generic HPIs between 157 host and 403 pathogen organisms from 11 609 abstracts. The interactions were obtained from processing 608 972 PubMed abstracts, each containing mentions of at least one host and one pathogen organisms. In response to the coronavirus disease 2019 (COVID-19) pandemic, we also utilized PHILM to process 25 796 PubMed abstracts obtained by the same query as the COVID-19 Open Research Dataset. This COVID-19 processing batch resulted in 257 HPIs between 19 host and 31 pathogen organisms from 167 abstracts. The access to the entire HPI dataset is available via a searchable PHILM2Web interface; scientists can also download the entire database in bulk for offline processing. Database URL: http://philm2web.live Oxford University Press 2022-06-30 /pmc/articles/PMC9248916/ /pubmed/35776535 http://dx.doi.org/10.1093/database/baac042 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Original Article Le, Tuan-Dung Nguyen, Phuong D Korkin, Dmitry Thieu, Thanh PHILM2Web: A high-throughput database of macromolecular host–pathogen interactions on the Web |
title | PHILM2Web: A high-throughput database of macromolecular host–pathogen interactions on the Web |
title_full | PHILM2Web: A high-throughput database of macromolecular host–pathogen interactions on the Web |
title_fullStr | PHILM2Web: A high-throughput database of macromolecular host–pathogen interactions on the Web |
title_full_unstemmed | PHILM2Web: A high-throughput database of macromolecular host–pathogen interactions on the Web |
title_short | PHILM2Web: A high-throughput database of macromolecular host–pathogen interactions on the Web |
title_sort | philm2web: a high-throughput database of macromolecular host–pathogen interactions on the web |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9248916/ https://www.ncbi.nlm.nih.gov/pubmed/35776535 http://dx.doi.org/10.1093/database/baac042 |
work_keys_str_mv | AT letuandung philm2webahighthroughputdatabaseofmacromolecularhostpathogeninteractionsontheweb AT nguyenphuongd philm2webahighthroughputdatabaseofmacromolecularhostpathogeninteractionsontheweb AT korkindmitry philm2webahighthroughputdatabaseofmacromolecularhostpathogeninteractionsontheweb AT thieuthanh philm2webahighthroughputdatabaseofmacromolecularhostpathogeninteractionsontheweb |