Cargando…

Compilation of parasitic immunogenic proteins from 30 years of published research using machine learning and natural language processing

The World Health Organisation reported in 2020 that six of the top 10 sources of death in low-income countries are parasites. Parasites are microorganisms in a relationship with a larger organism, the host. They acquire all benefits at the host’s expense. A disease develops if the parasitic infectio...

Descripción completa

Detalles Bibliográficos
Autores principales: Goodswen, Stephen J., Kennedy, Paul J., Ellis, John T.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9208253/
https://www.ncbi.nlm.nih.gov/pubmed/35725870
http://dx.doi.org/10.1038/s41598-022-13790-1
_version_ 1784729703178829824
author Goodswen, Stephen J.
Kennedy, Paul J.
Ellis, John T.
author_facet Goodswen, Stephen J.
Kennedy, Paul J.
Ellis, John T.
author_sort Goodswen, Stephen J.
collection PubMed
description The World Health Organisation reported in 2020 that six of the top 10 sources of death in low-income countries are parasites. Parasites are microorganisms in a relationship with a larger organism, the host. They acquire all benefits at the host’s expense. A disease develops if the parasitic infection disrupts normal functioning of the host. This disruption can range from mild to severe, including death. Humans and livestock continue to be challenged by established and emerging infectious disease threats. Vaccination is the most efficient tool for preventing current and future threats. Immunogenic proteins sourced from the disease-causing parasite are worthwhile vaccine components (subunits) due to reliable safety and manufacturing capacity. Publications with ‘subunit vaccine’ in their title have accumulated to thousands over the last three decades. However, there are possibly thousands more reporting immunogenicity results without mentioning ‘subunit’ and/or ‘vaccine’. The exact number is unclear given the non-standardised keywords in publications. The study aim is to identify parasite proteins that induce a protective response in an animal model as reported in the scientific literature within the last 30 years using machine learning and natural language processing. Source code to fulfil this aim and the vaccine candidate list obtained is made available.
format Online
Article
Text
id pubmed-9208253
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-92082532022-06-21 Compilation of parasitic immunogenic proteins from 30 years of published research using machine learning and natural language processing Goodswen, Stephen J. Kennedy, Paul J. Ellis, John T. Sci Rep Article The World Health Organisation reported in 2020 that six of the top 10 sources of death in low-income countries are parasites. Parasites are microorganisms in a relationship with a larger organism, the host. They acquire all benefits at the host’s expense. A disease develops if the parasitic infection disrupts normal functioning of the host. This disruption can range from mild to severe, including death. Humans and livestock continue to be challenged by established and emerging infectious disease threats. Vaccination is the most efficient tool for preventing current and future threats. Immunogenic proteins sourced from the disease-causing parasite are worthwhile vaccine components (subunits) due to reliable safety and manufacturing capacity. Publications with ‘subunit vaccine’ in their title have accumulated to thousands over the last three decades. However, there are possibly thousands more reporting immunogenicity results without mentioning ‘subunit’ and/or ‘vaccine’. The exact number is unclear given the non-standardised keywords in publications. The study aim is to identify parasite proteins that induce a protective response in an animal model as reported in the scientific literature within the last 30 years using machine learning and natural language processing. Source code to fulfil this aim and the vaccine candidate list obtained is made available. Nature Publishing Group UK 2022-06-20 /pmc/articles/PMC9208253/ /pubmed/35725870 http://dx.doi.org/10.1038/s41598-022-13790-1 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Goodswen, Stephen J.
Kennedy, Paul J.
Ellis, John T.
Compilation of parasitic immunogenic proteins from 30 years of published research using machine learning and natural language processing
title Compilation of parasitic immunogenic proteins from 30 years of published research using machine learning and natural language processing
title_full Compilation of parasitic immunogenic proteins from 30 years of published research using machine learning and natural language processing
title_fullStr Compilation of parasitic immunogenic proteins from 30 years of published research using machine learning and natural language processing
title_full_unstemmed Compilation of parasitic immunogenic proteins from 30 years of published research using machine learning and natural language processing
title_short Compilation of parasitic immunogenic proteins from 30 years of published research using machine learning and natural language processing
title_sort compilation of parasitic immunogenic proteins from 30 years of published research using machine learning and natural language processing
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9208253/
https://www.ncbi.nlm.nih.gov/pubmed/35725870
http://dx.doi.org/10.1038/s41598-022-13790-1
work_keys_str_mv AT goodswenstephenj compilationofparasiticimmunogenicproteinsfrom30yearsofpublishedresearchusingmachinelearningandnaturallanguageprocessing
AT kennedypaulj compilationofparasiticimmunogenicproteinsfrom30yearsofpublishedresearchusingmachinelearningandnaturallanguageprocessing
AT ellisjohnt compilationofparasiticimmunogenicproteinsfrom30yearsofpublishedresearchusingmachinelearningandnaturallanguageprocessing