Cargando…
Organism-specific training improves performance of linear B-cell epitope prediction
MOTIVATION: In silico identification of linear B-cell epitopes represents an important step in the development of diagnostic tests and vaccine candidates, by providing potential high-probability targets for experimental investigation. Current predictive tools were developed under a generalist approa...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8665745/ https://www.ncbi.nlm.nih.gov/pubmed/34289025 http://dx.doi.org/10.1093/bioinformatics/btab536 |
_version_ | 1784614072586600448 |
---|---|
author | Ashford, Jodie Reis-Cunha, João Lobo, Igor Lobo, Francisco Campelo, Felipe |
author_facet | Ashford, Jodie Reis-Cunha, João Lobo, Igor Lobo, Francisco Campelo, Felipe |
author_sort | Ashford, Jodie |
collection | PubMed |
description | MOTIVATION: In silico identification of linear B-cell epitopes represents an important step in the development of diagnostic tests and vaccine candidates, by providing potential high-probability targets for experimental investigation. Current predictive tools were developed under a generalist approach, training models with heterogeneous datasets to develop predictors that can be deployed for a wide variety of pathogens. However, continuous advances in processing power and the increasing amount of epitope data for a broad range of pathogens indicate that training organism or taxon-specific models may become a feasible alternative, with unexplored potential gains in predictive performance. RESULTS: This article shows how organism-specific training of epitope prediction models can yield substantial performance gains across several quality metrics when compared to models trained with heterogeneous and hybrid data, and with a variety of widely used predictors from the literature. These results suggest a promising alternative for the development of custom-tailored predictive models with high predictive power, which can be easily implemented and deployed for the investigation of specific pathogens. AVAILABILITY AND IMPLEMENTATION: The data underlying this article, as well as the full reproducibility scripts, are available at https://github.com/fcampelo/OrgSpec-paper. The R package that implements the organism-specific pipeline functions is available at https://github.com/fcampelo/epitopes. SUPPLEMENTARY INFORMATION: Supplementary materials are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-8665745 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-86657452021-12-13 Organism-specific training improves performance of linear B-cell epitope prediction Ashford, Jodie Reis-Cunha, João Lobo, Igor Lobo, Francisco Campelo, Felipe Bioinformatics Original Papers MOTIVATION: In silico identification of linear B-cell epitopes represents an important step in the development of diagnostic tests and vaccine candidates, by providing potential high-probability targets for experimental investigation. Current predictive tools were developed under a generalist approach, training models with heterogeneous datasets to develop predictors that can be deployed for a wide variety of pathogens. However, continuous advances in processing power and the increasing amount of epitope data for a broad range of pathogens indicate that training organism or taxon-specific models may become a feasible alternative, with unexplored potential gains in predictive performance. RESULTS: This article shows how organism-specific training of epitope prediction models can yield substantial performance gains across several quality metrics when compared to models trained with heterogeneous and hybrid data, and with a variety of widely used predictors from the literature. These results suggest a promising alternative for the development of custom-tailored predictive models with high predictive power, which can be easily implemented and deployed for the investigation of specific pathogens. AVAILABILITY AND IMPLEMENTATION: The data underlying this article, as well as the full reproducibility scripts, are available at https://github.com/fcampelo/OrgSpec-paper. The R package that implements the organism-specific pipeline functions is available at https://github.com/fcampelo/epitopes. SUPPLEMENTARY INFORMATION: Supplementary materials are available at Bioinformatics online. Oxford University Press 2021-07-21 /pmc/articles/PMC8665745/ /pubmed/34289025 http://dx.doi.org/10.1093/bioinformatics/btab536 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Papers Ashford, Jodie Reis-Cunha, João Lobo, Igor Lobo, Francisco Campelo, Felipe Organism-specific training improves performance of linear B-cell epitope prediction |
title | Organism-specific training improves performance of linear B-cell epitope prediction |
title_full | Organism-specific training improves performance of linear B-cell epitope prediction |
title_fullStr | Organism-specific training improves performance of linear B-cell epitope prediction |
title_full_unstemmed | Organism-specific training improves performance of linear B-cell epitope prediction |
title_short | Organism-specific training improves performance of linear B-cell epitope prediction |
title_sort | organism-specific training improves performance of linear b-cell epitope prediction |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8665745/ https://www.ncbi.nlm.nih.gov/pubmed/34289025 http://dx.doi.org/10.1093/bioinformatics/btab536 |
work_keys_str_mv | AT ashfordjodie organismspecifictrainingimprovesperformanceoflinearbcellepitopeprediction AT reiscunhajoao organismspecifictrainingimprovesperformanceoflinearbcellepitopeprediction AT loboigor organismspecifictrainingimprovesperformanceoflinearbcellepitopeprediction AT lobofrancisco organismspecifictrainingimprovesperformanceoflinearbcellepitopeprediction AT campelofelipe organismspecifictrainingimprovesperformanceoflinearbcellepitopeprediction |