Cargando…

Benchmarking Datasets from Malaria Cytotoxic T-cell Epitopes Using Machine Learning Approach

BACKGROUND: Epitope prediction remains a major challenge in malaria due to the unique parasite biology, in addition to rapidly evolving parasite sequence variation in Plasmodium species. Although several models for epitope prediction exist, they are not useful in Plasmodium specific epitope developm...

Descripción completa

Detalles Bibliográficos
Autor principal: Adiga, Rama
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Avicenna Research Institute 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8112139/
https://www.ncbi.nlm.nih.gov/pubmed/34012524
http://dx.doi.org/10.18502/ajmb.v13i2.5527
_version_ 1783690631657291776
author Adiga, Rama
author_facet Adiga, Rama
author_sort Adiga, Rama
collection PubMed
description BACKGROUND: Epitope prediction remains a major challenge in malaria due to the unique parasite biology, in addition to rapidly evolving parasite sequence variation in Plasmodium species. Although several models for epitope prediction exist, they are not useful in Plasmodium specific epitope development. Hence, it was proposed to use machine learning based methods to develop a peptide sequence based epitope predictor specific for malaria. METHODS: Model datasets were developed and performance was tested using various machine learning algorithms. Machine learning classifiers were trained on epitope data using sequence features and comparison of amino acid physicochemical properties was done to yield a valid prediction model. RESULTS: The findings from the analysis reveal that the model developed using selected classifiers after preprocessing by Waikato Environment for Knowledge Analysis (WEKA) performed better than other methods. The datasets for benchmarks of performance are deposited in the repository https://github.com/githubramaadiga/epitope_dataset . CONCLUSION: The study is the first in-silico study on benchmarking Plasmodium cytotoxic T cell epitope datasets using machine learning approach. The peptide based predictors have been used for the first time to classify cytotoxic T cell epitopes in malaria. Algorithms has been evaluated using real datasets from malaria to obtain the model.
format Online
Article
Text
id pubmed-8112139
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Avicenna Research Institute
record_format MEDLINE/PubMed
spelling pubmed-81121392021-05-18 Benchmarking Datasets from Malaria Cytotoxic T-cell Epitopes Using Machine Learning Approach Adiga, Rama Avicenna J Med Biotechnol Original Article BACKGROUND: Epitope prediction remains a major challenge in malaria due to the unique parasite biology, in addition to rapidly evolving parasite sequence variation in Plasmodium species. Although several models for epitope prediction exist, they are not useful in Plasmodium specific epitope development. Hence, it was proposed to use machine learning based methods to develop a peptide sequence based epitope predictor specific for malaria. METHODS: Model datasets were developed and performance was tested using various machine learning algorithms. Machine learning classifiers were trained on epitope data using sequence features and comparison of amino acid physicochemical properties was done to yield a valid prediction model. RESULTS: The findings from the analysis reveal that the model developed using selected classifiers after preprocessing by Waikato Environment for Knowledge Analysis (WEKA) performed better than other methods. The datasets for benchmarks of performance are deposited in the repository https://github.com/githubramaadiga/epitope_dataset . CONCLUSION: The study is the first in-silico study on benchmarking Plasmodium cytotoxic T cell epitope datasets using machine learning approach. The peptide based predictors have been used for the first time to classify cytotoxic T cell epitopes in malaria. Algorithms has been evaluated using real datasets from malaria to obtain the model. Avicenna Research Institute 2021 /pmc/articles/PMC8112139/ /pubmed/34012524 http://dx.doi.org/10.18502/ajmb.v13i2.5527 Text en Copyright© 2021 Avicenna Research Institute https://creativecommons.org/licenses/by-nc/4.0/This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/)
spellingShingle Original Article
Adiga, Rama
Benchmarking Datasets from Malaria Cytotoxic T-cell Epitopes Using Machine Learning Approach
title Benchmarking Datasets from Malaria Cytotoxic T-cell Epitopes Using Machine Learning Approach
title_full Benchmarking Datasets from Malaria Cytotoxic T-cell Epitopes Using Machine Learning Approach
title_fullStr Benchmarking Datasets from Malaria Cytotoxic T-cell Epitopes Using Machine Learning Approach
title_full_unstemmed Benchmarking Datasets from Malaria Cytotoxic T-cell Epitopes Using Machine Learning Approach
title_short Benchmarking Datasets from Malaria Cytotoxic T-cell Epitopes Using Machine Learning Approach
title_sort benchmarking datasets from malaria cytotoxic t-cell epitopes using machine learning approach
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8112139/
https://www.ncbi.nlm.nih.gov/pubmed/34012524
http://dx.doi.org/10.18502/ajmb.v13i2.5527
work_keys_str_mv AT adigarama benchmarkingdatasetsfrommalariacytotoxictcellepitopesusingmachinelearningapproach