Cargando…
Benchmarking Datasets from Malaria Cytotoxic T-cell Epitopes Using Machine Learning Approach
BACKGROUND: Epitope prediction remains a major challenge in malaria due to the unique parasite biology, in addition to rapidly evolving parasite sequence variation in Plasmodium species. Although several models for epitope prediction exist, they are not useful in Plasmodium specific epitope developm...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Avicenna Research Institute
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8112139/ https://www.ncbi.nlm.nih.gov/pubmed/34012524 http://dx.doi.org/10.18502/ajmb.v13i2.5527 |
_version_ | 1783690631657291776 |
---|---|
author | Adiga, Rama |
author_facet | Adiga, Rama |
author_sort | Adiga, Rama |
collection | PubMed |
description | BACKGROUND: Epitope prediction remains a major challenge in malaria due to the unique parasite biology, in addition to rapidly evolving parasite sequence variation in Plasmodium species. Although several models for epitope prediction exist, they are not useful in Plasmodium specific epitope development. Hence, it was proposed to use machine learning based methods to develop a peptide sequence based epitope predictor specific for malaria. METHODS: Model datasets were developed and performance was tested using various machine learning algorithms. Machine learning classifiers were trained on epitope data using sequence features and comparison of amino acid physicochemical properties was done to yield a valid prediction model. RESULTS: The findings from the analysis reveal that the model developed using selected classifiers after preprocessing by Waikato Environment for Knowledge Analysis (WEKA) performed better than other methods. The datasets for benchmarks of performance are deposited in the repository https://github.com/githubramaadiga/epitope_dataset . CONCLUSION: The study is the first in-silico study on benchmarking Plasmodium cytotoxic T cell epitope datasets using machine learning approach. The peptide based predictors have been used for the first time to classify cytotoxic T cell epitopes in malaria. Algorithms has been evaluated using real datasets from malaria to obtain the model. |
format | Online Article Text |
id | pubmed-8112139 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Avicenna Research Institute |
record_format | MEDLINE/PubMed |
spelling | pubmed-81121392021-05-18 Benchmarking Datasets from Malaria Cytotoxic T-cell Epitopes Using Machine Learning Approach Adiga, Rama Avicenna J Med Biotechnol Original Article BACKGROUND: Epitope prediction remains a major challenge in malaria due to the unique parasite biology, in addition to rapidly evolving parasite sequence variation in Plasmodium species. Although several models for epitope prediction exist, they are not useful in Plasmodium specific epitope development. Hence, it was proposed to use machine learning based methods to develop a peptide sequence based epitope predictor specific for malaria. METHODS: Model datasets were developed and performance was tested using various machine learning algorithms. Machine learning classifiers were trained on epitope data using sequence features and comparison of amino acid physicochemical properties was done to yield a valid prediction model. RESULTS: The findings from the analysis reveal that the model developed using selected classifiers after preprocessing by Waikato Environment for Knowledge Analysis (WEKA) performed better than other methods. The datasets for benchmarks of performance are deposited in the repository https://github.com/githubramaadiga/epitope_dataset . CONCLUSION: The study is the first in-silico study on benchmarking Plasmodium cytotoxic T cell epitope datasets using machine learning approach. The peptide based predictors have been used for the first time to classify cytotoxic T cell epitopes in malaria. Algorithms has been evaluated using real datasets from malaria to obtain the model. Avicenna Research Institute 2021 /pmc/articles/PMC8112139/ /pubmed/34012524 http://dx.doi.org/10.18502/ajmb.v13i2.5527 Text en Copyright© 2021 Avicenna Research Institute https://creativecommons.org/licenses/by-nc/4.0/This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) |
spellingShingle | Original Article Adiga, Rama Benchmarking Datasets from Malaria Cytotoxic T-cell Epitopes Using Machine Learning Approach |
title | Benchmarking Datasets from Malaria Cytotoxic T-cell Epitopes Using Machine Learning Approach |
title_full | Benchmarking Datasets from Malaria Cytotoxic T-cell Epitopes Using Machine Learning Approach |
title_fullStr | Benchmarking Datasets from Malaria Cytotoxic T-cell Epitopes Using Machine Learning Approach |
title_full_unstemmed | Benchmarking Datasets from Malaria Cytotoxic T-cell Epitopes Using Machine Learning Approach |
title_short | Benchmarking Datasets from Malaria Cytotoxic T-cell Epitopes Using Machine Learning Approach |
title_sort | benchmarking datasets from malaria cytotoxic t-cell epitopes using machine learning approach |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8112139/ https://www.ncbi.nlm.nih.gov/pubmed/34012524 http://dx.doi.org/10.18502/ajmb.v13i2.5527 |
work_keys_str_mv | AT adigarama benchmarkingdatasetsfrommalariacytotoxictcellepitopesusingmachinelearningapproach |