Cargando…

PredNTS: Improved and Robust Prediction of Nitrotyrosine Sites by Integrating Multiple Sequence Features

Nitrotyrosine, which is generated by numerous reactive nitrogen species, is a type of protein post-translational modification. Identification of site-specific nitration modification on tyrosine is a prerequisite to understanding the molecular function of nitrated proteins. Thanks to the progress of...

Descripción completa

Detalles Bibliográficos
Autores principales: Nilamyani, Andi Nur, Auliah, Firda Nurul, Moni, Mohammad Ali, Shoombuatong, Watshara, Hasan, Md Mehedi, Kurata, Hiroyuki
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7962192/
https://www.ncbi.nlm.nih.gov/pubmed/33800121
http://dx.doi.org/10.3390/ijms22052704
_version_ 1783665421049659392
author Nilamyani, Andi Nur
Auliah, Firda Nurul
Moni, Mohammad Ali
Shoombuatong, Watshara
Hasan, Md Mehedi
Kurata, Hiroyuki
author_facet Nilamyani, Andi Nur
Auliah, Firda Nurul
Moni, Mohammad Ali
Shoombuatong, Watshara
Hasan, Md Mehedi
Kurata, Hiroyuki
author_sort Nilamyani, Andi Nur
collection PubMed
description Nitrotyrosine, which is generated by numerous reactive nitrogen species, is a type of protein post-translational modification. Identification of site-specific nitration modification on tyrosine is a prerequisite to understanding the molecular function of nitrated proteins. Thanks to the progress of machine learning, computational prediction can play a vital role before the biological experimentation. Herein, we developed a computational predictor PredNTS by integrating multiple sequence features including K-mer, composition of k-spaced amino acid pairs (CKSAAP), AAindex, and binary encoding schemes. The important features were selected by the recursive feature elimination approach using a random forest classifier. Finally, we linearly combined the successive random forest (RF) probability scores generated by the different, single encoding-employing RF models. The resultant PredNTS predictor achieved an area under a curve (AUC) of 0.910 using five-fold cross validation. It outperformed the existing predictors on a comprehensive and independent dataset. Furthermore, we investigated several machine learning algorithms to demonstrate the superiority of the employed RF algorithm. The PredNTS is a useful computational resource for the prediction of nitrotyrosine sites. The web-application with the curated datasets of the PredNTS is publicly available.
format Online
Article
Text
id pubmed-7962192
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-79621922021-03-17 PredNTS: Improved and Robust Prediction of Nitrotyrosine Sites by Integrating Multiple Sequence Features Nilamyani, Andi Nur Auliah, Firda Nurul Moni, Mohammad Ali Shoombuatong, Watshara Hasan, Md Mehedi Kurata, Hiroyuki Int J Mol Sci Article Nitrotyrosine, which is generated by numerous reactive nitrogen species, is a type of protein post-translational modification. Identification of site-specific nitration modification on tyrosine is a prerequisite to understanding the molecular function of nitrated proteins. Thanks to the progress of machine learning, computational prediction can play a vital role before the biological experimentation. Herein, we developed a computational predictor PredNTS by integrating multiple sequence features including K-mer, composition of k-spaced amino acid pairs (CKSAAP), AAindex, and binary encoding schemes. The important features were selected by the recursive feature elimination approach using a random forest classifier. Finally, we linearly combined the successive random forest (RF) probability scores generated by the different, single encoding-employing RF models. The resultant PredNTS predictor achieved an area under a curve (AUC) of 0.910 using five-fold cross validation. It outperformed the existing predictors on a comprehensive and independent dataset. Furthermore, we investigated several machine learning algorithms to demonstrate the superiority of the employed RF algorithm. The PredNTS is a useful computational resource for the prediction of nitrotyrosine sites. The web-application with the curated datasets of the PredNTS is publicly available. MDPI 2021-03-08 /pmc/articles/PMC7962192/ /pubmed/33800121 http://dx.doi.org/10.3390/ijms22052704 Text en © 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Nilamyani, Andi Nur
Auliah, Firda Nurul
Moni, Mohammad Ali
Shoombuatong, Watshara
Hasan, Md Mehedi
Kurata, Hiroyuki
PredNTS: Improved and Robust Prediction of Nitrotyrosine Sites by Integrating Multiple Sequence Features
title PredNTS: Improved and Robust Prediction of Nitrotyrosine Sites by Integrating Multiple Sequence Features
title_full PredNTS: Improved and Robust Prediction of Nitrotyrosine Sites by Integrating Multiple Sequence Features
title_fullStr PredNTS: Improved and Robust Prediction of Nitrotyrosine Sites by Integrating Multiple Sequence Features
title_full_unstemmed PredNTS: Improved and Robust Prediction of Nitrotyrosine Sites by Integrating Multiple Sequence Features
title_short PredNTS: Improved and Robust Prediction of Nitrotyrosine Sites by Integrating Multiple Sequence Features
title_sort prednts: improved and robust prediction of nitrotyrosine sites by integrating multiple sequence features
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7962192/
https://www.ncbi.nlm.nih.gov/pubmed/33800121
http://dx.doi.org/10.3390/ijms22052704
work_keys_str_mv AT nilamyaniandinur predntsimprovedandrobustpredictionofnitrotyrosinesitesbyintegratingmultiplesequencefeatures
AT auliahfirdanurul predntsimprovedandrobustpredictionofnitrotyrosinesitesbyintegratingmultiplesequencefeatures
AT monimohammadali predntsimprovedandrobustpredictionofnitrotyrosinesitesbyintegratingmultiplesequencefeatures
AT shoombuatongwatshara predntsimprovedandrobustpredictionofnitrotyrosinesitesbyintegratingmultiplesequencefeatures
AT hasanmdmehedi predntsimprovedandrobustpredictionofnitrotyrosinesitesbyintegratingmultiplesequencefeatures
AT kuratahiroyuki predntsimprovedandrobustpredictionofnitrotyrosinesitesbyintegratingmultiplesequencefeatures