Cargando…

Residue Adjacency Matrix Based Feature Engineering for Predicting Cysteine Reactivity in Proteins

Free radicals that form from reactive species of nitrogen and oxygen can react dangerously with cellular components and are involved with the pathogenesis of diabetes, cancer, Parkinson's, and heart disease. Cysteine amino acids, due to their reactive nature, are prone to oxidation by these fre...

Descripción completa

Detalles Bibliográficos
Autores principales: Mapes, Norman John, Rodriguez, Christopher, Chowriappa, Pradeep, Dua, Sumeet
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Research Network of Computational and Structural Biotechnology 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6327741/
https://www.ncbi.nlm.nih.gov/pubmed/30671196
http://dx.doi.org/10.1016/j.csbj.2018.12.005
_version_ 1783386525530062848
author Mapes, Norman John
Rodriguez, Christopher
Chowriappa, Pradeep
Dua, Sumeet
author_facet Mapes, Norman John
Rodriguez, Christopher
Chowriappa, Pradeep
Dua, Sumeet
author_sort Mapes, Norman John
collection PubMed
description Free radicals that form from reactive species of nitrogen and oxygen can react dangerously with cellular components and are involved with the pathogenesis of diabetes, cancer, Parkinson's, and heart disease. Cysteine amino acids, due to their reactive nature, are prone to oxidation by these free radicals. Determining which cysteines oxidize within proteins is crucial to our understanding of these chronic diseases. Wet lab techniques, like differential alkylation, to determine which cysteines oxidize are often expensive and time-consuming. We utilize machine learning as a fast and inexpensive approach to identifying cysteines with oxidative capabilities. We created the original features RAMmod and RAMseq for use in classification. We also incorporated well-known features such as PROPKA, SASA, PSS, and PSSM. Our algorithm requires only the protein sequence to operate; however, we do use template matching by MODELLER to acquire 3D coordinates for additional feature extraction. There was a mean improvement of RAM over N6C by 22.04% MCC. It was statistically significant with a p-value of 0.015. RAM provided a significant increase over PSSM with a p-value of 0.040 and an average 70.09% improvement MCC.
format Online
Article
Text
id pubmed-6327741
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Research Network of Computational and Structural Biotechnology
record_format MEDLINE/PubMed
spelling pubmed-63277412019-01-22 Residue Adjacency Matrix Based Feature Engineering for Predicting Cysteine Reactivity in Proteins Mapes, Norman John Rodriguez, Christopher Chowriappa, Pradeep Dua, Sumeet Comput Struct Biotechnol J Research Article Free radicals that form from reactive species of nitrogen and oxygen can react dangerously with cellular components and are involved with the pathogenesis of diabetes, cancer, Parkinson's, and heart disease. Cysteine amino acids, due to their reactive nature, are prone to oxidation by these free radicals. Determining which cysteines oxidize within proteins is crucial to our understanding of these chronic diseases. Wet lab techniques, like differential alkylation, to determine which cysteines oxidize are often expensive and time-consuming. We utilize machine learning as a fast and inexpensive approach to identifying cysteines with oxidative capabilities. We created the original features RAMmod and RAMseq for use in classification. We also incorporated well-known features such as PROPKA, SASA, PSS, and PSSM. Our algorithm requires only the protein sequence to operate; however, we do use template matching by MODELLER to acquire 3D coordinates for additional feature extraction. There was a mean improvement of RAM over N6C by 22.04% MCC. It was statistically significant with a p-value of 0.015. RAM provided a significant increase over PSSM with a p-value of 0.040 and an average 70.09% improvement MCC. Research Network of Computational and Structural Biotechnology 2018-12-26 /pmc/articles/PMC6327741/ /pubmed/30671196 http://dx.doi.org/10.1016/j.csbj.2018.12.005 Text en © 2019 The Authors http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Research Article
Mapes, Norman John
Rodriguez, Christopher
Chowriappa, Pradeep
Dua, Sumeet
Residue Adjacency Matrix Based Feature Engineering for Predicting Cysteine Reactivity in Proteins
title Residue Adjacency Matrix Based Feature Engineering for Predicting Cysteine Reactivity in Proteins
title_full Residue Adjacency Matrix Based Feature Engineering for Predicting Cysteine Reactivity in Proteins
title_fullStr Residue Adjacency Matrix Based Feature Engineering for Predicting Cysteine Reactivity in Proteins
title_full_unstemmed Residue Adjacency Matrix Based Feature Engineering for Predicting Cysteine Reactivity in Proteins
title_short Residue Adjacency Matrix Based Feature Engineering for Predicting Cysteine Reactivity in Proteins
title_sort residue adjacency matrix based feature engineering for predicting cysteine reactivity in proteins
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6327741/
https://www.ncbi.nlm.nih.gov/pubmed/30671196
http://dx.doi.org/10.1016/j.csbj.2018.12.005
work_keys_str_mv AT mapesnormanjohn residueadjacencymatrixbasedfeatureengineeringforpredictingcysteinereactivityinproteins
AT rodriguezchristopher residueadjacencymatrixbasedfeatureengineeringforpredictingcysteinereactivityinproteins
AT chowriappapradeep residueadjacencymatrixbasedfeatureengineeringforpredictingcysteinereactivityinproteins
AT duasumeet residueadjacencymatrixbasedfeatureengineeringforpredictingcysteinereactivityinproteins