Cargando…
Residue Adjacency Matrix Based Feature Engineering for Predicting Cysteine Reactivity in Proteins
Free radicals that form from reactive species of nitrogen and oxygen can react dangerously with cellular components and are involved with the pathogenesis of diabetes, cancer, Parkinson's, and heart disease. Cysteine amino acids, due to their reactive nature, are prone to oxidation by these fre...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Research Network of Computational and Structural Biotechnology
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6327741/ https://www.ncbi.nlm.nih.gov/pubmed/30671196 http://dx.doi.org/10.1016/j.csbj.2018.12.005 |
_version_ | 1783386525530062848 |
---|---|
author | Mapes, Norman John Rodriguez, Christopher Chowriappa, Pradeep Dua, Sumeet |
author_facet | Mapes, Norman John Rodriguez, Christopher Chowriappa, Pradeep Dua, Sumeet |
author_sort | Mapes, Norman John |
collection | PubMed |
description | Free radicals that form from reactive species of nitrogen and oxygen can react dangerously with cellular components and are involved with the pathogenesis of diabetes, cancer, Parkinson's, and heart disease. Cysteine amino acids, due to their reactive nature, are prone to oxidation by these free radicals. Determining which cysteines oxidize within proteins is crucial to our understanding of these chronic diseases. Wet lab techniques, like differential alkylation, to determine which cysteines oxidize are often expensive and time-consuming. We utilize machine learning as a fast and inexpensive approach to identifying cysteines with oxidative capabilities. We created the original features RAMmod and RAMseq for use in classification. We also incorporated well-known features such as PROPKA, SASA, PSS, and PSSM. Our algorithm requires only the protein sequence to operate; however, we do use template matching by MODELLER to acquire 3D coordinates for additional feature extraction. There was a mean improvement of RAM over N6C by 22.04% MCC. It was statistically significant with a p-value of 0.015. RAM provided a significant increase over PSSM with a p-value of 0.040 and an average 70.09% improvement MCC. |
format | Online Article Text |
id | pubmed-6327741 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Research Network of Computational and Structural Biotechnology |
record_format | MEDLINE/PubMed |
spelling | pubmed-63277412019-01-22 Residue Adjacency Matrix Based Feature Engineering for Predicting Cysteine Reactivity in Proteins Mapes, Norman John Rodriguez, Christopher Chowriappa, Pradeep Dua, Sumeet Comput Struct Biotechnol J Research Article Free radicals that form from reactive species of nitrogen and oxygen can react dangerously with cellular components and are involved with the pathogenesis of diabetes, cancer, Parkinson's, and heart disease. Cysteine amino acids, due to their reactive nature, are prone to oxidation by these free radicals. Determining which cysteines oxidize within proteins is crucial to our understanding of these chronic diseases. Wet lab techniques, like differential alkylation, to determine which cysteines oxidize are often expensive and time-consuming. We utilize machine learning as a fast and inexpensive approach to identifying cysteines with oxidative capabilities. We created the original features RAMmod and RAMseq for use in classification. We also incorporated well-known features such as PROPKA, SASA, PSS, and PSSM. Our algorithm requires only the protein sequence to operate; however, we do use template matching by MODELLER to acquire 3D coordinates for additional feature extraction. There was a mean improvement of RAM over N6C by 22.04% MCC. It was statistically significant with a p-value of 0.015. RAM provided a significant increase over PSSM with a p-value of 0.040 and an average 70.09% improvement MCC. Research Network of Computational and Structural Biotechnology 2018-12-26 /pmc/articles/PMC6327741/ /pubmed/30671196 http://dx.doi.org/10.1016/j.csbj.2018.12.005 Text en © 2019 The Authors http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Research Article Mapes, Norman John Rodriguez, Christopher Chowriappa, Pradeep Dua, Sumeet Residue Adjacency Matrix Based Feature Engineering for Predicting Cysteine Reactivity in Proteins |
title | Residue Adjacency Matrix Based Feature Engineering for Predicting Cysteine Reactivity in Proteins |
title_full | Residue Adjacency Matrix Based Feature Engineering for Predicting Cysteine Reactivity in Proteins |
title_fullStr | Residue Adjacency Matrix Based Feature Engineering for Predicting Cysteine Reactivity in Proteins |
title_full_unstemmed | Residue Adjacency Matrix Based Feature Engineering for Predicting Cysteine Reactivity in Proteins |
title_short | Residue Adjacency Matrix Based Feature Engineering for Predicting Cysteine Reactivity in Proteins |
title_sort | residue adjacency matrix based feature engineering for predicting cysteine reactivity in proteins |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6327741/ https://www.ncbi.nlm.nih.gov/pubmed/30671196 http://dx.doi.org/10.1016/j.csbj.2018.12.005 |
work_keys_str_mv | AT mapesnormanjohn residueadjacencymatrixbasedfeatureengineeringforpredictingcysteinereactivityinproteins AT rodriguezchristopher residueadjacencymatrixbasedfeatureengineeringforpredictingcysteinereactivityinproteins AT chowriappapradeep residueadjacencymatrixbasedfeatureengineeringforpredictingcysteinereactivityinproteins AT duasumeet residueadjacencymatrixbasedfeatureengineeringforpredictingcysteinereactivityinproteins |