Cargando…

NMRDSP: An Accurate Prediction of Protein Shape Strings from NMR Chemical Shifts and Sequence Data

Shape string is structural sequence and is an extremely important structure representation of protein backbone conformations. Nuclear magnetic resonance chemical shifts give a strong correlation with the local protein structure, and are exploited to predict protein structures in conjunction with com...

Descripción completa

Detalles Bibliográficos
Autores principales: Mao, Wusong, Cong, Peisheng, Wang, Zhiheng, Lu, Longjian, Zhu, Zhongliang, Li, Tonghua
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3871590/
https://www.ncbi.nlm.nih.gov/pubmed/24376713
http://dx.doi.org/10.1371/journal.pone.0083532
_version_ 1782296842011148288
author Mao, Wusong
Cong, Peisheng
Wang, Zhiheng
Lu, Longjian
Zhu, Zhongliang
Li, Tonghua
author_facet Mao, Wusong
Cong, Peisheng
Wang, Zhiheng
Lu, Longjian
Zhu, Zhongliang
Li, Tonghua
author_sort Mao, Wusong
collection PubMed
description Shape string is structural sequence and is an extremely important structure representation of protein backbone conformations. Nuclear magnetic resonance chemical shifts give a strong correlation with the local protein structure, and are exploited to predict protein structures in conjunction with computational approaches. Here we demonstrate a novel approach, NMRDSP, which can accurately predict the protein shape string based on nuclear magnetic resonance chemical shifts and structural profiles obtained from sequence data. The NMRDSP uses six chemical shifts (HA, H, N, CA, CB and C) and eight elements of structure profiles as features, a non-redundant set (1,003 entries) as the training set, and a conditional random field as a classification algorithm. For an independent testing set (203 entries), we achieved an accuracy of 75.8% for S8 (the eight states accuracy) and 87.8% for S3 (the three states accuracy). This is higher than only using chemical shifts or sequence data, and confirms that the chemical shift and the structure profile are significant features for shape string prediction and their combination prominently improves the accuracy of the predictor. We have constructed the NMRDSP web server and believe it could be employed to provide a solid platform to predict other protein structures and functions. The NMRDSP web server is freely available at http://cal.tongji.edu.cn/NMRDSP/index.jsp.
format Online
Article
Text
id pubmed-3871590
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-38715902013-12-27 NMRDSP: An Accurate Prediction of Protein Shape Strings from NMR Chemical Shifts and Sequence Data Mao, Wusong Cong, Peisheng Wang, Zhiheng Lu, Longjian Zhu, Zhongliang Li, Tonghua PLoS One Research Article Shape string is structural sequence and is an extremely important structure representation of protein backbone conformations. Nuclear magnetic resonance chemical shifts give a strong correlation with the local protein structure, and are exploited to predict protein structures in conjunction with computational approaches. Here we demonstrate a novel approach, NMRDSP, which can accurately predict the protein shape string based on nuclear magnetic resonance chemical shifts and structural profiles obtained from sequence data. The NMRDSP uses six chemical shifts (HA, H, N, CA, CB and C) and eight elements of structure profiles as features, a non-redundant set (1,003 entries) as the training set, and a conditional random field as a classification algorithm. For an independent testing set (203 entries), we achieved an accuracy of 75.8% for S8 (the eight states accuracy) and 87.8% for S3 (the three states accuracy). This is higher than only using chemical shifts or sequence data, and confirms that the chemical shift and the structure profile are significant features for shape string prediction and their combination prominently improves the accuracy of the predictor. We have constructed the NMRDSP web server and believe it could be employed to provide a solid platform to predict other protein structures and functions. The NMRDSP web server is freely available at http://cal.tongji.edu.cn/NMRDSP/index.jsp. Public Library of Science 2013-12-23 /pmc/articles/PMC3871590/ /pubmed/24376713 http://dx.doi.org/10.1371/journal.pone.0083532 Text en © 2013 Mao et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Mao, Wusong
Cong, Peisheng
Wang, Zhiheng
Lu, Longjian
Zhu, Zhongliang
Li, Tonghua
NMRDSP: An Accurate Prediction of Protein Shape Strings from NMR Chemical Shifts and Sequence Data
title NMRDSP: An Accurate Prediction of Protein Shape Strings from NMR Chemical Shifts and Sequence Data
title_full NMRDSP: An Accurate Prediction of Protein Shape Strings from NMR Chemical Shifts and Sequence Data
title_fullStr NMRDSP: An Accurate Prediction of Protein Shape Strings from NMR Chemical Shifts and Sequence Data
title_full_unstemmed NMRDSP: An Accurate Prediction of Protein Shape Strings from NMR Chemical Shifts and Sequence Data
title_short NMRDSP: An Accurate Prediction of Protein Shape Strings from NMR Chemical Shifts and Sequence Data
title_sort nmrdsp: an accurate prediction of protein shape strings from nmr chemical shifts and sequence data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3871590/
https://www.ncbi.nlm.nih.gov/pubmed/24376713
http://dx.doi.org/10.1371/journal.pone.0083532
work_keys_str_mv AT maowusong nmrdspanaccuratepredictionofproteinshapestringsfromnmrchemicalshiftsandsequencedata
AT congpeisheng nmrdspanaccuratepredictionofproteinshapestringsfromnmrchemicalshiftsandsequencedata
AT wangzhiheng nmrdspanaccuratepredictionofproteinshapestringsfromnmrchemicalshiftsandsequencedata
AT lulongjian nmrdspanaccuratepredictionofproteinshapestringsfromnmrchemicalshiftsandsequencedata
AT zhuzhongliang nmrdspanaccuratepredictionofproteinshapestringsfromnmrchemicalshiftsandsequencedata
AT litonghua nmrdspanaccuratepredictionofproteinshapestringsfromnmrchemicalshiftsandsequencedata