Cargando…

Pretata: predicting TATA binding proteins with novel features and dimensionality reduction strategy

BACKGROUND: It is necessary and essential to discovery protein function from the novel primary sequences. Wet lab experimental procedures are not only time-consuming, but also costly, so predicting protein structure and function reliably based only on amino acid sequence has significant value. TATA-...

Descripción completa

Detalles Bibliográficos
Autores principales: Zou, Quan, Wan, Shixiang, Ju, Ying, Tang, Jijun, Zeng, Xiangxiang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5259984/
https://www.ncbi.nlm.nih.gov/pubmed/28155714
http://dx.doi.org/10.1186/s12918-016-0353-5
_version_ 1782499317386313728
author Zou, Quan
Wan, Shixiang
Ju, Ying
Tang, Jijun
Zeng, Xiangxiang
author_facet Zou, Quan
Wan, Shixiang
Ju, Ying
Tang, Jijun
Zeng, Xiangxiang
author_sort Zou, Quan
collection PubMed
description BACKGROUND: It is necessary and essential to discovery protein function from the novel primary sequences. Wet lab experimental procedures are not only time-consuming, but also costly, so predicting protein structure and function reliably based only on amino acid sequence has significant value. TATA-binding protein (TBP) is a kind of DNA binding protein, which plays a key role in the transcription regulation. Our study proposed an automatic approach for identifying TATA-binding proteins efficiently, accurately, and conveniently. This method would guide for the special protein identification with computational intelligence strategies. RESULTS: Firstly, we proposed novel fingerprint features for TBP based on pseudo amino acid composition, physicochemical properties, and secondary structure. Secondly, hierarchical features dimensionality reduction strategies were employed to improve the performance furthermore. Currently, Pretata achieves 92.92% TATA-binding protein prediction accuracy, which is better than all other existing methods. CONCLUSIONS: The experiments demonstrate that our method could greatly improve the prediction accuracy and speed, thus allowing large-scale NGS data prediction to be practical. A web server is developed to facilitate the other researchers, which can be accessed at http://server.malab.cn/preTata/.
format Online
Article
Text
id pubmed-5259984
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-52599842017-01-26 Pretata: predicting TATA binding proteins with novel features and dimensionality reduction strategy Zou, Quan Wan, Shixiang Ju, Ying Tang, Jijun Zeng, Xiangxiang BMC Syst Biol Research BACKGROUND: It is necessary and essential to discovery protein function from the novel primary sequences. Wet lab experimental procedures are not only time-consuming, but also costly, so predicting protein structure and function reliably based only on amino acid sequence has significant value. TATA-binding protein (TBP) is a kind of DNA binding protein, which plays a key role in the transcription regulation. Our study proposed an automatic approach for identifying TATA-binding proteins efficiently, accurately, and conveniently. This method would guide for the special protein identification with computational intelligence strategies. RESULTS: Firstly, we proposed novel fingerprint features for TBP based on pseudo amino acid composition, physicochemical properties, and secondary structure. Secondly, hierarchical features dimensionality reduction strategies were employed to improve the performance furthermore. Currently, Pretata achieves 92.92% TATA-binding protein prediction accuracy, which is better than all other existing methods. CONCLUSIONS: The experiments demonstrate that our method could greatly improve the prediction accuracy and speed, thus allowing large-scale NGS data prediction to be practical. A web server is developed to facilitate the other researchers, which can be accessed at http://server.malab.cn/preTata/. BioMed Central 2016-12-23 /pmc/articles/PMC5259984/ /pubmed/28155714 http://dx.doi.org/10.1186/s12918-016-0353-5 Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Zou, Quan
Wan, Shixiang
Ju, Ying
Tang, Jijun
Zeng, Xiangxiang
Pretata: predicting TATA binding proteins with novel features and dimensionality reduction strategy
title Pretata: predicting TATA binding proteins with novel features and dimensionality reduction strategy
title_full Pretata: predicting TATA binding proteins with novel features and dimensionality reduction strategy
title_fullStr Pretata: predicting TATA binding proteins with novel features and dimensionality reduction strategy
title_full_unstemmed Pretata: predicting TATA binding proteins with novel features and dimensionality reduction strategy
title_short Pretata: predicting TATA binding proteins with novel features and dimensionality reduction strategy
title_sort pretata: predicting tata binding proteins with novel features and dimensionality reduction strategy
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5259984/
https://www.ncbi.nlm.nih.gov/pubmed/28155714
http://dx.doi.org/10.1186/s12918-016-0353-5
work_keys_str_mv AT zouquan pretatapredictingtatabindingproteinswithnovelfeaturesanddimensionalityreductionstrategy
AT wanshixiang pretatapredictingtatabindingproteinswithnovelfeaturesanddimensionalityreductionstrategy
AT juying pretatapredictingtatabindingproteinswithnovelfeaturesanddimensionalityreductionstrategy
AT tangjijun pretatapredictingtatabindingproteinswithnovelfeaturesanddimensionalityreductionstrategy
AT zengxiangxiang pretatapredictingtatabindingproteinswithnovelfeaturesanddimensionalityreductionstrategy