Cargando…

A secondary structure-based position-specific scoring matrix applied to the improvement in protein secondary structure prediction

Protein secondary structure prediction (SSP) has a variety of applications; however, there has been relatively limited improvement in accuracy for years. With a vision of moving forward all related fields, we aimed to make a fundamental advance in SSP. There have been many admirable efforts made to...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Teng-Ruei, Juan, Sheng-Hung, Huang, Yu-Wei, Lin, Yen-Cheng, Lo, Wei-Cheng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8318245/
https://www.ncbi.nlm.nih.gov/pubmed/34320027
http://dx.doi.org/10.1371/journal.pone.0255076
_version_ 1783730207900827648
author Chen, Teng-Ruei
Juan, Sheng-Hung
Huang, Yu-Wei
Lin, Yen-Cheng
Lo, Wei-Cheng
author_facet Chen, Teng-Ruei
Juan, Sheng-Hung
Huang, Yu-Wei
Lin, Yen-Cheng
Lo, Wei-Cheng
author_sort Chen, Teng-Ruei
collection PubMed
description Protein secondary structure prediction (SSP) has a variety of applications; however, there has been relatively limited improvement in accuracy for years. With a vision of moving forward all related fields, we aimed to make a fundamental advance in SSP. There have been many admirable efforts made to improve the machine learning algorithm for SSP. This work thus took a step back by manipulating the input features. A secondary structure element-based position-specific scoring matrix (SSE-PSSM) is proposed, based on which a new set of machine learning features can be established. The feasibility of this new PSSM was evaluated by rigid independent tests with training and testing datasets sharing <25% sequence identities. In all experiments, the proposed PSSM outperformed the traditional amino acid PSSM. This new PSSM can be easily combined with the amino acid PSSM, and the improvement in accuracy was remarkable. Preliminary tests made by combining the SSE-PSSM and well-known SSP methods showed 2.0% and 5.2% average improvements in three- and eight-state SSP accuracies, respectively. If this PSSM can be integrated into state-of-the-art SSP methods, the overall accuracy of SSP may break the current restriction and eventually bring benefit to all research and applications where secondary structure prediction plays a vital role during development. To facilitate the application and integration of the SSE-PSSM with modern SSP methods, we have established a web server and standalone programs for generating SSE-PSSM available at http://10.life.nctu.edu.tw/SSE-PSSM.
format Online
Article
Text
id pubmed-8318245
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-83182452021-07-31 A secondary structure-based position-specific scoring matrix applied to the improvement in protein secondary structure prediction Chen, Teng-Ruei Juan, Sheng-Hung Huang, Yu-Wei Lin, Yen-Cheng Lo, Wei-Cheng PLoS One Research Article Protein secondary structure prediction (SSP) has a variety of applications; however, there has been relatively limited improvement in accuracy for years. With a vision of moving forward all related fields, we aimed to make a fundamental advance in SSP. There have been many admirable efforts made to improve the machine learning algorithm for SSP. This work thus took a step back by manipulating the input features. A secondary structure element-based position-specific scoring matrix (SSE-PSSM) is proposed, based on which a new set of machine learning features can be established. The feasibility of this new PSSM was evaluated by rigid independent tests with training and testing datasets sharing <25% sequence identities. In all experiments, the proposed PSSM outperformed the traditional amino acid PSSM. This new PSSM can be easily combined with the amino acid PSSM, and the improvement in accuracy was remarkable. Preliminary tests made by combining the SSE-PSSM and well-known SSP methods showed 2.0% and 5.2% average improvements in three- and eight-state SSP accuracies, respectively. If this PSSM can be integrated into state-of-the-art SSP methods, the overall accuracy of SSP may break the current restriction and eventually bring benefit to all research and applications where secondary structure prediction plays a vital role during development. To facilitate the application and integration of the SSE-PSSM with modern SSP methods, we have established a web server and standalone programs for generating SSE-PSSM available at http://10.life.nctu.edu.tw/SSE-PSSM. Public Library of Science 2021-07-28 /pmc/articles/PMC8318245/ /pubmed/34320027 http://dx.doi.org/10.1371/journal.pone.0255076 Text en © 2021 Chen et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Chen, Teng-Ruei
Juan, Sheng-Hung
Huang, Yu-Wei
Lin, Yen-Cheng
Lo, Wei-Cheng
A secondary structure-based position-specific scoring matrix applied to the improvement in protein secondary structure prediction
title A secondary structure-based position-specific scoring matrix applied to the improvement in protein secondary structure prediction
title_full A secondary structure-based position-specific scoring matrix applied to the improvement in protein secondary structure prediction
title_fullStr A secondary structure-based position-specific scoring matrix applied to the improvement in protein secondary structure prediction
title_full_unstemmed A secondary structure-based position-specific scoring matrix applied to the improvement in protein secondary structure prediction
title_short A secondary structure-based position-specific scoring matrix applied to the improvement in protein secondary structure prediction
title_sort secondary structure-based position-specific scoring matrix applied to the improvement in protein secondary structure prediction
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8318245/
https://www.ncbi.nlm.nih.gov/pubmed/34320027
http://dx.doi.org/10.1371/journal.pone.0255076
work_keys_str_mv AT chentengruei asecondarystructurebasedpositionspecificscoringmatrixappliedtotheimprovementinproteinsecondarystructureprediction
AT juanshenghung asecondarystructurebasedpositionspecificscoringmatrixappliedtotheimprovementinproteinsecondarystructureprediction
AT huangyuwei asecondarystructurebasedpositionspecificscoringmatrixappliedtotheimprovementinproteinsecondarystructureprediction
AT linyencheng asecondarystructurebasedpositionspecificscoringmatrixappliedtotheimprovementinproteinsecondarystructureprediction
AT loweicheng asecondarystructurebasedpositionspecificscoringmatrixappliedtotheimprovementinproteinsecondarystructureprediction
AT chentengruei secondarystructurebasedpositionspecificscoringmatrixappliedtotheimprovementinproteinsecondarystructureprediction
AT juanshenghung secondarystructurebasedpositionspecificscoringmatrixappliedtotheimprovementinproteinsecondarystructureprediction
AT huangyuwei secondarystructurebasedpositionspecificscoringmatrixappliedtotheimprovementinproteinsecondarystructureprediction
AT linyencheng secondarystructurebasedpositionspecificscoringmatrixappliedtotheimprovementinproteinsecondarystructureprediction
AT loweicheng secondarystructurebasedpositionspecificscoringmatrixappliedtotheimprovementinproteinsecondarystructureprediction