Cargando…
A secondary structure-based position-specific scoring matrix applied to the improvement in protein secondary structure prediction
Protein secondary structure prediction (SSP) has a variety of applications; however, there has been relatively limited improvement in accuracy for years. With a vision of moving forward all related fields, we aimed to make a fundamental advance in SSP. There have been many admirable efforts made to...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8318245/ https://www.ncbi.nlm.nih.gov/pubmed/34320027 http://dx.doi.org/10.1371/journal.pone.0255076 |
_version_ | 1783730207900827648 |
---|---|
author | Chen, Teng-Ruei Juan, Sheng-Hung Huang, Yu-Wei Lin, Yen-Cheng Lo, Wei-Cheng |
author_facet | Chen, Teng-Ruei Juan, Sheng-Hung Huang, Yu-Wei Lin, Yen-Cheng Lo, Wei-Cheng |
author_sort | Chen, Teng-Ruei |
collection | PubMed |
description | Protein secondary structure prediction (SSP) has a variety of applications; however, there has been relatively limited improvement in accuracy for years. With a vision of moving forward all related fields, we aimed to make a fundamental advance in SSP. There have been many admirable efforts made to improve the machine learning algorithm for SSP. This work thus took a step back by manipulating the input features. A secondary structure element-based position-specific scoring matrix (SSE-PSSM) is proposed, based on which a new set of machine learning features can be established. The feasibility of this new PSSM was evaluated by rigid independent tests with training and testing datasets sharing <25% sequence identities. In all experiments, the proposed PSSM outperformed the traditional amino acid PSSM. This new PSSM can be easily combined with the amino acid PSSM, and the improvement in accuracy was remarkable. Preliminary tests made by combining the SSE-PSSM and well-known SSP methods showed 2.0% and 5.2% average improvements in three- and eight-state SSP accuracies, respectively. If this PSSM can be integrated into state-of-the-art SSP methods, the overall accuracy of SSP may break the current restriction and eventually bring benefit to all research and applications where secondary structure prediction plays a vital role during development. To facilitate the application and integration of the SSE-PSSM with modern SSP methods, we have established a web server and standalone programs for generating SSE-PSSM available at http://10.life.nctu.edu.tw/SSE-PSSM. |
format | Online Article Text |
id | pubmed-8318245 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-83182452021-07-31 A secondary structure-based position-specific scoring matrix applied to the improvement in protein secondary structure prediction Chen, Teng-Ruei Juan, Sheng-Hung Huang, Yu-Wei Lin, Yen-Cheng Lo, Wei-Cheng PLoS One Research Article Protein secondary structure prediction (SSP) has a variety of applications; however, there has been relatively limited improvement in accuracy for years. With a vision of moving forward all related fields, we aimed to make a fundamental advance in SSP. There have been many admirable efforts made to improve the machine learning algorithm for SSP. This work thus took a step back by manipulating the input features. A secondary structure element-based position-specific scoring matrix (SSE-PSSM) is proposed, based on which a new set of machine learning features can be established. The feasibility of this new PSSM was evaluated by rigid independent tests with training and testing datasets sharing <25% sequence identities. In all experiments, the proposed PSSM outperformed the traditional amino acid PSSM. This new PSSM can be easily combined with the amino acid PSSM, and the improvement in accuracy was remarkable. Preliminary tests made by combining the SSE-PSSM and well-known SSP methods showed 2.0% and 5.2% average improvements in three- and eight-state SSP accuracies, respectively. If this PSSM can be integrated into state-of-the-art SSP methods, the overall accuracy of SSP may break the current restriction and eventually bring benefit to all research and applications where secondary structure prediction plays a vital role during development. To facilitate the application and integration of the SSE-PSSM with modern SSP methods, we have established a web server and standalone programs for generating SSE-PSSM available at http://10.life.nctu.edu.tw/SSE-PSSM. Public Library of Science 2021-07-28 /pmc/articles/PMC8318245/ /pubmed/34320027 http://dx.doi.org/10.1371/journal.pone.0255076 Text en © 2021 Chen et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Chen, Teng-Ruei Juan, Sheng-Hung Huang, Yu-Wei Lin, Yen-Cheng Lo, Wei-Cheng A secondary structure-based position-specific scoring matrix applied to the improvement in protein secondary structure prediction |
title | A secondary structure-based position-specific scoring matrix applied to the improvement in protein secondary structure prediction |
title_full | A secondary structure-based position-specific scoring matrix applied to the improvement in protein secondary structure prediction |
title_fullStr | A secondary structure-based position-specific scoring matrix applied to the improvement in protein secondary structure prediction |
title_full_unstemmed | A secondary structure-based position-specific scoring matrix applied to the improvement in protein secondary structure prediction |
title_short | A secondary structure-based position-specific scoring matrix applied to the improvement in protein secondary structure prediction |
title_sort | secondary structure-based position-specific scoring matrix applied to the improvement in protein secondary structure prediction |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8318245/ https://www.ncbi.nlm.nih.gov/pubmed/34320027 http://dx.doi.org/10.1371/journal.pone.0255076 |
work_keys_str_mv | AT chentengruei asecondarystructurebasedpositionspecificscoringmatrixappliedtotheimprovementinproteinsecondarystructureprediction AT juanshenghung asecondarystructurebasedpositionspecificscoringmatrixappliedtotheimprovementinproteinsecondarystructureprediction AT huangyuwei asecondarystructurebasedpositionspecificscoringmatrixappliedtotheimprovementinproteinsecondarystructureprediction AT linyencheng asecondarystructurebasedpositionspecificscoringmatrixappliedtotheimprovementinproteinsecondarystructureprediction AT loweicheng asecondarystructurebasedpositionspecificscoringmatrixappliedtotheimprovementinproteinsecondarystructureprediction AT chentengruei secondarystructurebasedpositionspecificscoringmatrixappliedtotheimprovementinproteinsecondarystructureprediction AT juanshenghung secondarystructurebasedpositionspecificscoringmatrixappliedtotheimprovementinproteinsecondarystructureprediction AT huangyuwei secondarystructurebasedpositionspecificscoringmatrixappliedtotheimprovementinproteinsecondarystructureprediction AT linyencheng secondarystructurebasedpositionspecificscoringmatrixappliedtotheimprovementinproteinsecondarystructureprediction AT loweicheng secondarystructurebasedpositionspecificscoringmatrixappliedtotheimprovementinproteinsecondarystructureprediction |