Cargando…
Computational Prediction of Intrinsically Disordered Proteins Based on Protein Sequences and Convolutional Neural Networks
Intrinsically disordered proteins (IDPs) possess at least one region that lacks a single stable structure in vivo, which makes them play an important role in a variety of biological functions. We propose a prediction method for IDPs based on convolutional neural networks (CNNs) and feature selection...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8727116/ https://www.ncbi.nlm.nih.gov/pubmed/34992646 http://dx.doi.org/10.1155/2021/4455604 |
_version_ | 1784626446607581184 |
---|---|
author | He, Hao Yang, Yong |
author_facet | He, Hao Yang, Yong |
author_sort | He, Hao |
collection | PubMed |
description | Intrinsically disordered proteins (IDPs) possess at least one region that lacks a single stable structure in vivo, which makes them play an important role in a variety of biological functions. We propose a prediction method for IDPs based on convolutional neural networks (CNNs) and feature selection. The combination of sequence and evolutionary properties is used to describe the differences between disordered and ordered regions. Especially, to highlight the correlation between the target residue and adjacent residues, multiple windows are selected to preprocess the protein sequence through the selected properties. The shorter windows reflect the characteristics of the central residue, and the longer windows reflect the characteristics of the surroundings around the central residue. Moreover, to highlight the specificity of sequence and evolutionary properties, they are preprocessed, respectively. After that, the preprocessed properties are combined into feature matrices as the input of the constructed CNN. Our method is training as well as testing based on the DisProt database. The simulation results show that the proposed method can predict IDPs effectively, and the performance is competitive in comparison with IsUnstruct and ESpritz. |
format | Online Article Text |
id | pubmed-8727116 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Hindawi |
record_format | MEDLINE/PubMed |
spelling | pubmed-87271162022-01-05 Computational Prediction of Intrinsically Disordered Proteins Based on Protein Sequences and Convolutional Neural Networks He, Hao Yang, Yong Comput Intell Neurosci Research Article Intrinsically disordered proteins (IDPs) possess at least one region that lacks a single stable structure in vivo, which makes them play an important role in a variety of biological functions. We propose a prediction method for IDPs based on convolutional neural networks (CNNs) and feature selection. The combination of sequence and evolutionary properties is used to describe the differences between disordered and ordered regions. Especially, to highlight the correlation between the target residue and adjacent residues, multiple windows are selected to preprocess the protein sequence through the selected properties. The shorter windows reflect the characteristics of the central residue, and the longer windows reflect the characteristics of the surroundings around the central residue. Moreover, to highlight the specificity of sequence and evolutionary properties, they are preprocessed, respectively. After that, the preprocessed properties are combined into feature matrices as the input of the constructed CNN. Our method is training as well as testing based on the DisProt database. The simulation results show that the proposed method can predict IDPs effectively, and the performance is competitive in comparison with IsUnstruct and ESpritz. Hindawi 2021-12-28 /pmc/articles/PMC8727116/ /pubmed/34992646 http://dx.doi.org/10.1155/2021/4455604 Text en Copyright © 2021 Hao He and Yong Yang. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article He, Hao Yang, Yong Computational Prediction of Intrinsically Disordered Proteins Based on Protein Sequences and Convolutional Neural Networks |
title | Computational Prediction of Intrinsically Disordered Proteins Based on Protein Sequences and Convolutional Neural Networks |
title_full | Computational Prediction of Intrinsically Disordered Proteins Based on Protein Sequences and Convolutional Neural Networks |
title_fullStr | Computational Prediction of Intrinsically Disordered Proteins Based on Protein Sequences and Convolutional Neural Networks |
title_full_unstemmed | Computational Prediction of Intrinsically Disordered Proteins Based on Protein Sequences and Convolutional Neural Networks |
title_short | Computational Prediction of Intrinsically Disordered Proteins Based on Protein Sequences and Convolutional Neural Networks |
title_sort | computational prediction of intrinsically disordered proteins based on protein sequences and convolutional neural networks |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8727116/ https://www.ncbi.nlm.nih.gov/pubmed/34992646 http://dx.doi.org/10.1155/2021/4455604 |
work_keys_str_mv | AT hehao computationalpredictionofintrinsicallydisorderedproteinsbasedonproteinsequencesandconvolutionalneuralnetworks AT yangyong computationalpredictionofintrinsicallydisorderedproteinsbasedonproteinsequencesandconvolutionalneuralnetworks |