Cargando…
Protein local 3D structure prediction by Super Granule Support Vector Machines (Super GSVM)
BACKGROUND: Understanding the relationship between the protein sequence and the 3D structure is a major research area in bioinformatics. The prediction of complete protein tertiary structure based only on sequence information is still an impractical work. This paper aims at revealing the hidden know...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3226186/ https://www.ncbi.nlm.nih.gov/pubmed/19811680 http://dx.doi.org/10.1186/1471-2105-10-S11-S15 |
_version_ | 1782217578036330496 |
---|---|
author | Chen, Bernard Johnson, Matthew |
author_facet | Chen, Bernard Johnson, Matthew |
author_sort | Chen, Bernard |
collection | PubMed |
description | BACKGROUND: Understanding the relationship between the protein sequence and the 3D structure is a major research area in bioinformatics. The prediction of complete protein tertiary structure based only on sequence information is still an impractical work. This paper aims at revealing the hidden knowledge of the sequence motifs and the local tertiary structure. RESULTS: In this paper, we propose a Super Granule Support Vector Machine (Super GSVM) model to obtain the high quality protein sequence motifs and to predict local tertiary structure information based on purely sequence information. CONCLUSION: The proposed model overcomes the innate shortcoming of using the SVM on such a large data set, which is the inherent computational complexity involved in training support vectors for huge datasets including half million of samples. The satisfactory prediction results show the Super GSVM model generates decent protein sequence clusters and has the ability to capture the hidden sequence-to-structure information. This model also has a strong potential in the application of SVMs on other research areas with huge datasets. |
format | Online Article Text |
id | pubmed-3226186 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-32261862011-11-30 Protein local 3D structure prediction by Super Granule Support Vector Machines (Super GSVM) Chen, Bernard Johnson, Matthew BMC Bioinformatics Proceedings BACKGROUND: Understanding the relationship between the protein sequence and the 3D structure is a major research area in bioinformatics. The prediction of complete protein tertiary structure based only on sequence information is still an impractical work. This paper aims at revealing the hidden knowledge of the sequence motifs and the local tertiary structure. RESULTS: In this paper, we propose a Super Granule Support Vector Machine (Super GSVM) model to obtain the high quality protein sequence motifs and to predict local tertiary structure information based on purely sequence information. CONCLUSION: The proposed model overcomes the innate shortcoming of using the SVM on such a large data set, which is the inherent computational complexity involved in training support vectors for huge datasets including half million of samples. The satisfactory prediction results show the Super GSVM model generates decent protein sequence clusters and has the ability to capture the hidden sequence-to-structure information. This model also has a strong potential in the application of SVMs on other research areas with huge datasets. BioMed Central 2009-10-08 /pmc/articles/PMC3226186/ /pubmed/19811680 http://dx.doi.org/10.1186/1471-2105-10-S11-S15 Text en Copyright ©2009 Chen and Johnson; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Proceedings Chen, Bernard Johnson, Matthew Protein local 3D structure prediction by Super Granule Support Vector Machines (Super GSVM) |
title | Protein local 3D structure prediction by Super Granule Support Vector Machines (Super GSVM) |
title_full | Protein local 3D structure prediction by Super Granule Support Vector Machines (Super GSVM) |
title_fullStr | Protein local 3D structure prediction by Super Granule Support Vector Machines (Super GSVM) |
title_full_unstemmed | Protein local 3D structure prediction by Super Granule Support Vector Machines (Super GSVM) |
title_short | Protein local 3D structure prediction by Super Granule Support Vector Machines (Super GSVM) |
title_sort | protein local 3d structure prediction by super granule support vector machines (super gsvm) |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3226186/ https://www.ncbi.nlm.nih.gov/pubmed/19811680 http://dx.doi.org/10.1186/1471-2105-10-S11-S15 |
work_keys_str_mv | AT chenbernard proteinlocal3dstructurepredictionbysupergranulesupportvectormachinessupergsvm AT johnsonmatthew proteinlocal3dstructurepredictionbysupergranulesupportvectormachinessupergsvm |