Cargando…

Protein local 3D structure prediction by Super Granule Support Vector Machines (Super GSVM)

BACKGROUND: Understanding the relationship between the protein sequence and the 3D structure is a major research area in bioinformatics. The prediction of complete protein tertiary structure based only on sequence information is still an impractical work. This paper aims at revealing the hidden know...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Bernard, Johnson, Matthew
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3226186/
https://www.ncbi.nlm.nih.gov/pubmed/19811680
http://dx.doi.org/10.1186/1471-2105-10-S11-S15
_version_ 1782217578036330496
author Chen, Bernard
Johnson, Matthew
author_facet Chen, Bernard
Johnson, Matthew
author_sort Chen, Bernard
collection PubMed
description BACKGROUND: Understanding the relationship between the protein sequence and the 3D structure is a major research area in bioinformatics. The prediction of complete protein tertiary structure based only on sequence information is still an impractical work. This paper aims at revealing the hidden knowledge of the sequence motifs and the local tertiary structure. RESULTS: In this paper, we propose a Super Granule Support Vector Machine (Super GSVM) model to obtain the high quality protein sequence motifs and to predict local tertiary structure information based on purely sequence information. CONCLUSION: The proposed model overcomes the innate shortcoming of using the SVM on such a large data set, which is the inherent computational complexity involved in training support vectors for huge datasets including half million of samples. The satisfactory prediction results show the Super GSVM model generates decent protein sequence clusters and has the ability to capture the hidden sequence-to-structure information. This model also has a strong potential in the application of SVMs on other research areas with huge datasets.
format Online
Article
Text
id pubmed-3226186
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32261862011-11-30 Protein local 3D structure prediction by Super Granule Support Vector Machines (Super GSVM) Chen, Bernard Johnson, Matthew BMC Bioinformatics Proceedings BACKGROUND: Understanding the relationship between the protein sequence and the 3D structure is a major research area in bioinformatics. The prediction of complete protein tertiary structure based only on sequence information is still an impractical work. This paper aims at revealing the hidden knowledge of the sequence motifs and the local tertiary structure. RESULTS: In this paper, we propose a Super Granule Support Vector Machine (Super GSVM) model to obtain the high quality protein sequence motifs and to predict local tertiary structure information based on purely sequence information. CONCLUSION: The proposed model overcomes the innate shortcoming of using the SVM on such a large data set, which is the inherent computational complexity involved in training support vectors for huge datasets including half million of samples. The satisfactory prediction results show the Super GSVM model generates decent protein sequence clusters and has the ability to capture the hidden sequence-to-structure information. This model also has a strong potential in the application of SVMs on other research areas with huge datasets. BioMed Central 2009-10-08 /pmc/articles/PMC3226186/ /pubmed/19811680 http://dx.doi.org/10.1186/1471-2105-10-S11-S15 Text en Copyright ©2009 Chen and Johnson; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Chen, Bernard
Johnson, Matthew
Protein local 3D structure prediction by Super Granule Support Vector Machines (Super GSVM)
title Protein local 3D structure prediction by Super Granule Support Vector Machines (Super GSVM)
title_full Protein local 3D structure prediction by Super Granule Support Vector Machines (Super GSVM)
title_fullStr Protein local 3D structure prediction by Super Granule Support Vector Machines (Super GSVM)
title_full_unstemmed Protein local 3D structure prediction by Super Granule Support Vector Machines (Super GSVM)
title_short Protein local 3D structure prediction by Super Granule Support Vector Machines (Super GSVM)
title_sort protein local 3d structure prediction by super granule support vector machines (super gsvm)
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3226186/
https://www.ncbi.nlm.nih.gov/pubmed/19811680
http://dx.doi.org/10.1186/1471-2105-10-S11-S15
work_keys_str_mv AT chenbernard proteinlocal3dstructurepredictionbysupergranulesupportvectormachinessupergsvm
AT johnsonmatthew proteinlocal3dstructurepredictionbysupergranulesupportvectormachinessupergsvm