Improving the performance of DomainDiscovery of protein domain boundary assignment using inter-domain linker index
BACKGROUND: Knowledge of protein domain boundaries is critical for the characterisation and understanding of protein function. The ability to identify domains without the knowledge of the structure – by using sequence information only – is an essential step in many types of protein analyses. In this...
Main authors: | Sikder, Abdur R, Zomaya, Albert Y |
---|---|
Format: | Text |
Language: | English |
Published: | BioMed Central 2006 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1764483/ https://www.ncbi.nlm.nih.gov/pubmed/17254311 http://dx.doi.org/10.1186/1471-2105-7-S5-S6 |
_version_ | 1782131621262000128 |
---|---|
author | Sikder, Abdur R Zomaya, Albert Y |
author_facet | Sikder, Abdur R Zomaya, Albert Y |
author_sort | Sikder, Abdur R |
collection | PubMed |
description | BACKGROUND: Knowledge of protein domain boundaries is critical for the characterisation and understanding of protein function. The ability to identify domains without knowledge of the structure – by using sequence information only – is an essential step in many types of protein analyses. In the present study, we demonstrate that the performance of DomainDiscovery is improved significantly by including the inter-domain linker index value for domain identification from sequence-based information. Improved DomainDiscovery uses a Support Vector Machine (SVM) approach and a unique training dataset built on the principle of consensus among experts in defining domains in protein structure. The SVM was trained using a PSSM (Position Specific Scoring Matrix), secondary structure, solvent accessibility information and inter-domain linker index to detect possible domain boundaries for a target sequence. RESULTS: Improved DomainDiscovery is compared with other methods by benchmarking against a structurally non-redundant dataset and also CASP5 targets. Improved DomainDiscovery achieves 70% accuracy for domain boundary identification in multi-domain proteins. CONCLUSION: Improved DomainDiscovery compares favourably to the performance of other methods and excels in the identification of domain boundaries for multi-domain proteins as a result of introducing a support vector machine with the benchmark_2 dataset. |
format | Text |
id | pubmed-1764483 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2006 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-1764483 2007-01-09 Improving the performance of DomainDiscovery of protein domain boundary assignment using inter-domain linker index Sikder, Abdur R Zomaya, Albert Y BMC Bioinformatics Proceedings BACKGROUND: Knowledge of protein domain boundaries is critical for the characterisation and understanding of protein function. The ability to identify domains without knowledge of the structure – by using sequence information only – is an essential step in many types of protein analyses. In the present study, we demonstrate that the performance of DomainDiscovery is improved significantly by including the inter-domain linker index value for domain identification from sequence-based information. Improved DomainDiscovery uses a Support Vector Machine (SVM) approach and a unique training dataset built on the principle of consensus among experts in defining domains in protein structure. The SVM was trained using a PSSM (Position Specific Scoring Matrix), secondary structure, solvent accessibility information and inter-domain linker index to detect possible domain boundaries for a target sequence. RESULTS: Improved DomainDiscovery is compared with other methods by benchmarking against a structurally non-redundant dataset and also CASP5 targets. Improved DomainDiscovery achieves 70% accuracy for domain boundary identification in multi-domain proteins. CONCLUSION: Improved DomainDiscovery compares favourably to the performance of other methods and excels in the identification of domain boundaries for multi-domain proteins as a result of introducing a support vector machine with the benchmark_2 dataset.
BioMed Central 2006-12-18 /pmc/articles/PMC1764483/ /pubmed/17254311 http://dx.doi.org/10.1186/1471-2105-7-S5-S6 Text en Copyright © 2006 Sikder and Zomaya; licensee BioMed Central Ltd http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Proceedings Sikder, Abdur R Zomaya, Albert Y Improving the performance of DomainDiscovery of protein domain boundary assignment using inter-domain linker index |
title | Improving the performance of DomainDiscovery of protein domain boundary assignment using inter-domain linker index |
title_full | Improving the performance of DomainDiscovery of protein domain boundary assignment using inter-domain linker index |
title_fullStr | Improving the performance of DomainDiscovery of protein domain boundary assignment using inter-domain linker index |
title_full_unstemmed | Improving the performance of DomainDiscovery of protein domain boundary assignment using inter-domain linker index |
title_short | Improving the performance of DomainDiscovery of protein domain boundary assignment using inter-domain linker index |
title_sort | improving the performance of domaindiscovery of protein domain boundary assignment using inter-domain linker index |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1764483/ https://www.ncbi.nlm.nih.gov/pubmed/17254311 http://dx.doi.org/10.1186/1471-2105-7-S5-S6 |
work_keys_str_mv | AT sikderabdurr improvingtheperformanceofdomaindiscoveryofproteindomainboundaryassignmentusinginterdomainlinkerindex AT zomayaalberty improvingtheperformanceofdomaindiscoveryofproteindomainboundaryassignmentusinginterdomainlinkerindex |