Cargando…

Support vector machine (SVM) based multiclass prediction with basic statistical analysis of plasminogen activators

BACKGROUND: Plasminogen (Pg), the precursor of the proteolytic and fibrinolytic enzyme of blood, is converted to the active enzyme plasmin (Pm) by different plasminogen activators (tissue plasminogen activators and urokinase), including the bacterial activators streptokinase and staphylokinase, whic...

Descripción completa

Detalles Bibliográficos
Autores principales: Muthukrishnan, Selvaraj, Puri, Munish, Lefevre, Christophe
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3924408/
https://www.ncbi.nlm.nih.gov/pubmed/24468032
http://dx.doi.org/10.1186/1756-0500-7-63
_version_ 1782303740658712576
author Muthukrishnan, Selvaraj
Puri, Munish
Lefevre, Christophe
author_facet Muthukrishnan, Selvaraj
Puri, Munish
Lefevre, Christophe
author_sort Muthukrishnan, Selvaraj
collection PubMed
description BACKGROUND: Plasminogen (Pg), the precursor of the proteolytic and fibrinolytic enzyme of blood, is converted to the active enzyme plasmin (Pm) by different plasminogen activators (tissue plasminogen activators and urokinase), including the bacterial activators streptokinase and staphylokinase, which activate Pg to Pm and thus are used clinically for thrombolysis. The identification of Pg-activators is therefore an important step in understanding their functional mechanism and derives new therapies. METHODS: In this study, different computational methods for predicting plasminogen activator peptide sequences with high accuracy were investigated, including support vector machines (SVM) based on amino acid (AC), dipeptide composition (DC), PSSM profile and Hybrid methods used to predict different Pg-activators from both prokaryotic and eukaryotic origins. RESULTS: Overall maximum accuracy, evaluated using the five-fold cross validation technique, was 88.37%, 84.32%, 87.61%, 85.63% in 0.87, 0.83,0.86 and 0.85 MCC with amino (AC) or dipeptide composition (DC), PSSM profile and Hybrid methods respectively. Through this study, we have found that the different subfamilies of Pg-activators are quite closely correlated in terms of amino, dipeptide, PSSM and Hybrid compositions. Therefore, our prediction results show that plasminogen activators are predictable with a high accuracy from their primary sequence. Prediction performance was also cross-checked by confusion matrix and ROC (Receiver operating characteristics) analysis. A web server to facilitate the prediction of Pg-activators from primary sequence data was implemented. CONCLUSION: The results show that dipeptide, PSSM profile, and Hybrid based methods perform better than single amino acid composition (AC). Furthermore, we also have developed a web server, which predicts the Pg-activators and their classification (available online at http://mamsap.it.deakin.edu.au/plas_pred/home.html). Our experimental results show that our approaches are faster and achieve generally a good prediction performance.
format Online
Article
Text
id pubmed-3924408
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-39244082014-03-03 Support vector machine (SVM) based multiclass prediction with basic statistical analysis of plasminogen activators Muthukrishnan, Selvaraj Puri, Munish Lefevre, Christophe BMC Res Notes Research Article BACKGROUND: Plasminogen (Pg), the precursor of the proteolytic and fibrinolytic enzyme of blood, is converted to the active enzyme plasmin (Pm) by different plasminogen activators (tissue plasminogen activators and urokinase), including the bacterial activators streptokinase and staphylokinase, which activate Pg to Pm and thus are used clinically for thrombolysis. The identification of Pg-activators is therefore an important step in understanding their functional mechanism and derives new therapies. METHODS: In this study, different computational methods for predicting plasminogen activator peptide sequences with high accuracy were investigated, including support vector machines (SVM) based on amino acid (AC), dipeptide composition (DC), PSSM profile and Hybrid methods used to predict different Pg-activators from both prokaryotic and eukaryotic origins. RESULTS: Overall maximum accuracy, evaluated using the five-fold cross validation technique, was 88.37%, 84.32%, 87.61%, 85.63% in 0.87, 0.83,0.86 and 0.85 MCC with amino (AC) or dipeptide composition (DC), PSSM profile and Hybrid methods respectively. Through this study, we have found that the different subfamilies of Pg-activators are quite closely correlated in terms of amino, dipeptide, PSSM and Hybrid compositions. Therefore, our prediction results show that plasminogen activators are predictable with a high accuracy from their primary sequence. Prediction performance was also cross-checked by confusion matrix and ROC (Receiver operating characteristics) analysis. A web server to facilitate the prediction of Pg-activators from primary sequence data was implemented. CONCLUSION: The results show that dipeptide, PSSM profile, and Hybrid based methods perform better than single amino acid composition (AC). Furthermore, we also have developed a web server, which predicts the Pg-activators and their classification (available online at http://mamsap.it.deakin.edu.au/plas_pred/home.html). Our experimental results show that our approaches are faster and achieve generally a good prediction performance. BioMed Central 2014-01-27 /pmc/articles/PMC3924408/ /pubmed/24468032 http://dx.doi.org/10.1186/1756-0500-7-63 Text en Copyright © 2014 Muthukrishnan et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Muthukrishnan, Selvaraj
Puri, Munish
Lefevre, Christophe
Support vector machine (SVM) based multiclass prediction with basic statistical analysis of plasminogen activators
title Support vector machine (SVM) based multiclass prediction with basic statistical analysis of plasminogen activators
title_full Support vector machine (SVM) based multiclass prediction with basic statistical analysis of plasminogen activators
title_fullStr Support vector machine (SVM) based multiclass prediction with basic statistical analysis of plasminogen activators
title_full_unstemmed Support vector machine (SVM) based multiclass prediction with basic statistical analysis of plasminogen activators
title_short Support vector machine (SVM) based multiclass prediction with basic statistical analysis of plasminogen activators
title_sort support vector machine (svm) based multiclass prediction with basic statistical analysis of plasminogen activators
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3924408/
https://www.ncbi.nlm.nih.gov/pubmed/24468032
http://dx.doi.org/10.1186/1756-0500-7-63
work_keys_str_mv AT muthukrishnanselvaraj supportvectormachinesvmbasedmulticlasspredictionwithbasicstatisticalanalysisofplasminogenactivators
AT purimunish supportvectormachinesvmbasedmulticlasspredictionwithbasicstatisticalanalysisofplasminogenactivators
AT lefevrechristophe supportvectormachinesvmbasedmulticlasspredictionwithbasicstatisticalanalysisofplasminogenactivators