
Classification of seed members of five riboswitch families as short sequences based on the features extracted by Block Location-Based Feature Extraction (BLBFE) method

Introduction: Riboswitches are short regulatory elements generally found in the untranslated regions of prokaryotes’ mRNAs and classified into several families. Due to the binding possibility between riboswitches and antibiotics, their usage as engineered regulatory elements and al...


Bibliographic Details
Main authors: Golabi, Faegheh; Mehdizadeh Aghdam, Elnaz; Shamsi, Mousa; Sedaaghi, Mohammad Hossein; Barzegar, Abolfazl; Hejazi, Mohammad Saeid
Format: Online Article Text
Language: English
Published: Tabriz University of Medical Sciences (TUOMS Publishing Group) 2021
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8022236/
https://www.ncbi.nlm.nih.gov/pubmed/33842280
http://dx.doi.org/10.34172/bi.2021.17
_version_ 1783674898381537280
author Golabi, Faegheh
Mehdizadeh Aghdam, Elnaz
Shamsi, Mousa
Sedaaghi, Mohammad Hossein
Barzegar, Abolfazl
Hejazi, Mohammad Saeid
collection PubMed
description Introduction: Riboswitches are short regulatory elements generally found in the untranslated regions of prokaryotes’ mRNAs and classified into several families. Owing to the possibility of binding between riboswitches and antibiotics, their use as engineered regulatory elements, and their evolutionary contribution, the need for bioinformatics tools for riboswitch detection is increasing. We have previously introduced an alignment-independent algorithm for the identification of frequent sequential blocks in the families of riboswitches. Herein, we report the application of the block location-based feature extraction (BLBFE) strategy, which uses the locations of detected blocks on riboswitch sequences as features for classification of seed sequences. In addition, mono- and dinucleotide frequencies, k-mer, DAC, DCC, DACC, PC-PseDNC-General and SC-PseDNC-General were investigated as alternative feature extraction strategies. Methods: Decision tree, KNN, LDA, and Naïve Bayes classifiers, together with k-fold cross-validation, were applied to all feature extraction methods to compare their performance in terms of accuracy, sensitivity, specificity, and f-score. Results: The BLBFE strategy classified the riboswitches with an average correct classification rate (CCR) of 87.65%. Moreover, the performance of the proposed feature extraction method was confirmed with average values of 94.31%, 85.01%, 95.45% and 85.38% for accuracy, sensitivity, specificity, and f-score, respectively. Conclusion: Our results confirmed the performance of the BLBFE strategy in the classification and discrimination of the riboswitch groups, showing remarkably higher values of CCR, accuracy, sensitivity, specificity and f-score relative to previously studied feature extraction methods.
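As a rough illustration of two ideas the abstract mentions, the sketch below computes k-mer frequency features (k = 1 and k = 2 give the mono- and dinucleotide frequencies) and the per-class accuracy, sensitivity, specificity, and f-score. It is not the authors' BLBFE method or their code. The function names `kmer_frequencies` and `one_vs_rest_metrics`, the ACGU alphabet, and the one-vs-rest treatment of each family are assumptions made here for illustration only.

```python
from collections import Counter
from itertools import product

ALPHABET = "ACGU"  # assumed RNA alphabet; T is mapped to U below


def kmer_frequencies(seq, k):
    """Return normalized k-mer frequencies in fixed lexicographic order."""
    seq = seq.upper().replace("T", "U")
    kmers = ["".join(p) for p in product(ALPHABET, repeat=k)]
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    # Only k-mers made of ACGU are counted; ambiguous symbols are ignored.
    total = sum(counts[m] for m in kmers)
    return [counts[m] / total if total else 0.0 for m in kmers]


def one_vs_rest_metrics(y_true, y_pred, label):
    """Treat `label` as the positive class and every other class as negative."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == label and p == label for t, p in pairs)
    tn = sum(t != label and p != label for t, p in pairs)
    fp = sum(t != label and p == label for t, p in pairs)
    fn = sum(t == label and p != label for t, p in pairs)
    sensitivity = tp / (tp + fn) if tp + fn else 0.0
    specificity = tn / (tn + fp) if tn + fp else 0.0
    precision = tp / (tp + fp) if tp + fp else 0.0
    denom = precision + sensitivity
    return {
        "accuracy": (tp + tn) / len(pairs) if pairs else 0.0,
        "sensitivity": sensitivity,
        "specificity": specificity,
        "f_score": 2 * precision * sensitivity / denom if denom else 0.0,
    }
```

Averaging the output of `one_vs_rest_metrics` over all family labels gives one value per measure. Whether the averages reported in the abstract were computed in exactly this way is not stated in this record.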
format Online
Article
Text
id pubmed-8022236
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Tabriz University of Medical Sciences (TUOMS Publishing Group)
record_format MEDLINE/PubMed
spelling pubmed-8022236 2021-04-09 Bioimpacts Original Research Tabriz University of Medical Sciences (TUOMS Publishing Group) 2021 2020-04-17 /pmc/articles/PMC8022236/ /pubmed/33842280 http://dx.doi.org/10.34172/bi.2021.17 Text en © 2021 The Author(s) This work is published by BioImpacts as an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/). Non-commercial uses of the work are permitted, provided the original work is properly cited.
title Classification of seed members of five riboswitch families as short sequences based on the features extracted by Block Location-Based Feature Extraction (BLBFE) method
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8022236/
https://www.ncbi.nlm.nih.gov/pubmed/33842280
http://dx.doi.org/10.34172/bi.2021.17