Cargando…

PSSMCOOL: a comprehensive R package for generating evolutionary-based descriptors of protein sequences from PSSM profiles

Position-specific scoring matrix (PSSM), also called profile, is broadly used for representing the evolutionary history of a given protein sequence. Several investigations reported that the PSSM-based feature descriptors can improve the prediction of various protein attributes such as interaction, f...

Descripción completa

Detalles Bibliográficos
Autores principales: Mohammadi, Alireza, Zahiri, Javad, Mohammadi, Saber, Khodarahmi, Mohsen, Arab, Seyed Shahriar
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8977839/
https://www.ncbi.nlm.nih.gov/pubmed/35388370
http://dx.doi.org/10.1093/biomethods/bpac008
_version_ 1784680851362021376
author Mohammadi, Alireza
Zahiri, Javad
Mohammadi, Saber
Khodarahmi, Mohsen
Arab, Seyed Shahriar
author_facet Mohammadi, Alireza
Zahiri, Javad
Mohammadi, Saber
Khodarahmi, Mohsen
Arab, Seyed Shahriar
author_sort Mohammadi, Alireza
collection PubMed
description Position-specific scoring matrix (PSSM), also called profile, is broadly used for representing the evolutionary history of a given protein sequence. Several investigations reported that the PSSM-based feature descriptors can improve the prediction of various protein attributes such as interaction, function, subcellular localization, secondary structure, disorder regions, and accessible surface area. While plenty of algorithms have been suggested for extracting evolutionary features from PSSM in recent years, there is not any integrated standalone tool for providing these descriptors. Here, we introduce PSSMCOOL, a flexible comprehensive R package that generates 38 PSSM-based feature vectors. To our best knowledge, PSSMCOOL is the first PSSM-based feature extraction tool implemented in R. With the growing demand for exploiting machine-learning algorithms in computational biology, this package would be a practical tool for machine-learning predictions.
format Online
Article
Text
id pubmed-8977839
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-89778392022-04-05 PSSMCOOL: a comprehensive R package for generating evolutionary-based descriptors of protein sequences from PSSM profiles Mohammadi, Alireza Zahiri, Javad Mohammadi, Saber Khodarahmi, Mohsen Arab, Seyed Shahriar Biol Methods Protoc Methods Article Position-specific scoring matrix (PSSM), also called profile, is broadly used for representing the evolutionary history of a given protein sequence. Several investigations reported that the PSSM-based feature descriptors can improve the prediction of various protein attributes such as interaction, function, subcellular localization, secondary structure, disorder regions, and accessible surface area. While plenty of algorithms have been suggested for extracting evolutionary features from PSSM in recent years, there is not any integrated standalone tool for providing these descriptors. Here, we introduce PSSMCOOL, a flexible comprehensive R package that generates 38 PSSM-based feature vectors. To our best knowledge, PSSMCOOL is the first PSSM-based feature extraction tool implemented in R. With the growing demand for exploiting machine-learning algorithms in computational biology, this package would be a practical tool for machine-learning predictions. Oxford University Press 2022-03-30 /pmc/articles/PMC8977839/ /pubmed/35388370 http://dx.doi.org/10.1093/biomethods/bpac008 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Methods Article
Mohammadi, Alireza
Zahiri, Javad
Mohammadi, Saber
Khodarahmi, Mohsen
Arab, Seyed Shahriar
PSSMCOOL: a comprehensive R package for generating evolutionary-based descriptors of protein sequences from PSSM profiles
title PSSMCOOL: a comprehensive R package for generating evolutionary-based descriptors of protein sequences from PSSM profiles
title_full PSSMCOOL: a comprehensive R package for generating evolutionary-based descriptors of protein sequences from PSSM profiles
title_fullStr PSSMCOOL: a comprehensive R package for generating evolutionary-based descriptors of protein sequences from PSSM profiles
title_full_unstemmed PSSMCOOL: a comprehensive R package for generating evolutionary-based descriptors of protein sequences from PSSM profiles
title_short PSSMCOOL: a comprehensive R package for generating evolutionary-based descriptors of protein sequences from PSSM profiles
title_sort pssmcool: a comprehensive r package for generating evolutionary-based descriptors of protein sequences from pssm profiles
topic Methods Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8977839/
https://www.ncbi.nlm.nih.gov/pubmed/35388370
http://dx.doi.org/10.1093/biomethods/bpac008
work_keys_str_mv AT mohammadialireza pssmcoolacomprehensiverpackageforgeneratingevolutionarybaseddescriptorsofproteinsequencesfrompssmprofiles
AT zahirijavad pssmcoolacomprehensiverpackageforgeneratingevolutionarybaseddescriptorsofproteinsequencesfrompssmprofiles
AT mohammadisaber pssmcoolacomprehensiverpackageforgeneratingevolutionarybaseddescriptorsofproteinsequencesfrompssmprofiles
AT khodarahmimohsen pssmcoolacomprehensiverpackageforgeneratingevolutionarybaseddescriptorsofproteinsequencesfrompssmprofiles
AT arabseyedshahriar pssmcoolacomprehensiverpackageforgeneratingevolutionarybaseddescriptorsofproteinsequencesfrompssmprofiles