Cargando…
PSSMCOOL: a comprehensive R package for generating evolutionary-based descriptors of protein sequences from PSSM profiles
Position-specific scoring matrix (PSSM), also called profile, is broadly used for representing the evolutionary history of a given protein sequence. Several investigations reported that the PSSM-based feature descriptors can improve the prediction of various protein attributes such as interaction, f...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8977839/ https://www.ncbi.nlm.nih.gov/pubmed/35388370 http://dx.doi.org/10.1093/biomethods/bpac008 |
_version_ | 1784680851362021376 |
---|---|
author | Mohammadi, Alireza Zahiri, Javad Mohammadi, Saber Khodarahmi, Mohsen Arab, Seyed Shahriar |
author_facet | Mohammadi, Alireza Zahiri, Javad Mohammadi, Saber Khodarahmi, Mohsen Arab, Seyed Shahriar |
author_sort | Mohammadi, Alireza |
collection | PubMed |
description | Position-specific scoring matrix (PSSM), also called profile, is broadly used for representing the evolutionary history of a given protein sequence. Several investigations reported that the PSSM-based feature descriptors can improve the prediction of various protein attributes such as interaction, function, subcellular localization, secondary structure, disorder regions, and accessible surface area. While plenty of algorithms have been suggested for extracting evolutionary features from PSSM in recent years, there is not any integrated standalone tool for providing these descriptors. Here, we introduce PSSMCOOL, a flexible comprehensive R package that generates 38 PSSM-based feature vectors. To our best knowledge, PSSMCOOL is the first PSSM-based feature extraction tool implemented in R. With the growing demand for exploiting machine-learning algorithms in computational biology, this package would be a practical tool for machine-learning predictions. |
format | Online Article Text |
id | pubmed-8977839 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-89778392022-04-05 PSSMCOOL: a comprehensive R package for generating evolutionary-based descriptors of protein sequences from PSSM profiles Mohammadi, Alireza Zahiri, Javad Mohammadi, Saber Khodarahmi, Mohsen Arab, Seyed Shahriar Biol Methods Protoc Methods Article Position-specific scoring matrix (PSSM), also called profile, is broadly used for representing the evolutionary history of a given protein sequence. Several investigations reported that the PSSM-based feature descriptors can improve the prediction of various protein attributes such as interaction, function, subcellular localization, secondary structure, disorder regions, and accessible surface area. While plenty of algorithms have been suggested for extracting evolutionary features from PSSM in recent years, there is not any integrated standalone tool for providing these descriptors. Here, we introduce PSSMCOOL, a flexible comprehensive R package that generates 38 PSSM-based feature vectors. To our best knowledge, PSSMCOOL is the first PSSM-based feature extraction tool implemented in R. With the growing demand for exploiting machine-learning algorithms in computational biology, this package would be a practical tool for machine-learning predictions. Oxford University Press 2022-03-30 /pmc/articles/PMC8977839/ /pubmed/35388370 http://dx.doi.org/10.1093/biomethods/bpac008 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Methods Article Mohammadi, Alireza Zahiri, Javad Mohammadi, Saber Khodarahmi, Mohsen Arab, Seyed Shahriar PSSMCOOL: a comprehensive R package for generating evolutionary-based descriptors of protein sequences from PSSM profiles |
title | PSSMCOOL: a comprehensive R package for generating evolutionary-based descriptors of protein sequences from PSSM profiles |
title_full | PSSMCOOL: a comprehensive R package for generating evolutionary-based descriptors of protein sequences from PSSM profiles |
title_fullStr | PSSMCOOL: a comprehensive R package for generating evolutionary-based descriptors of protein sequences from PSSM profiles |
title_full_unstemmed | PSSMCOOL: a comprehensive R package for generating evolutionary-based descriptors of protein sequences from PSSM profiles |
title_short | PSSMCOOL: a comprehensive R package for generating evolutionary-based descriptors of protein sequences from PSSM profiles |
title_sort | pssmcool: a comprehensive r package for generating evolutionary-based descriptors of protein sequences from pssm profiles |
topic | Methods Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8977839/ https://www.ncbi.nlm.nih.gov/pubmed/35388370 http://dx.doi.org/10.1093/biomethods/bpac008 |
work_keys_str_mv | AT mohammadialireza pssmcoolacomprehensiverpackageforgeneratingevolutionarybaseddescriptorsofproteinsequencesfrompssmprofiles AT zahirijavad pssmcoolacomprehensiverpackageforgeneratingevolutionarybaseddescriptorsofproteinsequencesfrompssmprofiles AT mohammadisaber pssmcoolacomprehensiverpackageforgeneratingevolutionarybaseddescriptorsofproteinsequencesfrompssmprofiles AT khodarahmimohsen pssmcoolacomprehensiverpackageforgeneratingevolutionarybaseddescriptorsofproteinsequencesfrompssmprofiles AT arabseyedshahriar pssmcoolacomprehensiverpackageforgeneratingevolutionarybaseddescriptorsofproteinsequencesfrompssmprofiles |