Cargando…

Graph Theory-Based Sequence Descriptors as Remote Homology Predictors

Alignment-free (AF) methodologies have increased in popularity in the last decades as alternative tools to alignment-based (AB) algorithms for performing comparative sequence analyses. They have been especially useful to detect remote homologs within the twilight zone of highly diverse gene/protein...

Descripción completa

Detalles Bibliográficos
Autores principales: Agüero-Chapin, Guillermin, Galpert, Deborah, Molina-Ruiz, Reinaldo, Ancede-Gallardo, Evys, Pérez-Machado, Gisselle, De la Riva, Gustavo A., Antunes, Agostinho
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7022958/
https://www.ncbi.nlm.nih.gov/pubmed/31878100
http://dx.doi.org/10.3390/biom10010026
_version_ 1783498137862668288
author Agüero-Chapin, Guillermin
Galpert, Deborah
Molina-Ruiz, Reinaldo
Ancede-Gallardo, Evys
Pérez-Machado, Gisselle
De la Riva, Gustavo A.
Antunes, Agostinho
author_facet Agüero-Chapin, Guillermin
Galpert, Deborah
Molina-Ruiz, Reinaldo
Ancede-Gallardo, Evys
Pérez-Machado, Gisselle
De la Riva, Gustavo A.
Antunes, Agostinho
author_sort Agüero-Chapin, Guillermin
collection PubMed
description Alignment-free (AF) methodologies have increased in popularity in the last decades as alternative tools to alignment-based (AB) algorithms for performing comparative sequence analyses. They have been especially useful to detect remote homologs within the twilight zone of highly diverse gene/protein families and superfamilies. The most popular alignment-free methodologies, as well as their applications to classification problems, have been described in previous reviews. Despite a new set of graph theory-derived sequence/structural descriptors that have been gaining relevance in the detection of remote homology, they have been omitted as AF predictors when the topic is addressed. Here, we first go over the most popular AF approaches used for detecting homology signals within the twilight zone and then bring out the state-of-the-art tools encoding graph theory-derived sequence/structure descriptors and their success for identifying remote homologs. We also highlight the tendency of integrating AF features/measures with the AB ones, either into the same prediction model or by assembling the predictions from different algorithms using voting/weighting strategies, for improving the detection of remote signals. Lastly, we briefly discuss the efforts made to scale up AB and AF features/measures for the comparison of multiple genomes and proteomes. Alongside the achieved experiences in remote homology detection by both the most popular AF tools and other less known ones, we provide our own using the graphical–numerical methodologies, MARCH-INSIDE, TI2BioP, and ProtDCal. We also present a new Python-based tool (SeqDivA) with a friendly graphical user interface (GUI) for delimiting the twilight zone by using several similar criteria.
format Online
Article
Text
id pubmed-7022958
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-70229582020-03-12 Graph Theory-Based Sequence Descriptors as Remote Homology Predictors Agüero-Chapin, Guillermin Galpert, Deborah Molina-Ruiz, Reinaldo Ancede-Gallardo, Evys Pérez-Machado, Gisselle De la Riva, Gustavo A. Antunes, Agostinho Biomolecules Review Alignment-free (AF) methodologies have increased in popularity in the last decades as alternative tools to alignment-based (AB) algorithms for performing comparative sequence analyses. They have been especially useful to detect remote homologs within the twilight zone of highly diverse gene/protein families and superfamilies. The most popular alignment-free methodologies, as well as their applications to classification problems, have been described in previous reviews. Despite a new set of graph theory-derived sequence/structural descriptors that have been gaining relevance in the detection of remote homology, they have been omitted as AF predictors when the topic is addressed. Here, we first go over the most popular AF approaches used for detecting homology signals within the twilight zone and then bring out the state-of-the-art tools encoding graph theory-derived sequence/structure descriptors and their success for identifying remote homologs. We also highlight the tendency of integrating AF features/measures with the AB ones, either into the same prediction model or by assembling the predictions from different algorithms using voting/weighting strategies, for improving the detection of remote signals. Lastly, we briefly discuss the efforts made to scale up AB and AF features/measures for the comparison of multiple genomes and proteomes. Alongside the achieved experiences in remote homology detection by both the most popular AF tools and other less known ones, we provide our own using the graphical–numerical methodologies, MARCH-INSIDE, TI2BioP, and ProtDCal. We also present a new Python-based tool (SeqDivA) with a friendly graphical user interface (GUI) for delimiting the twilight zone by using several similar criteria. MDPI 2019-12-23 /pmc/articles/PMC7022958/ /pubmed/31878100 http://dx.doi.org/10.3390/biom10010026 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Review
Agüero-Chapin, Guillermin
Galpert, Deborah
Molina-Ruiz, Reinaldo
Ancede-Gallardo, Evys
Pérez-Machado, Gisselle
De la Riva, Gustavo A.
Antunes, Agostinho
Graph Theory-Based Sequence Descriptors as Remote Homology Predictors
title Graph Theory-Based Sequence Descriptors as Remote Homology Predictors
title_full Graph Theory-Based Sequence Descriptors as Remote Homology Predictors
title_fullStr Graph Theory-Based Sequence Descriptors as Remote Homology Predictors
title_full_unstemmed Graph Theory-Based Sequence Descriptors as Remote Homology Predictors
title_short Graph Theory-Based Sequence Descriptors as Remote Homology Predictors
title_sort graph theory-based sequence descriptors as remote homology predictors
topic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7022958/
https://www.ncbi.nlm.nih.gov/pubmed/31878100
http://dx.doi.org/10.3390/biom10010026
work_keys_str_mv AT aguerochapinguillermin graphtheorybasedsequencedescriptorsasremotehomologypredictors
AT galpertdeborah graphtheorybasedsequencedescriptorsasremotehomologypredictors
AT molinaruizreinaldo graphtheorybasedsequencedescriptorsasremotehomologypredictors
AT ancedegallardoevys graphtheorybasedsequencedescriptorsasremotehomologypredictors
AT perezmachadogisselle graphtheorybasedsequencedescriptorsasremotehomologypredictors
AT delarivagustavoa graphtheorybasedsequencedescriptorsasremotehomologypredictors
AT antunesagostinho graphtheorybasedsequencedescriptorsasremotehomologypredictors