Cargando…

Improving model construction of profile HMMs for remote homology detection through structural alignment

BACKGROUND: Remote homology detection is a challenging problem in Bioinformatics. Arguably, profile Hidden Markov Models (pHMMs) are one of the most successful approaches in addressing this important problem. pHMM packages present a relatively small computational cost, and perform particularly well...

Descripción completa

Detalles Bibliográficos
Autores principales: Bernardes, Juliana S, Dávila, Alberto MR, Costa, Vítor S, Zaverucha, Gerson
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2245980/
https://www.ncbi.nlm.nih.gov/pubmed/17999748
http://dx.doi.org/10.1186/1471-2105-8-435
_version_ 1782150699610537984
author Bernardes, Juliana S
Dávila, Alberto MR
Costa, Vítor S
Zaverucha, Gerson
author_facet Bernardes, Juliana S
Dávila, Alberto MR
Costa, Vítor S
Zaverucha, Gerson
author_sort Bernardes, Juliana S
collection PubMed
description BACKGROUND: Remote homology detection is a challenging problem in Bioinformatics. Arguably, profile Hidden Markov Models (pHMMs) are one of the most successful approaches in addressing this important problem. pHMM packages present a relatively small computational cost, and perform particularly well at recognizing remote homologies. This raises the question of whether structural alignments could impact the performance of pHMMs trained from proteins in the Twilight Zone, as structural alignments are often more accurate than sequence alignments at identifying motifs and functional residues. Next, we assess the impact of using structural alignments in pHMM performance. RESULTS: We used the SCOP database to perform our experiments. Structural alignments were obtained using the 3DCOFFEE and MAMMOTH-mult tools; sequence alignments were obtained using CLUSTALW, TCOFFEE, MAFFT and PROBCONS. We performed leave-one-family-out cross-validation over super-families. Performance was evaluated through ROC curves and paired two tailed t-test. CONCLUSION: We observed that pHMMs derived from structural alignments performed significantly better than pHMMs derived from sequence alignment in low-identity regions, mainly below 20%. We believe this is because structural alignment tools are better at focusing on the important patterns that are more often conserved through evolution, resulting in higher quality pHMMs. On the other hand, sensitivity of these tools is still quite low for these low-identity regions. Our results suggest a number of possible directions for improvements in this area.
format Text
id pubmed-2245980
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-22459802008-02-16 Improving model construction of profile HMMs for remote homology detection through structural alignment Bernardes, Juliana S Dávila, Alberto MR Costa, Vítor S Zaverucha, Gerson BMC Bioinformatics Research Article BACKGROUND: Remote homology detection is a challenging problem in Bioinformatics. Arguably, profile Hidden Markov Models (pHMMs) are one of the most successful approaches in addressing this important problem. pHMM packages present a relatively small computational cost, and perform particularly well at recognizing remote homologies. This raises the question of whether structural alignments could impact the performance of pHMMs trained from proteins in the Twilight Zone, as structural alignments are often more accurate than sequence alignments at identifying motifs and functional residues. Next, we assess the impact of using structural alignments in pHMM performance. RESULTS: We used the SCOP database to perform our experiments. Structural alignments were obtained using the 3DCOFFEE and MAMMOTH-mult tools; sequence alignments were obtained using CLUSTALW, TCOFFEE, MAFFT and PROBCONS. We performed leave-one-family-out cross-validation over super-families. Performance was evaluated through ROC curves and paired two tailed t-test. CONCLUSION: We observed that pHMMs derived from structural alignments performed significantly better than pHMMs derived from sequence alignment in low-identity regions, mainly below 20%. We believe this is because structural alignment tools are better at focusing on the important patterns that are more often conserved through evolution, resulting in higher quality pHMMs. On the other hand, sensitivity of these tools is still quite low for these low-identity regions. Our results suggest a number of possible directions for improvements in this area. BioMed Central 2007-11-09 /pmc/articles/PMC2245980/ /pubmed/17999748 http://dx.doi.org/10.1186/1471-2105-8-435 Text en Copyright © 2007 Bernardes et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Bernardes, Juliana S
Dávila, Alberto MR
Costa, Vítor S
Zaverucha, Gerson
Improving model construction of profile HMMs for remote homology detection through structural alignment
title Improving model construction of profile HMMs for remote homology detection through structural alignment
title_full Improving model construction of profile HMMs for remote homology detection through structural alignment
title_fullStr Improving model construction of profile HMMs for remote homology detection through structural alignment
title_full_unstemmed Improving model construction of profile HMMs for remote homology detection through structural alignment
title_short Improving model construction of profile HMMs for remote homology detection through structural alignment
title_sort improving model construction of profile hmms for remote homology detection through structural alignment
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2245980/
https://www.ncbi.nlm.nih.gov/pubmed/17999748
http://dx.doi.org/10.1186/1471-2105-8-435
work_keys_str_mv AT bernardesjulianas improvingmodelconstructionofprofilehmmsforremotehomologydetectionthroughstructuralalignment
AT davilaalbertomr improvingmodelconstructionofprofilehmmsforremotehomologydetectionthroughstructuralalignment
AT costavitors improvingmodelconstructionofprofilehmmsforremotehomologydetectionthroughstructuralalignment
AT zaveruchagerson improvingmodelconstructionofprofilehmmsforremotehomologydetectionthroughstructuralalignment