Cargando…

Augmented training of hidden Markov models to recognize remote homologs via simulated evolution

Motivation: While profile hidden Markov models (HMMs) are successful and powerful methods to recognize homologous proteins, they can break down when homology becomes too distant due to lack of sufficient training data. We show that we can improve the performance of HMMs in this domain by using a sim...

Descripción completa

Detalles Bibliográficos
Autores principales: Kumar, Anoop, Cowen, Lenore
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2732314/
https://www.ncbi.nlm.nih.gov/pubmed/19389731
http://dx.doi.org/10.1093/bioinformatics/btp265
_version_ 1782171023772221440
author Kumar, Anoop
Cowen, Lenore
author_facet Kumar, Anoop
Cowen, Lenore
author_sort Kumar, Anoop
collection PubMed
description Motivation: While profile hidden Markov models (HMMs) are successful and powerful methods to recognize homologous proteins, they can break down when homology becomes too distant due to lack of sufficient training data. We show that we can improve the performance of HMMs in this domain by using a simple simulated model of evolution to create an augmented training set. Results: We show, in two different remote protein homolog tasks, that HMMs whose training is augmented with simulated evolution outperform HMMs trained only on real data. We find that a mutation rate between 15 and 20% performs best for recognizing G-protein coupled receptor proteins in different classes, and for recognizing SCOP super-family proteins from different families. Contacts: anoop.kumar@tufts.edu;lenore.cowen@tufts.edu
format Text
id pubmed-2732314
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-27323142009-08-27 Augmented training of hidden Markov models to recognize remote homologs via simulated evolution Kumar, Anoop Cowen, Lenore Bioinformatics Original Papers Motivation: While profile hidden Markov models (HMMs) are successful and powerful methods to recognize homologous proteins, they can break down when homology becomes too distant due to lack of sufficient training data. We show that we can improve the performance of HMMs in this domain by using a simple simulated model of evolution to create an augmented training set. Results: We show, in two different remote protein homolog tasks, that HMMs whose training is augmented with simulated evolution outperform HMMs trained only on real data. We find that a mutation rate between 15 and 20% performs best for recognizing G-protein coupled receptor proteins in different classes, and for recognizing SCOP super-family proteins from different families. Contacts: anoop.kumar@tufts.edu;lenore.cowen@tufts.edu Oxford University Press 2009-07-01 2009-04-23 /pmc/articles/PMC2732314/ /pubmed/19389731 http://dx.doi.org/10.1093/bioinformatics/btp265 Text en © 2009 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Kumar, Anoop
Cowen, Lenore
Augmented training of hidden Markov models to recognize remote homologs via simulated evolution
title Augmented training of hidden Markov models to recognize remote homologs via simulated evolution
title_full Augmented training of hidden Markov models to recognize remote homologs via simulated evolution
title_fullStr Augmented training of hidden Markov models to recognize remote homologs via simulated evolution
title_full_unstemmed Augmented training of hidden Markov models to recognize remote homologs via simulated evolution
title_short Augmented training of hidden Markov models to recognize remote homologs via simulated evolution
title_sort augmented training of hidden markov models to recognize remote homologs via simulated evolution
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2732314/
https://www.ncbi.nlm.nih.gov/pubmed/19389731
http://dx.doi.org/10.1093/bioinformatics/btp265
work_keys_str_mv AT kumaranoop augmentedtrainingofhiddenmarkovmodelstorecognizeremotehomologsviasimulatedevolution
AT cowenlenore augmentedtrainingofhiddenmarkovmodelstorecognizeremotehomologsviasimulatedevolution