Cargando…
Augmented training of hidden Markov models to recognize remote homologs via simulated evolution
Motivation: While profile hidden Markov models (HMMs) are successful and powerful methods to recognize homologous proteins, they can break down when homology becomes too distant due to lack of sufficient training data. We show that we can improve the performance of HMMs in this domain by using a sim...
Autores principales: | , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2732314/ https://www.ncbi.nlm.nih.gov/pubmed/19389731 http://dx.doi.org/10.1093/bioinformatics/btp265 |
_version_ | 1782171023772221440 |
---|---|
author | Kumar, Anoop Cowen, Lenore |
author_facet | Kumar, Anoop Cowen, Lenore |
author_sort | Kumar, Anoop |
collection | PubMed |
description | Motivation: While profile hidden Markov models (HMMs) are successful and powerful methods to recognize homologous proteins, they can break down when homology becomes too distant due to lack of sufficient training data. We show that we can improve the performance of HMMs in this domain by using a simple simulated model of evolution to create an augmented training set. Results: We show, in two different remote protein homolog tasks, that HMMs whose training is augmented with simulated evolution outperform HMMs trained only on real data. We find that a mutation rate between 15 and 20% performs best for recognizing G-protein coupled receptor proteins in different classes, and for recognizing SCOP super-family proteins from different families. Contacts: anoop.kumar@tufts.edu;lenore.cowen@tufts.edu |
format | Text |
id | pubmed-2732314 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-27323142009-08-27 Augmented training of hidden Markov models to recognize remote homologs via simulated evolution Kumar, Anoop Cowen, Lenore Bioinformatics Original Papers Motivation: While profile hidden Markov models (HMMs) are successful and powerful methods to recognize homologous proteins, they can break down when homology becomes too distant due to lack of sufficient training data. We show that we can improve the performance of HMMs in this domain by using a simple simulated model of evolution to create an augmented training set. Results: We show, in two different remote protein homolog tasks, that HMMs whose training is augmented with simulated evolution outperform HMMs trained only on real data. We find that a mutation rate between 15 and 20% performs best for recognizing G-protein coupled receptor proteins in different classes, and for recognizing SCOP super-family proteins from different families. Contacts: anoop.kumar@tufts.edu;lenore.cowen@tufts.edu Oxford University Press 2009-07-01 2009-04-23 /pmc/articles/PMC2732314/ /pubmed/19389731 http://dx.doi.org/10.1093/bioinformatics/btp265 Text en © 2009 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Papers Kumar, Anoop Cowen, Lenore Augmented training of hidden Markov models to recognize remote homologs via simulated evolution |
title | Augmented training of hidden Markov models to recognize remote homologs via simulated evolution |
title_full | Augmented training of hidden Markov models to recognize remote homologs via simulated evolution |
title_fullStr | Augmented training of hidden Markov models to recognize remote homologs via simulated evolution |
title_full_unstemmed | Augmented training of hidden Markov models to recognize remote homologs via simulated evolution |
title_short | Augmented training of hidden Markov models to recognize remote homologs via simulated evolution |
title_sort | augmented training of hidden markov models to recognize remote homologs via simulated evolution |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2732314/ https://www.ncbi.nlm.nih.gov/pubmed/19389731 http://dx.doi.org/10.1093/bioinformatics/btp265 |
work_keys_str_mv | AT kumaranoop augmentedtrainingofhiddenmarkovmodelstorecognizeremotehomologsviasimulatedevolution AT cowenlenore augmentedtrainingofhiddenmarkovmodelstorecognizeremotehomologsviasimulatedevolution |