Cargando…
MetWAMer: eukaryotic translation initiation site prediction
BACKGROUND: Translation initiation site (TIS) identification is an important aspect of the gene annotation process, requisite for the accurate delineation of protein sequences from transcript data. We have developed the MetWAMer package for TIS prediction in eukaryotic open reading frames of non-vir...
Autores principales: | , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2008
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2603428/ https://www.ncbi.nlm.nih.gov/pubmed/18801175 http://dx.doi.org/10.1186/1471-2105-9-381 |
_version_ | 1782162590253711360 |
---|---|
author | Sparks, Michael E Brendel, Volker |
author_facet | Sparks, Michael E Brendel, Volker |
author_sort | Sparks, Michael E |
collection | PubMed |
description | BACKGROUND: Translation initiation site (TIS) identification is an important aspect of the gene annotation process, requisite for the accurate delineation of protein sequences from transcript data. We have developed the MetWAMer package for TIS prediction in eukaryotic open reading frames of non-viral origin. MetWAMer can be used as a stand-alone, third-party tool for post-processing gene structure annotations generated by external computational programs and/or pipelines, or directly integrated into gene structure prediction software implementations. RESULTS: MetWAMer currently implements five distinct methods for TIS prediction, the most accurate of which is a routine that combines weighted, signal-based translation initiation site scores and the contrast in coding potential of sequences flanking TISs using a perceptron. Also, our program implements clustering capabilities through use of the k-medoids algorithm, thereby enabling cluster-specific TIS parameter utilization. In practice, our static weight array matrix-based indexing method for parameter set lookup can be used with good results in data sets exhibiting moderate levels of 5'-complete coverage. CONCLUSION: We demonstrate that improvements in statistically-based models for TIS prediction can be achieved by taking the class of each potential start-methionine into account pending certain testing conditions, and that our perceptron-based model is suitable for the TIS identification task. MetWAMer represents a well-documented, extensible, and freely available software system that can be readily re-trained for differing target applications and/or extended with existing and novel TIS prediction methods, to support further research efforts in this area. |
format | Text |
id | pubmed-2603428 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2008 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-26034282008-12-17 MetWAMer: eukaryotic translation initiation site prediction Sparks, Michael E Brendel, Volker BMC Bioinformatics Software BACKGROUND: Translation initiation site (TIS) identification is an important aspect of the gene annotation process, requisite for the accurate delineation of protein sequences from transcript data. We have developed the MetWAMer package for TIS prediction in eukaryotic open reading frames of non-viral origin. MetWAMer can be used as a stand-alone, third-party tool for post-processing gene structure annotations generated by external computational programs and/or pipelines, or directly integrated into gene structure prediction software implementations. RESULTS: MetWAMer currently implements five distinct methods for TIS prediction, the most accurate of which is a routine that combines weighted, signal-based translation initiation site scores and the contrast in coding potential of sequences flanking TISs using a perceptron. Also, our program implements clustering capabilities through use of the k-medoids algorithm, thereby enabling cluster-specific TIS parameter utilization. In practice, our static weight array matrix-based indexing method for parameter set lookup can be used with good results in data sets exhibiting moderate levels of 5'-complete coverage. CONCLUSION: We demonstrate that improvements in statistically-based models for TIS prediction can be achieved by taking the class of each potential start-methionine into account pending certain testing conditions, and that our perceptron-based model is suitable for the TIS identification task. MetWAMer represents a well-documented, extensible, and freely available software system that can be readily re-trained for differing target applications and/or extended with existing and novel TIS prediction methods, to support further research efforts in this area. BioMed Central 2008-09-18 /pmc/articles/PMC2603428/ /pubmed/18801175 http://dx.doi.org/10.1186/1471-2105-9-381 Text en Copyright © 2008 Sparks and Brendel; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Software Sparks, Michael E Brendel, Volker MetWAMer: eukaryotic translation initiation site prediction |
title | MetWAMer: eukaryotic translation initiation site prediction |
title_full | MetWAMer: eukaryotic translation initiation site prediction |
title_fullStr | MetWAMer: eukaryotic translation initiation site prediction |
title_full_unstemmed | MetWAMer: eukaryotic translation initiation site prediction |
title_short | MetWAMer: eukaryotic translation initiation site prediction |
title_sort | metwamer: eukaryotic translation initiation site prediction |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2603428/ https://www.ncbi.nlm.nih.gov/pubmed/18801175 http://dx.doi.org/10.1186/1471-2105-9-381 |
work_keys_str_mv | AT sparksmichaele metwamereukaryotictranslationinitiationsiteprediction AT brendelvolker metwamereukaryotictranslationinitiationsiteprediction |