Cargando…

MetWAMer: eukaryotic translation initiation site prediction

BACKGROUND: Translation initiation site (TIS) identification is an important aspect of the gene annotation process, requisite for the accurate delineation of protein sequences from transcript data. We have developed the MetWAMer package for TIS prediction in eukaryotic open reading frames of non-vir...

Descripción completa

Detalles Bibliográficos
Autores principales: Sparks, Michael E, Brendel, Volker
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2603428/
https://www.ncbi.nlm.nih.gov/pubmed/18801175
http://dx.doi.org/10.1186/1471-2105-9-381
_version_ 1782162590253711360
author Sparks, Michael E
Brendel, Volker
author_facet Sparks, Michael E
Brendel, Volker
author_sort Sparks, Michael E
collection PubMed
description BACKGROUND: Translation initiation site (TIS) identification is an important aspect of the gene annotation process, requisite for the accurate delineation of protein sequences from transcript data. We have developed the MetWAMer package for TIS prediction in eukaryotic open reading frames of non-viral origin. MetWAMer can be used as a stand-alone, third-party tool for post-processing gene structure annotations generated by external computational programs and/or pipelines, or directly integrated into gene structure prediction software implementations. RESULTS: MetWAMer currently implements five distinct methods for TIS prediction, the most accurate of which is a routine that combines weighted, signal-based translation initiation site scores and the contrast in coding potential of sequences flanking TISs using a perceptron. Also, our program implements clustering capabilities through use of the k-medoids algorithm, thereby enabling cluster-specific TIS parameter utilization. In practice, our static weight array matrix-based indexing method for parameter set lookup can be used with good results in data sets exhibiting moderate levels of 5'-complete coverage. CONCLUSION: We demonstrate that improvements in statistically-based models for TIS prediction can be achieved by taking the class of each potential start-methionine into account pending certain testing conditions, and that our perceptron-based model is suitable for the TIS identification task. MetWAMer represents a well-documented, extensible, and freely available software system that can be readily re-trained for differing target applications and/or extended with existing and novel TIS prediction methods, to support further research efforts in this area.
format Text
id pubmed-2603428
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26034282008-12-17 MetWAMer: eukaryotic translation initiation site prediction Sparks, Michael E Brendel, Volker BMC Bioinformatics Software BACKGROUND: Translation initiation site (TIS) identification is an important aspect of the gene annotation process, requisite for the accurate delineation of protein sequences from transcript data. We have developed the MetWAMer package for TIS prediction in eukaryotic open reading frames of non-viral origin. MetWAMer can be used as a stand-alone, third-party tool for post-processing gene structure annotations generated by external computational programs and/or pipelines, or directly integrated into gene structure prediction software implementations. RESULTS: MetWAMer currently implements five distinct methods for TIS prediction, the most accurate of which is a routine that combines weighted, signal-based translation initiation site scores and the contrast in coding potential of sequences flanking TISs using a perceptron. Also, our program implements clustering capabilities through use of the k-medoids algorithm, thereby enabling cluster-specific TIS parameter utilization. In practice, our static weight array matrix-based indexing method for parameter set lookup can be used with good results in data sets exhibiting moderate levels of 5'-complete coverage. CONCLUSION: We demonstrate that improvements in statistically-based models for TIS prediction can be achieved by taking the class of each potential start-methionine into account pending certain testing conditions, and that our perceptron-based model is suitable for the TIS identification task. MetWAMer represents a well-documented, extensible, and freely available software system that can be readily re-trained for differing target applications and/or extended with existing and novel TIS prediction methods, to support further research efforts in this area. BioMed Central 2008-09-18 /pmc/articles/PMC2603428/ /pubmed/18801175 http://dx.doi.org/10.1186/1471-2105-9-381 Text en Copyright © 2008 Sparks and Brendel; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software
Sparks, Michael E
Brendel, Volker
MetWAMer: eukaryotic translation initiation site prediction
title MetWAMer: eukaryotic translation initiation site prediction
title_full MetWAMer: eukaryotic translation initiation site prediction
title_fullStr MetWAMer: eukaryotic translation initiation site prediction
title_full_unstemmed MetWAMer: eukaryotic translation initiation site prediction
title_short MetWAMer: eukaryotic translation initiation site prediction
title_sort metwamer: eukaryotic translation initiation site prediction
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2603428/
https://www.ncbi.nlm.nih.gov/pubmed/18801175
http://dx.doi.org/10.1186/1471-2105-9-381
work_keys_str_mv AT sparksmichaele metwamereukaryotictranslationinitiationsiteprediction
AT brendelvolker metwamereukaryotictranslationinitiationsiteprediction