Cargando…

prot4EST: Translating Expressed Sequence Tags from neglected genomes

BACKGROUND: The genomes of an increasing number of species are being investigated through generation of expressed sequence tags (ESTs). However, ESTs are prone to sequencing errors and typically define incomplete transcripts, making downstream annotation difficult. Annotation would be greatly improv...

Descripción completa

Detalles Bibliográficos
Autores principales: Wasmuth, James D, Blaxter, Mark L
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2004
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC543579/
https://www.ncbi.nlm.nih.gov/pubmed/15571632
http://dx.doi.org/10.1186/1471-2105-5-187
_version_ 1782122132133642240
author Wasmuth, James D
Blaxter, Mark L
author_facet Wasmuth, James D
Blaxter, Mark L
author_sort Wasmuth, James D
collection PubMed
description BACKGROUND: The genomes of an increasing number of species are being investigated through generation of expressed sequence tags (ESTs). However, ESTs are prone to sequencing errors and typically define incomplete transcripts, making downstream annotation difficult. Annotation would be greatly improved with robust polypeptide translations. Many current solutions for EST translation require a large number of full-length gene sequences for training purposes, a resource that is not available for the majority of EST projects. RESULTS: As part of our ongoing EST programs investigating these "neglected" genomes, we have developed a polypeptide prediction pipeline, prot4EST. It incorporates freely available software to produce final translations that are more accurate than those derived from any single method. We show that this integrated approach goes a long way to overcoming the deficit in training data. CONCLUSIONS: prot4EST provides a portable EST translation solution and can be usefully applied to >95% of EST projects to improve downstream annotation. It is freely available from .
format Text
id pubmed-543579
institution National Center for Biotechnology Information
language English
publishDate 2004
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-5435792005-01-09 prot4EST: Translating Expressed Sequence Tags from neglected genomes Wasmuth, James D Blaxter, Mark L BMC Bioinformatics Software BACKGROUND: The genomes of an increasing number of species are being investigated through generation of expressed sequence tags (ESTs). However, ESTs are prone to sequencing errors and typically define incomplete transcripts, making downstream annotation difficult. Annotation would be greatly improved with robust polypeptide translations. Many current solutions for EST translation require a large number of full-length gene sequences for training purposes, a resource that is not available for the majority of EST projects. RESULTS: As part of our ongoing EST programs investigating these "neglected" genomes, we have developed a polypeptide prediction pipeline, prot4EST. It incorporates freely available software to produce final translations that are more accurate than those derived from any single method. We show that this integrated approach goes a long way to overcoming the deficit in training data. CONCLUSIONS: prot4EST provides a portable EST translation solution and can be usefully applied to >95% of EST projects to improve downstream annotation. It is freely available from . BioMed Central 2004-11-30 /pmc/articles/PMC543579/ /pubmed/15571632 http://dx.doi.org/10.1186/1471-2105-5-187 Text en Copyright © 2004 Wasmuth and Blaxter; licensee BioMed Central Ltd.
spellingShingle Software
Wasmuth, James D
Blaxter, Mark L
prot4EST: Translating Expressed Sequence Tags from neglected genomes
title prot4EST: Translating Expressed Sequence Tags from neglected genomes
title_full prot4EST: Translating Expressed Sequence Tags from neglected genomes
title_fullStr prot4EST: Translating Expressed Sequence Tags from neglected genomes
title_full_unstemmed prot4EST: Translating Expressed Sequence Tags from neglected genomes
title_short prot4EST: Translating Expressed Sequence Tags from neglected genomes
title_sort prot4est: translating expressed sequence tags from neglected genomes
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC543579/
https://www.ncbi.nlm.nih.gov/pubmed/15571632
http://dx.doi.org/10.1186/1471-2105-5-187
work_keys_str_mv AT wasmuthjamesd prot4esttranslatingexpressedsequencetagsfromneglectedgenomes
AT blaxtermarkl prot4esttranslatingexpressedsequencetagsfromneglectedgenomes