Cargando…
prot4EST: Translating Expressed Sequence Tags from neglected genomes
BACKGROUND: The genomes of an increasing number of species are being investigated through generation of expressed sequence tags (ESTs). However, ESTs are prone to sequencing errors and typically define incomplete transcripts, making downstream annotation difficult. Annotation would be greatly improv...
Autores principales: | , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2004
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC543579/ https://www.ncbi.nlm.nih.gov/pubmed/15571632 http://dx.doi.org/10.1186/1471-2105-5-187 |
_version_ | 1782122132133642240 |
---|---|
author | Wasmuth, James D Blaxter, Mark L |
author_facet | Wasmuth, James D Blaxter, Mark L |
author_sort | Wasmuth, James D |
collection | PubMed |
description | BACKGROUND: The genomes of an increasing number of species are being investigated through generation of expressed sequence tags (ESTs). However, ESTs are prone to sequencing errors and typically define incomplete transcripts, making downstream annotation difficult. Annotation would be greatly improved with robust polypeptide translations. Many current solutions for EST translation require a large number of full-length gene sequences for training purposes, a resource that is not available for the majority of EST projects. RESULTS: As part of our ongoing EST programs investigating these "neglected" genomes, we have developed a polypeptide prediction pipeline, prot4EST. It incorporates freely available software to produce final translations that are more accurate than those derived from any single method. We show that this integrated approach goes a long way to overcoming the deficit in training data. CONCLUSIONS: prot4EST provides a portable EST translation solution and can be usefully applied to >95% of EST projects to improve downstream annotation. It is freely available from . |
format | Text |
id | pubmed-543579 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2004 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-5435792005-01-09 prot4EST: Translating Expressed Sequence Tags from neglected genomes Wasmuth, James D Blaxter, Mark L BMC Bioinformatics Software BACKGROUND: The genomes of an increasing number of species are being investigated through generation of expressed sequence tags (ESTs). However, ESTs are prone to sequencing errors and typically define incomplete transcripts, making downstream annotation difficult. Annotation would be greatly improved with robust polypeptide translations. Many current solutions for EST translation require a large number of full-length gene sequences for training purposes, a resource that is not available for the majority of EST projects. RESULTS: As part of our ongoing EST programs investigating these "neglected" genomes, we have developed a polypeptide prediction pipeline, prot4EST. It incorporates freely available software to produce final translations that are more accurate than those derived from any single method. We show that this integrated approach goes a long way to overcoming the deficit in training data. CONCLUSIONS: prot4EST provides a portable EST translation solution and can be usefully applied to >95% of EST projects to improve downstream annotation. It is freely available from . BioMed Central 2004-11-30 /pmc/articles/PMC543579/ /pubmed/15571632 http://dx.doi.org/10.1186/1471-2105-5-187 Text en Copyright © 2004 Wasmuth and Blaxter; licensee BioMed Central Ltd. |
spellingShingle | Software Wasmuth, James D Blaxter, Mark L prot4EST: Translating Expressed Sequence Tags from neglected genomes |
title | prot4EST: Translating Expressed Sequence Tags from neglected genomes |
title_full | prot4EST: Translating Expressed Sequence Tags from neglected genomes |
title_fullStr | prot4EST: Translating Expressed Sequence Tags from neglected genomes |
title_full_unstemmed | prot4EST: Translating Expressed Sequence Tags from neglected genomes |
title_short | prot4EST: Translating Expressed Sequence Tags from neglected genomes |
title_sort | prot4est: translating expressed sequence tags from neglected genomes |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC543579/ https://www.ncbi.nlm.nih.gov/pubmed/15571632 http://dx.doi.org/10.1186/1471-2105-5-187 |
work_keys_str_mv | AT wasmuthjamesd prot4esttranslatingexpressedsequencetagsfromneglectedgenomes AT blaxtermarkl prot4esttranslatingexpressedsequencetagsfromneglectedgenomes |