galaxieEST: addressing EST identity through automated phylogenetic analysis
BACKGROUND: Research involving expressed sequence tags (ESTs) is intricately coupled to the existence of large, well-annotated sequence repositories. Comparatively complete and satisfactory annotated public sequence libraries are, however, available only for a limited range of organisms, rendering t...
Main authors: | Nilsson, R Henrik; Rajashekar, Balaji; Larsson, Karl-Henrik; Ursing, Björn M |
---|---|
Format: | Text |
Language: | English |
Published: | BioMed Central 2004 |
Subjects: | Software |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC459213/ https://www.ncbi.nlm.nih.gov/pubmed/15236648 http://dx.doi.org/10.1186/1471-2105-5-87 |
_version_ | 1782121594136559616 |
---|---|
author | Nilsson, R Henrik; Rajashekar, Balaji; Larsson, Karl-Henrik; Ursing, Björn M |
author_facet | Nilsson, R Henrik; Rajashekar, Balaji; Larsson, Karl-Henrik; Ursing, Björn M |
author_sort | Nilsson, R Henrik |
collection | PubMed |
description | BACKGROUND: Research involving expressed sequence tags (ESTs) is intricately coupled to the existence of large, well-annotated sequence repositories. Comparatively complete and satisfactory annotated public sequence libraries are, however, available only for a limited range of organisms, rendering the absence of sequences and gene structure information a tangible problem for those working with taxa lacking an EST or genome sequencing project. Paralogous genes belonging to the same gene family but distinguished by derived characteristics are particularly prone to misidentification and erroneous annotation; high but incomplete levels of sequence similarity are typically difficult to interpret and have formed the basis of many unsubstantiated assumptions of orthology. In these cases, a phylogenetic study of the query sequence together with the most similar sequences in the database may be of great value to the identification process. In order to facilitate this laborious procedure, a project to employ automated phylogenetic analysis in the identification of ESTs was initiated. RESULTS: galaxieEST is an open source Perl-CGI script package designed to complement traditional similarity-based identification of EST sequences through employment of automated phylogenetic analysis. It uses a series of BLAST runs as a sieve to retrieve nucleotide and protein sequences for inclusion in neighbour joining and parsimony analyses; the output includes the BLAST output, the results of the phylogenetic analyses, and the corresponding multiple alignments. galaxieEST is available as an on-line web service for identification of fungal ESTs and for download / local installation for use with any organism group at . 
CONCLUSIONS: By addressing sequence relatedness in addition to similarity, galaxieEST provides an integrative view on EST origin and identity, which may prove particularly useful in cases where similarity searches return one or more pertinent, but not full, matches and additional information on the query EST is needed. |
format | Text |
id | pubmed-459213 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2004 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-459213 2004-07-16 galaxieEST: addressing EST identity through automated phylogenetic analysis Nilsson, R Henrik; Rajashekar, Balaji; Larsson, Karl-Henrik; Ursing, Björn M BMC Bioinformatics Software BioMed Central 2004-07-05 /pmc/articles/PMC459213/ /pubmed/15236648 http://dx.doi.org/10.1186/1471-2105-5-87 Text en Copyright © 2004 Nilsson et al; licensee BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL. |
spellingShingle | Software Nilsson, R Henrik Rajashekar, Balaji Larsson, Karl-Henrik Ursing, Björn M galaxieEST: addressing EST identity through automated phylogenetic analysis |
title | galaxieEST: addressing EST identity through automated phylogenetic analysis |
title_sort | galaxieest: addressing est identity through automated phylogenetic analysis |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC459213/ https://www.ncbi.nlm.nih.gov/pubmed/15236648 http://dx.doi.org/10.1186/1471-2105-5-87 |
work_keys_str_mv | AT nilssonrhenrik galaxieestaddressingestidentitythroughautomatedphylogeneticanalysis AT rajashekarbalaji galaxieestaddressingestidentitythroughautomatedphylogeneticanalysis AT larssonkarlhenrik galaxieestaddressingestidentitythroughautomatedphylogeneticanalysis AT ursingbjornm galaxieestaddressingestidentitythroughautomatedphylogeneticanalysis |