Cargando…

TBestDB: a taxonomically broad database of expressed sequence tags (ESTs)

The TBestDB database contains ∼370 000 clustered expressed sequence tag (EST) sequences from 49 organisms, covering a taxonomically broad range of poorly studied, mainly unicellular eukaryotes, and includes experimental information, consensus sequences, gene annotations and metabolic pathway predict...

Descripción completa

Detalles Bibliográficos
Autores principales: O'Brien, Emmet A., Koski, Liisa B., Zhang, Yue, Yang, LiuSong, Wang, Eric, Gray, Michael W., Burger, Gertraud, Lang, B. Franz
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1899108/
https://www.ncbi.nlm.nih.gov/pubmed/17202165
http://dx.doi.org/10.1093/nar/gkl770
_version_ 1782133933642612736
author O'Brien, Emmet A.
Koski, Liisa B.
Zhang, Yue
Yang, LiuSong
Wang, Eric
Gray, Michael W.
Burger, Gertraud
Lang, B. Franz
author_facet O'Brien, Emmet A.
Koski, Liisa B.
Zhang, Yue
Yang, LiuSong
Wang, Eric
Gray, Michael W.
Burger, Gertraud
Lang, B. Franz
author_sort O'Brien, Emmet A.
collection PubMed
description The TBestDB database contains ∼370 000 clustered expressed sequence tag (EST) sequences from 49 organisms, covering a taxonomically broad range of poorly studied, mainly unicellular eukaryotes, and includes experimental information, consensus sequences, gene annotations and metabolic pathway predictions. Most of these ESTs have been generated by the Protist EST Program, a collaboration among six Canadian research groups. EST sequences are read from trace files up to a minimum quality cut-off, vector and linker sequence is masked, and the ESTs are clustered using phrap. The resulting consensus sequences are automatically annotated by using the AutoFACT program. The datasets are automatically checked for clustering errors due to chimerism and potential cross-contamination between organisms, and suspect data are flagged in or removed from the database. Access to data deposited in TBestDB by individual users can be restricted to those users for a limited period. With this first report on TBestDB, we open the database to the research community for free processing, annotation, interspecies comparisons and GenBank submission of EST data generated in individual laboratories. For instructions on submission to TBestDB, contact tbestdb@bch.umontreal.ca. The database can be queried at .
format Text
id pubmed-1899108
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-18991082007-06-26 TBestDB: a taxonomically broad database of expressed sequence tags (ESTs) O'Brien, Emmet A. Koski, Liisa B. Zhang, Yue Yang, LiuSong Wang, Eric Gray, Michael W. Burger, Gertraud Lang, B. Franz Nucleic Acids Res Articles The TBestDB database contains ∼370 000 clustered expressed sequence tag (EST) sequences from 49 organisms, covering a taxonomically broad range of poorly studied, mainly unicellular eukaryotes, and includes experimental information, consensus sequences, gene annotations and metabolic pathway predictions. Most of these ESTs have been generated by the Protist EST Program, a collaboration among six Canadian research groups. EST sequences are read from trace files up to a minimum quality cut-off, vector and linker sequence is masked, and the ESTs are clustered using phrap. The resulting consensus sequences are automatically annotated by using the AutoFACT program. The datasets are automatically checked for clustering errors due to chimerism and potential cross-contamination between organisms, and suspect data are flagged in or removed from the database. Access to data deposited in TBestDB by individual users can be restricted to those users for a limited period. With this first report on TBestDB, we open the database to the research community for free processing, annotation, interspecies comparisons and GenBank submission of EST data generated in individual laboratories. For instructions on submission to TBestDB, contact tbestdb@bch.umontreal.ca. The database can be queried at . Oxford University Press 2007-01 /pmc/articles/PMC1899108/ /pubmed/17202165 http://dx.doi.org/10.1093/nar/gkl770 Text en © 2006 The Author(s) This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Articles
O'Brien, Emmet A.
Koski, Liisa B.
Zhang, Yue
Yang, LiuSong
Wang, Eric
Gray, Michael W.
Burger, Gertraud
Lang, B. Franz
TBestDB: a taxonomically broad database of expressed sequence tags (ESTs)
title TBestDB: a taxonomically broad database of expressed sequence tags (ESTs)
title_full TBestDB: a taxonomically broad database of expressed sequence tags (ESTs)
title_fullStr TBestDB: a taxonomically broad database of expressed sequence tags (ESTs)
title_full_unstemmed TBestDB: a taxonomically broad database of expressed sequence tags (ESTs)
title_short TBestDB: a taxonomically broad database of expressed sequence tags (ESTs)
title_sort tbestdb: a taxonomically broad database of expressed sequence tags (ests)
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1899108/
https://www.ncbi.nlm.nih.gov/pubmed/17202165
http://dx.doi.org/10.1093/nar/gkl770
work_keys_str_mv AT obrienemmeta tbestdbataxonomicallybroaddatabaseofexpressedsequencetagsests
AT koskiliisab tbestdbataxonomicallybroaddatabaseofexpressedsequencetagsests
AT zhangyue tbestdbataxonomicallybroaddatabaseofexpressedsequencetagsests
AT yangliusong tbestdbataxonomicallybroaddatabaseofexpressedsequencetagsests
AT wangeric tbestdbataxonomicallybroaddatabaseofexpressedsequencetagsests
AT graymichaelw tbestdbataxonomicallybroaddatabaseofexpressedsequencetagsests
AT burgergertraud tbestdbataxonomicallybroaddatabaseofexpressedsequencetagsests
AT langbfranz tbestdbataxonomicallybroaddatabaseofexpressedsequencetagsests