Cargando…

CleanEST: a database of cleansed EST libraries

The EST division of GenBank, dbEST, is widely used in many applications such as gene discovery and verification of exon–intron structure. However, the use of EST sequences in the dbEST libraries is often hampered by inconsistent terminology used to describe the library sources and by the presence of...

Descripción completa

Detalles Bibliográficos
Autores principales: Lee, Byungwook, Shin, Gwangsik
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2686460/
https://www.ncbi.nlm.nih.gov/pubmed/18832365
http://dx.doi.org/10.1093/nar/gkn648
_version_ 1782167412528906240
author Lee, Byungwook
Shin, Gwangsik
author_facet Lee, Byungwook
Shin, Gwangsik
author_sort Lee, Byungwook
collection PubMed
description The EST division of GenBank, dbEST, is widely used in many applications such as gene discovery and verification of exon–intron structure. However, the use of EST sequences in the dbEST libraries is often hampered by inconsistent terminology used to describe the library sources and by the presence of contaminated sequences. Here, we describe CleanEST, a novel database server that classified dbEST libraries and removes contaminants. We classified all dbEST libraries according to species and sequencing center. In addition, we further classified human EST libraries by anatomical and pathological systems according to eVOC ontologies. For each dbEST library, we provide two different cleansed sequences: ‘pre-cleansed’ and ‘user-cleansed’. To generate pre-cleansed sequences, we cleansed sequences in dbEST by alignment of EST sequences against well-known contamination sources: UniVec, Escherichia coli, mitochondria and chloroplast (for plant). To provide user-cleansed sequences, we built an automatic user-cleansing pipeline, in which sequences of a user-selected library are cleansed on-the-fly according to user-selected options. The server is available at http://cleanest.kobic.re.kr/ and the database is updated monthly.
format Text
id pubmed-2686460
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-26864602009-05-26 CleanEST: a database of cleansed EST libraries Lee, Byungwook Shin, Gwangsik Nucleic Acids Res Articles The EST division of GenBank, dbEST, is widely used in many applications such as gene discovery and verification of exon–intron structure. However, the use of EST sequences in the dbEST libraries is often hampered by inconsistent terminology used to describe the library sources and by the presence of contaminated sequences. Here, we describe CleanEST, a novel database server that classified dbEST libraries and removes contaminants. We classified all dbEST libraries according to species and sequencing center. In addition, we further classified human EST libraries by anatomical and pathological systems according to eVOC ontologies. For each dbEST library, we provide two different cleansed sequences: ‘pre-cleansed’ and ‘user-cleansed’. To generate pre-cleansed sequences, we cleansed sequences in dbEST by alignment of EST sequences against well-known contamination sources: UniVec, Escherichia coli, mitochondria and chloroplast (for plant). To provide user-cleansed sequences, we built an automatic user-cleansing pipeline, in which sequences of a user-selected library are cleansed on-the-fly according to user-selected options. The server is available at http://cleanest.kobic.re.kr/ and the database is updated monthly. Oxford University Press 2009-01 2008-10-02 /pmc/articles/PMC2686460/ /pubmed/18832365 http://dx.doi.org/10.1093/nar/gkn648 Text en © 2008 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Articles
Lee, Byungwook
Shin, Gwangsik
CleanEST: a database of cleansed EST libraries
title CleanEST: a database of cleansed EST libraries
title_full CleanEST: a database of cleansed EST libraries
title_fullStr CleanEST: a database of cleansed EST libraries
title_full_unstemmed CleanEST: a database of cleansed EST libraries
title_short CleanEST: a database of cleansed EST libraries
title_sort cleanest: a database of cleansed est libraries
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2686460/
https://www.ncbi.nlm.nih.gov/pubmed/18832365
http://dx.doi.org/10.1093/nar/gkn648
work_keys_str_mv AT leebyungwook cleanestadatabaseofcleansedestlibraries
AT shingwangsik cleanestadatabaseofcleansedestlibraries