Cargando…

GarlicESTdb: an online database and mining tool for garlic EST sequences

BACKGROUND: Allium sativum., commonly known as garlic, is a species in the onion genus (Allium), which is a large and diverse one containing over 1,250 species. Its close relatives include chives, onion, leek and shallot. Garlic has been used throughout recorded history for culinary, medicinal use a...

Descripción completa

Detalles Bibliográficos
Autores principales: Kim, Dae-Won, Jung, Tae-Sung, Nam, Seong-Hyeuk, Kwon, Hyuk-Ryul, Kim, Aeri, Chae, Sung-Hwa, Choi, Sang-Haeng, Kim, Dong-Wook, Kim, Ryong Nam, Park, Hong-Seog
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2689220/
https://www.ncbi.nlm.nih.gov/pubmed/19445732
http://dx.doi.org/10.1186/1471-2229-9-61
_version_ 1782167763041648640
author Kim, Dae-Won
Jung, Tae-Sung
Nam, Seong-Hyeuk
Kwon, Hyuk-Ryul
Kim, Aeri
Chae, Sung-Hwa
Choi, Sang-Haeng
Kim, Dong-Wook
Kim, Ryong Nam
Park, Hong-Seog
author_facet Kim, Dae-Won
Jung, Tae-Sung
Nam, Seong-Hyeuk
Kwon, Hyuk-Ryul
Kim, Aeri
Chae, Sung-Hwa
Choi, Sang-Haeng
Kim, Dong-Wook
Kim, Ryong Nam
Park, Hong-Seog
author_sort Kim, Dae-Won
collection PubMed
description BACKGROUND: Allium sativum., commonly known as garlic, is a species in the onion genus (Allium), which is a large and diverse one containing over 1,250 species. Its close relatives include chives, onion, leek and shallot. Garlic has been used throughout recorded history for culinary, medicinal use and health benefits. Currently, the interest in garlic is highly increasing due to nutritional and pharmaceutical value including high blood pressure and cholesterol, atherosclerosis and cancer. For all that, there are no comprehensive databases available for Expressed Sequence Tags(EST) of garlic for gene discovery and future efforts of genome annotation. That is why we developed a new garlic database and applications to enable comprehensive analysis of garlic gene expression. DESCRIPTION: GarlicESTdb is an integrated database and mining tool for large-scale garlic (Allium sativum) EST sequencing. A total of 21,595 ESTs collected from an in-house cDNA library were used to construct the database. The analysis pipeline is an automated system written in JAVA and consists of the following components: automatic preprocessing of EST reads, assembly of raw sequences, annotation of the assembled sequences, storage of the analyzed information into MySQL databases, and graphic display of all processed data. A web application was implemented with the latest J2EE (Java 2 Platform Enterprise Edition) software technology (JSP/EJB/JavaServlet) for browsing and querying the database, for creation of dynamic web pages on the client side, and for mapping annotated enzymes to KEGG pathways, the AJAX framework was also used partially. The online resources, such as putative annotation, single nucleotide polymorphisms (SNP) and tandem repeat data sets, can be searched by text, explored on the website, searched using BLAST, and downloaded. To archive more significant BLAST results, a curation system was introduced with which biologists can easily edit best-hit annotation information for others to view. The GarlicESTdb web application is freely available at . CONCLUSION: GarlicESTdb is the first incorporated online information database of EST sequences isolated from garlic that can be freely accessed and downloaded. It has many useful features for interactive mining of EST contigs and datasets from each library, including curation of annotated information, expression profiling, information retrieval, and summary of statistics of functional annotation. Consequently, the development of GarlicESTdb will provide a crucial contribution to biologists for data-mining and more efficient experimental studies.
format Text
id pubmed-2689220
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26892202009-06-02 GarlicESTdb: an online database and mining tool for garlic EST sequences Kim, Dae-Won Jung, Tae-Sung Nam, Seong-Hyeuk Kwon, Hyuk-Ryul Kim, Aeri Chae, Sung-Hwa Choi, Sang-Haeng Kim, Dong-Wook Kim, Ryong Nam Park, Hong-Seog BMC Plant Biol Database BACKGROUND: Allium sativum., commonly known as garlic, is a species in the onion genus (Allium), which is a large and diverse one containing over 1,250 species. Its close relatives include chives, onion, leek and shallot. Garlic has been used throughout recorded history for culinary, medicinal use and health benefits. Currently, the interest in garlic is highly increasing due to nutritional and pharmaceutical value including high blood pressure and cholesterol, atherosclerosis and cancer. For all that, there are no comprehensive databases available for Expressed Sequence Tags(EST) of garlic for gene discovery and future efforts of genome annotation. That is why we developed a new garlic database and applications to enable comprehensive analysis of garlic gene expression. DESCRIPTION: GarlicESTdb is an integrated database and mining tool for large-scale garlic (Allium sativum) EST sequencing. A total of 21,595 ESTs collected from an in-house cDNA library were used to construct the database. The analysis pipeline is an automated system written in JAVA and consists of the following components: automatic preprocessing of EST reads, assembly of raw sequences, annotation of the assembled sequences, storage of the analyzed information into MySQL databases, and graphic display of all processed data. A web application was implemented with the latest J2EE (Java 2 Platform Enterprise Edition) software technology (JSP/EJB/JavaServlet) for browsing and querying the database, for creation of dynamic web pages on the client side, and for mapping annotated enzymes to KEGG pathways, the AJAX framework was also used partially. The online resources, such as putative annotation, single nucleotide polymorphisms (SNP) and tandem repeat data sets, can be searched by text, explored on the website, searched using BLAST, and downloaded. To archive more significant BLAST results, a curation system was introduced with which biologists can easily edit best-hit annotation information for others to view. The GarlicESTdb web application is freely available at . CONCLUSION: GarlicESTdb is the first incorporated online information database of EST sequences isolated from garlic that can be freely accessed and downloaded. It has many useful features for interactive mining of EST contigs and datasets from each library, including curation of annotated information, expression profiling, information retrieval, and summary of statistics of functional annotation. Consequently, the development of GarlicESTdb will provide a crucial contribution to biologists for data-mining and more efficient experimental studies. BioMed Central 2009-05-18 /pmc/articles/PMC2689220/ /pubmed/19445732 http://dx.doi.org/10.1186/1471-2229-9-61 Text en Copyright © 2009 Kim et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Database
Kim, Dae-Won
Jung, Tae-Sung
Nam, Seong-Hyeuk
Kwon, Hyuk-Ryul
Kim, Aeri
Chae, Sung-Hwa
Choi, Sang-Haeng
Kim, Dong-Wook
Kim, Ryong Nam
Park, Hong-Seog
GarlicESTdb: an online database and mining tool for garlic EST sequences
title GarlicESTdb: an online database and mining tool for garlic EST sequences
title_full GarlicESTdb: an online database and mining tool for garlic EST sequences
title_fullStr GarlicESTdb: an online database and mining tool for garlic EST sequences
title_full_unstemmed GarlicESTdb: an online database and mining tool for garlic EST sequences
title_short GarlicESTdb: an online database and mining tool for garlic EST sequences
title_sort garlicestdb: an online database and mining tool for garlic est sequences
topic Database
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2689220/
https://www.ncbi.nlm.nih.gov/pubmed/19445732
http://dx.doi.org/10.1186/1471-2229-9-61
work_keys_str_mv AT kimdaewon garlicestdbanonlinedatabaseandminingtoolforgarlicestsequences
AT jungtaesung garlicestdbanonlinedatabaseandminingtoolforgarlicestsequences
AT namseonghyeuk garlicestdbanonlinedatabaseandminingtoolforgarlicestsequences
AT kwonhyukryul garlicestdbanonlinedatabaseandminingtoolforgarlicestsequences
AT kimaeri garlicestdbanonlinedatabaseandminingtoolforgarlicestsequences
AT chaesunghwa garlicestdbanonlinedatabaseandminingtoolforgarlicestsequences
AT choisanghaeng garlicestdbanonlinedatabaseandminingtoolforgarlicestsequences
AT kimdongwook garlicestdbanonlinedatabaseandminingtoolforgarlicestsequences
AT kimryongnam garlicestdbanonlinedatabaseandminingtoolforgarlicestsequences
AT parkhongseog garlicestdbanonlinedatabaseandminingtoolforgarlicestsequences