Cargando…

GarlicESTdb: an online database and mining tool for garlic EST sequences

BACKGROUND: Allium sativum., commonly known as garlic, is a species in the onion genus (Allium), which is a large and diverse one containing over 1,250 species. Its close relatives include chives, onion, leek and shallot. Garlic has been used throughout recorded history for culinary, medicinal use a...

Descripción completa

Detalles Bibliográficos
Autores principales:	Kim, Dae-Won, Jung, Tae-Sung, Nam, Seong-Hyeuk, Kwon, Hyuk-Ryul, Kim, Aeri, Chae, Sung-Hwa, Choi, Sang-Haeng, Kim, Dong-Wook, Kim, Ryong Nam, Park, Hong-Seog
Formato:	Texto
Lenguaje:	English
Publicado:	BioMed Central 2009
Materias:	Database
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2689220/ https://www.ncbi.nlm.nih.gov/pubmed/19445732 http://dx.doi.org/10.1186/1471-2229-9-61

_version_	1782167763041648640
author	Kim, Dae-Won Jung, Tae-Sung Nam, Seong-Hyeuk Kwon, Hyuk-Ryul Kim, Aeri Chae, Sung-Hwa Choi, Sang-Haeng Kim, Dong-Wook Kim, Ryong Nam Park, Hong-Seog
author_facet	Kim, Dae-Won Jung, Tae-Sung Nam, Seong-Hyeuk Kwon, Hyuk-Ryul Kim, Aeri Chae, Sung-Hwa Choi, Sang-Haeng Kim, Dong-Wook Kim, Ryong Nam Park, Hong-Seog
author_sort	Kim, Dae-Won
collection	PubMed
description	BACKGROUND: Allium sativum., commonly known as garlic, is a species in the onion genus (Allium), which is a large and diverse one containing over 1,250 species. Its close relatives include chives, onion, leek and shallot. Garlic has been used throughout recorded history for culinary, medicinal use and health benefits. Currently, the interest in garlic is highly increasing due to nutritional and pharmaceutical value including high blood pressure and cholesterol, atherosclerosis and cancer. For all that, there are no comprehensive databases available for Expressed Sequence Tags(EST) of garlic for gene discovery and future efforts of genome annotation. That is why we developed a new garlic database and applications to enable comprehensive analysis of garlic gene expression. DESCRIPTION: GarlicESTdb is an integrated database and mining tool for large-scale garlic (Allium sativum) EST sequencing. A total of 21,595 ESTs collected from an in-house cDNA library were used to construct the database. The analysis pipeline is an automated system written in JAVA and consists of the following components: automatic preprocessing of EST reads, assembly of raw sequences, annotation of the assembled sequences, storage of the analyzed information into MySQL databases, and graphic display of all processed data. A web application was implemented with the latest J2EE (Java 2 Platform Enterprise Edition) software technology (JSP/EJB/JavaServlet) for browsing and querying the database, for creation of dynamic web pages on the client side, and for mapping annotated enzymes to KEGG pathways, the AJAX framework was also used partially. The online resources, such as putative annotation, single nucleotide polymorphisms (SNP) and tandem repeat data sets, can be searched by text, explored on the website, searched using BLAST, and downloaded. To archive more significant BLAST results, a curation system was introduced with which biologists can easily edit best-hit annotation information for others to view. The GarlicESTdb web application is freely available at . CONCLUSION: GarlicESTdb is the first incorporated online information database of EST sequences isolated from garlic that can be freely accessed and downloaded. It has many useful features for interactive mining of EST contigs and datasets from each library, including curation of annotated information, expression profiling, information retrieval, and summary of statistics of functional annotation. Consequently, the development of GarlicESTdb will provide a crucial contribution to biologists for data-mining and more efficient experimental studies.
format	Text
id	pubmed-2689220
institution	National Center for Biotechnology Information
language	English
publishDate	2009
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-26892202009-06-02 GarlicESTdb: an online database and mining tool for garlic EST sequences Kim, Dae-Won Jung, Tae-Sung Nam, Seong-Hyeuk Kwon, Hyuk-Ryul Kim, Aeri Chae, Sung-Hwa Choi, Sang-Haeng Kim, Dong-Wook Kim, Ryong Nam Park, Hong-Seog BMC Plant Biol Database BACKGROUND: Allium sativum., commonly known as garlic, is a species in the onion genus (Allium), which is a large and diverse one containing over 1,250 species. Its close relatives include chives, onion, leek and shallot. Garlic has been used throughout recorded history for culinary, medicinal use and health benefits. Currently, the interest in garlic is highly increasing due to nutritional and pharmaceutical value including high blood pressure and cholesterol, atherosclerosis and cancer. For all that, there are no comprehensive databases available for Expressed Sequence Tags(EST) of garlic for gene discovery and future efforts of genome annotation. That is why we developed a new garlic database and applications to enable comprehensive analysis of garlic gene expression. DESCRIPTION: GarlicESTdb is an integrated database and mining tool for large-scale garlic (Allium sativum) EST sequencing. A total of 21,595 ESTs collected from an in-house cDNA library were used to construct the database. The analysis pipeline is an automated system written in JAVA and consists of the following components: automatic preprocessing of EST reads, assembly of raw sequences, annotation of the assembled sequences, storage of the analyzed information into MySQL databases, and graphic display of all processed data. A web application was implemented with the latest J2EE (Java 2 Platform Enterprise Edition) software technology (JSP/EJB/JavaServlet) for browsing and querying the database, for creation of dynamic web pages on the client side, and for mapping annotated enzymes to KEGG pathways, the AJAX framework was also used partially. The online resources, such as putative annotation, single nucleotide polymorphisms (SNP) and tandem repeat data sets, can be searched by text, explored on the website, searched using BLAST, and downloaded. To archive more significant BLAST results, a curation system was introduced with which biologists can easily edit best-hit annotation information for others to view. The GarlicESTdb web application is freely available at . CONCLUSION: GarlicESTdb is the first incorporated online information database of EST sequences isolated from garlic that can be freely accessed and downloaded. It has many useful features for interactive mining of EST contigs and datasets from each library, including curation of annotated information, expression profiling, information retrieval, and summary of statistics of functional annotation. Consequently, the development of GarlicESTdb will provide a crucial contribution to biologists for data-mining and more efficient experimental studies. BioMed Central 2009-05-18 /pmc/articles/PMC2689220/ /pubmed/19445732 http://dx.doi.org/10.1186/1471-2229-9-61 Text en Copyright © 2009 Kim et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Database Kim, Dae-Won Jung, Tae-Sung Nam, Seong-Hyeuk Kwon, Hyuk-Ryul Kim, Aeri Chae, Sung-Hwa Choi, Sang-Haeng Kim, Dong-Wook Kim, Ryong Nam Park, Hong-Seog GarlicESTdb: an online database and mining tool for garlic EST sequences
title	GarlicESTdb: an online database and mining tool for garlic EST sequences
title_full	GarlicESTdb: an online database and mining tool for garlic EST sequences
title_fullStr	GarlicESTdb: an online database and mining tool for garlic EST sequences
title_full_unstemmed	GarlicESTdb: an online database and mining tool for garlic EST sequences
title_short	GarlicESTdb: an online database and mining tool for garlic EST sequences
title_sort	garlicestdb: an online database and mining tool for garlic est sequences
topic	Database
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2689220/ https://www.ncbi.nlm.nih.gov/pubmed/19445732 http://dx.doi.org/10.1186/1471-2229-9-61
work_keys_str_mv	AT kimdaewon garlicestdbanonlinedatabaseandminingtoolforgarlicestsequences AT jungtaesung garlicestdbanonlinedatabaseandminingtoolforgarlicestsequences AT namseonghyeuk garlicestdbanonlinedatabaseandminingtoolforgarlicestsequences AT kwonhyukryul garlicestdbanonlinedatabaseandminingtoolforgarlicestsequences AT kimaeri garlicestdbanonlinedatabaseandminingtoolforgarlicestsequences AT chaesunghwa garlicestdbanonlinedatabaseandminingtoolforgarlicestsequences AT choisanghaeng garlicestdbanonlinedatabaseandminingtoolforgarlicestsequences AT kimdongwook garlicestdbanonlinedatabaseandminingtoolforgarlicestsequences AT kimryongnam garlicestdbanonlinedatabaseandminingtoolforgarlicestsequences AT parkhongseog garlicestdbanonlinedatabaseandminingtoolforgarlicestsequences

GarlicESTdb: an online database and mining tool for garlic EST sequences

Ejemplares similares