Cargando…
GarlicESTdb: an online database and mining tool for garlic EST sequences
BACKGROUND: Allium sativum., commonly known as garlic, is a species in the onion genus (Allium), which is a large and diverse one containing over 1,250 species. Its close relatives include chives, onion, leek and shallot. Garlic has been used throughout recorded history for culinary, medicinal use a...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2689220/ https://www.ncbi.nlm.nih.gov/pubmed/19445732 http://dx.doi.org/10.1186/1471-2229-9-61 |
_version_ | 1782167763041648640 |
---|---|
author | Kim, Dae-Won Jung, Tae-Sung Nam, Seong-Hyeuk Kwon, Hyuk-Ryul Kim, Aeri Chae, Sung-Hwa Choi, Sang-Haeng Kim, Dong-Wook Kim, Ryong Nam Park, Hong-Seog |
author_facet | Kim, Dae-Won Jung, Tae-Sung Nam, Seong-Hyeuk Kwon, Hyuk-Ryul Kim, Aeri Chae, Sung-Hwa Choi, Sang-Haeng Kim, Dong-Wook Kim, Ryong Nam Park, Hong-Seog |
author_sort | Kim, Dae-Won |
collection | PubMed |
description | BACKGROUND: Allium sativum., commonly known as garlic, is a species in the onion genus (Allium), which is a large and diverse one containing over 1,250 species. Its close relatives include chives, onion, leek and shallot. Garlic has been used throughout recorded history for culinary, medicinal use and health benefits. Currently, the interest in garlic is highly increasing due to nutritional and pharmaceutical value including high blood pressure and cholesterol, atherosclerosis and cancer. For all that, there are no comprehensive databases available for Expressed Sequence Tags(EST) of garlic for gene discovery and future efforts of genome annotation. That is why we developed a new garlic database and applications to enable comprehensive analysis of garlic gene expression. DESCRIPTION: GarlicESTdb is an integrated database and mining tool for large-scale garlic (Allium sativum) EST sequencing. A total of 21,595 ESTs collected from an in-house cDNA library were used to construct the database. The analysis pipeline is an automated system written in JAVA and consists of the following components: automatic preprocessing of EST reads, assembly of raw sequences, annotation of the assembled sequences, storage of the analyzed information into MySQL databases, and graphic display of all processed data. A web application was implemented with the latest J2EE (Java 2 Platform Enterprise Edition) software technology (JSP/EJB/JavaServlet) for browsing and querying the database, for creation of dynamic web pages on the client side, and for mapping annotated enzymes to KEGG pathways, the AJAX framework was also used partially. The online resources, such as putative annotation, single nucleotide polymorphisms (SNP) and tandem repeat data sets, can be searched by text, explored on the website, searched using BLAST, and downloaded. To archive more significant BLAST results, a curation system was introduced with which biologists can easily edit best-hit annotation information for others to view. The GarlicESTdb web application is freely available at . CONCLUSION: GarlicESTdb is the first incorporated online information database of EST sequences isolated from garlic that can be freely accessed and downloaded. It has many useful features for interactive mining of EST contigs and datasets from each library, including curation of annotated information, expression profiling, information retrieval, and summary of statistics of functional annotation. Consequently, the development of GarlicESTdb will provide a crucial contribution to biologists for data-mining and more efficient experimental studies. |
format | Text |
id | pubmed-2689220 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-26892202009-06-02 GarlicESTdb: an online database and mining tool for garlic EST sequences Kim, Dae-Won Jung, Tae-Sung Nam, Seong-Hyeuk Kwon, Hyuk-Ryul Kim, Aeri Chae, Sung-Hwa Choi, Sang-Haeng Kim, Dong-Wook Kim, Ryong Nam Park, Hong-Seog BMC Plant Biol Database BACKGROUND: Allium sativum., commonly known as garlic, is a species in the onion genus (Allium), which is a large and diverse one containing over 1,250 species. Its close relatives include chives, onion, leek and shallot. Garlic has been used throughout recorded history for culinary, medicinal use and health benefits. Currently, the interest in garlic is highly increasing due to nutritional and pharmaceutical value including high blood pressure and cholesterol, atherosclerosis and cancer. For all that, there are no comprehensive databases available for Expressed Sequence Tags(EST) of garlic for gene discovery and future efforts of genome annotation. That is why we developed a new garlic database and applications to enable comprehensive analysis of garlic gene expression. DESCRIPTION: GarlicESTdb is an integrated database and mining tool for large-scale garlic (Allium sativum) EST sequencing. A total of 21,595 ESTs collected from an in-house cDNA library were used to construct the database. The analysis pipeline is an automated system written in JAVA and consists of the following components: automatic preprocessing of EST reads, assembly of raw sequences, annotation of the assembled sequences, storage of the analyzed information into MySQL databases, and graphic display of all processed data. A web application was implemented with the latest J2EE (Java 2 Platform Enterprise Edition) software technology (JSP/EJB/JavaServlet) for browsing and querying the database, for creation of dynamic web pages on the client side, and for mapping annotated enzymes to KEGG pathways, the AJAX framework was also used partially. The online resources, such as putative annotation, single nucleotide polymorphisms (SNP) and tandem repeat data sets, can be searched by text, explored on the website, searched using BLAST, and downloaded. To archive more significant BLAST results, a curation system was introduced with which biologists can easily edit best-hit annotation information for others to view. The GarlicESTdb web application is freely available at . CONCLUSION: GarlicESTdb is the first incorporated online information database of EST sequences isolated from garlic that can be freely accessed and downloaded. It has many useful features for interactive mining of EST contigs and datasets from each library, including curation of annotated information, expression profiling, information retrieval, and summary of statistics of functional annotation. Consequently, the development of GarlicESTdb will provide a crucial contribution to biologists for data-mining and more efficient experimental studies. BioMed Central 2009-05-18 /pmc/articles/PMC2689220/ /pubmed/19445732 http://dx.doi.org/10.1186/1471-2229-9-61 Text en Copyright © 2009 Kim et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Database Kim, Dae-Won Jung, Tae-Sung Nam, Seong-Hyeuk Kwon, Hyuk-Ryul Kim, Aeri Chae, Sung-Hwa Choi, Sang-Haeng Kim, Dong-Wook Kim, Ryong Nam Park, Hong-Seog GarlicESTdb: an online database and mining tool for garlic EST sequences |
title | GarlicESTdb: an online database and mining tool for garlic EST sequences |
title_full | GarlicESTdb: an online database and mining tool for garlic EST sequences |
title_fullStr | GarlicESTdb: an online database and mining tool for garlic EST sequences |
title_full_unstemmed | GarlicESTdb: an online database and mining tool for garlic EST sequences |
title_short | GarlicESTdb: an online database and mining tool for garlic EST sequences |
title_sort | garlicestdb: an online database and mining tool for garlic est sequences |
topic | Database |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2689220/ https://www.ncbi.nlm.nih.gov/pubmed/19445732 http://dx.doi.org/10.1186/1471-2229-9-61 |
work_keys_str_mv | AT kimdaewon garlicestdbanonlinedatabaseandminingtoolforgarlicestsequences AT jungtaesung garlicestdbanonlinedatabaseandminingtoolforgarlicestsequences AT namseonghyeuk garlicestdbanonlinedatabaseandminingtoolforgarlicestsequences AT kwonhyukryul garlicestdbanonlinedatabaseandminingtoolforgarlicestsequences AT kimaeri garlicestdbanonlinedatabaseandminingtoolforgarlicestsequences AT chaesunghwa garlicestdbanonlinedatabaseandminingtoolforgarlicestsequences AT choisanghaeng garlicestdbanonlinedatabaseandminingtoolforgarlicestsequences AT kimdongwook garlicestdbanonlinedatabaseandminingtoolforgarlicestsequences AT kimryongnam garlicestdbanonlinedatabaseandminingtoolforgarlicestsequences AT parkhongseog garlicestdbanonlinedatabaseandminingtoolforgarlicestsequences |