Cargando…

Identitag, a relational database for SAGE tag identification and interspecies comparison of SAGE libraries

BACKGROUND: Serial Analysis of Gene Expression (SAGE) is a method of large-scale gene expression analysis that has the potential to generate the full list of mRNAs present within a cell population at a given time and their frequency. An essential step in SAGE library analysis is the unambiguous assi...

Descripción completa

Detalles Bibliográficos
Autores principales: Keime, Céline, Damiola, Francesca, Mouchiroud, Dominique, Duret, Laurent, Gandrillon, Olivier
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2004
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC535903/
https://www.ncbi.nlm.nih.gov/pubmed/15469608
http://dx.doi.org/10.1186/1471-2105-5-143
_version_ 1782122034286821376
author Keime, Céline
Damiola, Francesca
Mouchiroud, Dominique
Duret, Laurent
Gandrillon, Olivier
author_facet Keime, Céline
Damiola, Francesca
Mouchiroud, Dominique
Duret, Laurent
Gandrillon, Olivier
author_sort Keime, Céline
collection PubMed
description BACKGROUND: Serial Analysis of Gene Expression (SAGE) is a method of large-scale gene expression analysis that has the potential to generate the full list of mRNAs present within a cell population at a given time and their frequency. An essential step in SAGE library analysis is the unambiguous assignment of each 14 bp tag to the transcript from which it was derived. This process, called tag-to-gene mapping, represents a step that has to be improved in the analysis of SAGE libraries. Indeed, the existing web sites providing correspondence between tags and transcripts do not concern all species for which numerous EST and cDNA have already been sequenced. RESULTS: This is the reason why we designed and implemented a freely available tool called Identitag for tag identification that can be used in any species for which transcript sequences are available. Identitag is based on a relational database structure in order to allow rapid and easy storage and updating of data and, most importantly, in order to be able to precisely define identification parameters. This structure can be seen like three interconnected modules : the first one stores virtual tags extracted from a given list of transcript sequences, the second stores experimental tags observed in SAGE experiments, and the third allows the annotation of the transcript sequences used for virtual tag extraction. It therefore connects an observed tag to a virtual tag and to the sequence it comes from, and then to its functional annotation when available. Databases made from different species can be connected according to orthology relationship thus allowing the comparison of SAGE libraries between species. We successfully used Identitag to identify tags from our chicken SAGE libraries and for chicken to human SAGE tags interspecies comparison. Identitag sources are freely available on web site. CONCLUSIONS: Identitag is a flexible and powerful tool for tag identification in any single species and for interspecies comparison of SAGE libraries. It opens the way to comparative transcriptomic analysis, an emerging branch of biology.
format Text
id pubmed-535903
institution National Center for Biotechnology Information
language English
publishDate 2004
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-5359032004-12-17 Identitag, a relational database for SAGE tag identification and interspecies comparison of SAGE libraries Keime, Céline Damiola, Francesca Mouchiroud, Dominique Duret, Laurent Gandrillon, Olivier BMC Bioinformatics Software BACKGROUND: Serial Analysis of Gene Expression (SAGE) is a method of large-scale gene expression analysis that has the potential to generate the full list of mRNAs present within a cell population at a given time and their frequency. An essential step in SAGE library analysis is the unambiguous assignment of each 14 bp tag to the transcript from which it was derived. This process, called tag-to-gene mapping, represents a step that has to be improved in the analysis of SAGE libraries. Indeed, the existing web sites providing correspondence between tags and transcripts do not concern all species for which numerous EST and cDNA have already been sequenced. RESULTS: This is the reason why we designed and implemented a freely available tool called Identitag for tag identification that can be used in any species for which transcript sequences are available. Identitag is based on a relational database structure in order to allow rapid and easy storage and updating of data and, most importantly, in order to be able to precisely define identification parameters. This structure can be seen like three interconnected modules : the first one stores virtual tags extracted from a given list of transcript sequences, the second stores experimental tags observed in SAGE experiments, and the third allows the annotation of the transcript sequences used for virtual tag extraction. It therefore connects an observed tag to a virtual tag and to the sequence it comes from, and then to its functional annotation when available. Databases made from different species can be connected according to orthology relationship thus allowing the comparison of SAGE libraries between species. We successfully used Identitag to identify tags from our chicken SAGE libraries and for chicken to human SAGE tags interspecies comparison. Identitag sources are freely available on web site. CONCLUSIONS: Identitag is a flexible and powerful tool for tag identification in any single species and for interspecies comparison of SAGE libraries. It opens the way to comparative transcriptomic analysis, an emerging branch of biology. BioMed Central 2004-10-06 /pmc/articles/PMC535903/ /pubmed/15469608 http://dx.doi.org/10.1186/1471-2105-5-143 Text en Copyright © 2004 Keime et al; licensee BioMed Central Ltd.
spellingShingle Software
Keime, Céline
Damiola, Francesca
Mouchiroud, Dominique
Duret, Laurent
Gandrillon, Olivier
Identitag, a relational database for SAGE tag identification and interspecies comparison of SAGE libraries
title Identitag, a relational database for SAGE tag identification and interspecies comparison of SAGE libraries
title_full Identitag, a relational database for SAGE tag identification and interspecies comparison of SAGE libraries
title_fullStr Identitag, a relational database for SAGE tag identification and interspecies comparison of SAGE libraries
title_full_unstemmed Identitag, a relational database for SAGE tag identification and interspecies comparison of SAGE libraries
title_short Identitag, a relational database for SAGE tag identification and interspecies comparison of SAGE libraries
title_sort identitag, a relational database for sage tag identification and interspecies comparison of sage libraries
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC535903/
https://www.ncbi.nlm.nih.gov/pubmed/15469608
http://dx.doi.org/10.1186/1471-2105-5-143
work_keys_str_mv AT keimeceline identitagarelationaldatabaseforsagetagidentificationandinterspeciescomparisonofsagelibraries
AT damiolafrancesca identitagarelationaldatabaseforsagetagidentificationandinterspeciescomparisonofsagelibraries
AT mouchirouddominique identitagarelationaldatabaseforsagetagidentificationandinterspeciescomparisonofsagelibraries
AT duretlaurent identitagarelationaldatabaseforsagetagidentificationandinterspeciescomparisonofsagelibraries
AT gandrillonolivier identitagarelationaldatabaseforsagetagidentificationandinterspeciescomparisonofsagelibraries