Cargando…

MeMo: a hybrid SQL/XML approach to metabolomic data management for functional genomics

BACKGROUND: The genome sequencing projects have shown our limited knowledge regarding gene function, e.g. S. cerevisiae has 5–6,000 genes of which nearly 1,000 have an uncertain function. Their gross influence on the behaviour of the cell can be observed using large-scale metabolomic studies. The me...

Descripción completa

Detalles Bibliográficos
Autores principales: Spasić, Irena, Dunn, Warwick B, Velarde, Giles, Tseng, Andy, Jenkins, Helen, Hardy, Nigel, Oliver, Stephen G, Kell, Douglas B
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1522028/
https://www.ncbi.nlm.nih.gov/pubmed/16753052
http://dx.doi.org/10.1186/1471-2105-7-281
_version_ 1782128788330512384
author Spasić, Irena
Dunn, Warwick B
Velarde, Giles
Tseng, Andy
Jenkins, Helen
Hardy, Nigel
Oliver, Stephen G
Kell, Douglas B
author_facet Spasić, Irena
Dunn, Warwick B
Velarde, Giles
Tseng, Andy
Jenkins, Helen
Hardy, Nigel
Oliver, Stephen G
Kell, Douglas B
author_sort Spasić, Irena
collection PubMed
description BACKGROUND: The genome sequencing projects have shown our limited knowledge regarding gene function, e.g. S. cerevisiae has 5–6,000 genes of which nearly 1,000 have an uncertain function. Their gross influence on the behaviour of the cell can be observed using large-scale metabolomic studies. The metabolomic data produced need to be structured and annotated in a machine-usable form to facilitate the exploration of the hidden links between the genes and their functions. DESCRIPTION: MeMo is a formal model for representing metabolomic data and the associated metadata. Two predominant platforms (SQL and XML) are used to encode the model. MeMo has been implemented as a relational database using a hybrid approach combining the advantages of the two technologies. It represents a practical solution for handling the sheer volume and complexity of the metabolomic data effectively and efficiently. The MeMo model and the associated software are available at . CONCLUSION: The maturity of relational database technology is used to support efficient data processing. The scalability and self-descriptiveness of XML are used to simplify the relational schema and facilitate the extensibility of the model necessitated by the creation of new experimental techniques. Special consideration is given to data integration issues as part of the systems biology agenda. MeMo has been physically integrated and cross-linked to related metabolomic and genomic databases. Semantic integration with other relevant databases has been supported through ontological annotation. Compatibility with other data formats is supported by automatic conversion.
format Text
id pubmed-1522028
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-15220282006-07-26 MeMo: a hybrid SQL/XML approach to metabolomic data management for functional genomics Spasić, Irena Dunn, Warwick B Velarde, Giles Tseng, Andy Jenkins, Helen Hardy, Nigel Oliver, Stephen G Kell, Douglas B BMC Bioinformatics Database BACKGROUND: The genome sequencing projects have shown our limited knowledge regarding gene function, e.g. S. cerevisiae has 5–6,000 genes of which nearly 1,000 have an uncertain function. Their gross influence on the behaviour of the cell can be observed using large-scale metabolomic studies. The metabolomic data produced need to be structured and annotated in a machine-usable form to facilitate the exploration of the hidden links between the genes and their functions. DESCRIPTION: MeMo is a formal model for representing metabolomic data and the associated metadata. Two predominant platforms (SQL and XML) are used to encode the model. MeMo has been implemented as a relational database using a hybrid approach combining the advantages of the two technologies. It represents a practical solution for handling the sheer volume and complexity of the metabolomic data effectively and efficiently. The MeMo model and the associated software are available at . CONCLUSION: The maturity of relational database technology is used to support efficient data processing. The scalability and self-descriptiveness of XML are used to simplify the relational schema and facilitate the extensibility of the model necessitated by the creation of new experimental techniques. Special consideration is given to data integration issues as part of the systems biology agenda. MeMo has been physically integrated and cross-linked to related metabolomic and genomic databases. Semantic integration with other relevant databases has been supported through ontological annotation. Compatibility with other data formats is supported by automatic conversion. BioMed Central 2006-06-05 /pmc/articles/PMC1522028/ /pubmed/16753052 http://dx.doi.org/10.1186/1471-2105-7-281 Text en Copyright © 2006 Spasić et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Database
Spasić, Irena
Dunn, Warwick B
Velarde, Giles
Tseng, Andy
Jenkins, Helen
Hardy, Nigel
Oliver, Stephen G
Kell, Douglas B
MeMo: a hybrid SQL/XML approach to metabolomic data management for functional genomics
title MeMo: a hybrid SQL/XML approach to metabolomic data management for functional genomics
title_full MeMo: a hybrid SQL/XML approach to metabolomic data management for functional genomics
title_fullStr MeMo: a hybrid SQL/XML approach to metabolomic data management for functional genomics
title_full_unstemmed MeMo: a hybrid SQL/XML approach to metabolomic data management for functional genomics
title_short MeMo: a hybrid SQL/XML approach to metabolomic data management for functional genomics
title_sort memo: a hybrid sql/xml approach to metabolomic data management for functional genomics
topic Database
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1522028/
https://www.ncbi.nlm.nih.gov/pubmed/16753052
http://dx.doi.org/10.1186/1471-2105-7-281
work_keys_str_mv AT spasicirena memoahybridsqlxmlapproachtometabolomicdatamanagementforfunctionalgenomics
AT dunnwarwickb memoahybridsqlxmlapproachtometabolomicdatamanagementforfunctionalgenomics
AT velardegiles memoahybridsqlxmlapproachtometabolomicdatamanagementforfunctionalgenomics
AT tsengandy memoahybridsqlxmlapproachtometabolomicdatamanagementforfunctionalgenomics
AT jenkinshelen memoahybridsqlxmlapproachtometabolomicdatamanagementforfunctionalgenomics
AT hardynigel memoahybridsqlxmlapproachtometabolomicdatamanagementforfunctionalgenomics
AT oliverstepheng memoahybridsqlxmlapproachtometabolomicdatamanagementforfunctionalgenomics
AT kelldouglasb memoahybridsqlxmlapproachtometabolomicdatamanagementforfunctionalgenomics