Cargando…
MeMo: a hybrid SQL/XML approach to metabolomic data management for functional genomics
BACKGROUND: The genome sequencing projects have shown our limited knowledge regarding gene function, e.g. S. cerevisiae has 5–6,000 genes of which nearly 1,000 have an uncertain function. Their gross influence on the behaviour of the cell can be observed using large-scale metabolomic studies. The me...
Autores principales: | , , , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2006
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1522028/ https://www.ncbi.nlm.nih.gov/pubmed/16753052 http://dx.doi.org/10.1186/1471-2105-7-281 |
_version_ | 1782128788330512384 |
---|---|
author | Spasić, Irena Dunn, Warwick B Velarde, Giles Tseng, Andy Jenkins, Helen Hardy, Nigel Oliver, Stephen G Kell, Douglas B |
author_facet | Spasić, Irena Dunn, Warwick B Velarde, Giles Tseng, Andy Jenkins, Helen Hardy, Nigel Oliver, Stephen G Kell, Douglas B |
author_sort | Spasić, Irena |
collection | PubMed |
description | BACKGROUND: The genome sequencing projects have shown our limited knowledge regarding gene function, e.g. S. cerevisiae has 5–6,000 genes of which nearly 1,000 have an uncertain function. Their gross influence on the behaviour of the cell can be observed using large-scale metabolomic studies. The metabolomic data produced need to be structured and annotated in a machine-usable form to facilitate the exploration of the hidden links between the genes and their functions. DESCRIPTION: MeMo is a formal model for representing metabolomic data and the associated metadata. Two predominant platforms (SQL and XML) are used to encode the model. MeMo has been implemented as a relational database using a hybrid approach combining the advantages of the two technologies. It represents a practical solution for handling the sheer volume and complexity of the metabolomic data effectively and efficiently. The MeMo model and the associated software are available at . CONCLUSION: The maturity of relational database technology is used to support efficient data processing. The scalability and self-descriptiveness of XML are used to simplify the relational schema and facilitate the extensibility of the model necessitated by the creation of new experimental techniques. Special consideration is given to data integration issues as part of the systems biology agenda. MeMo has been physically integrated and cross-linked to related metabolomic and genomic databases. Semantic integration with other relevant databases has been supported through ontological annotation. Compatibility with other data formats is supported by automatic conversion. |
format | Text |
id | pubmed-1522028 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2006 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-15220282006-07-26 MeMo: a hybrid SQL/XML approach to metabolomic data management for functional genomics Spasić, Irena Dunn, Warwick B Velarde, Giles Tseng, Andy Jenkins, Helen Hardy, Nigel Oliver, Stephen G Kell, Douglas B BMC Bioinformatics Database BACKGROUND: The genome sequencing projects have shown our limited knowledge regarding gene function, e.g. S. cerevisiae has 5–6,000 genes of which nearly 1,000 have an uncertain function. Their gross influence on the behaviour of the cell can be observed using large-scale metabolomic studies. The metabolomic data produced need to be structured and annotated in a machine-usable form to facilitate the exploration of the hidden links between the genes and their functions. DESCRIPTION: MeMo is a formal model for representing metabolomic data and the associated metadata. Two predominant platforms (SQL and XML) are used to encode the model. MeMo has been implemented as a relational database using a hybrid approach combining the advantages of the two technologies. It represents a practical solution for handling the sheer volume and complexity of the metabolomic data effectively and efficiently. The MeMo model and the associated software are available at . CONCLUSION: The maturity of relational database technology is used to support efficient data processing. The scalability and self-descriptiveness of XML are used to simplify the relational schema and facilitate the extensibility of the model necessitated by the creation of new experimental techniques. Special consideration is given to data integration issues as part of the systems biology agenda. MeMo has been physically integrated and cross-linked to related metabolomic and genomic databases. Semantic integration with other relevant databases has been supported through ontological annotation. Compatibility with other data formats is supported by automatic conversion. BioMed Central 2006-06-05 /pmc/articles/PMC1522028/ /pubmed/16753052 http://dx.doi.org/10.1186/1471-2105-7-281 Text en Copyright © 2006 Spasić et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Database Spasić, Irena Dunn, Warwick B Velarde, Giles Tseng, Andy Jenkins, Helen Hardy, Nigel Oliver, Stephen G Kell, Douglas B MeMo: a hybrid SQL/XML approach to metabolomic data management for functional genomics |
title | MeMo: a hybrid SQL/XML approach to metabolomic data management for functional genomics |
title_full | MeMo: a hybrid SQL/XML approach to metabolomic data management for functional genomics |
title_fullStr | MeMo: a hybrid SQL/XML approach to metabolomic data management for functional genomics |
title_full_unstemmed | MeMo: a hybrid SQL/XML approach to metabolomic data management for functional genomics |
title_short | MeMo: a hybrid SQL/XML approach to metabolomic data management for functional genomics |
title_sort | memo: a hybrid sql/xml approach to metabolomic data management for functional genomics |
topic | Database |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1522028/ https://www.ncbi.nlm.nih.gov/pubmed/16753052 http://dx.doi.org/10.1186/1471-2105-7-281 |
work_keys_str_mv | AT spasicirena memoahybridsqlxmlapproachtometabolomicdatamanagementforfunctionalgenomics AT dunnwarwickb memoahybridsqlxmlapproachtometabolomicdatamanagementforfunctionalgenomics AT velardegiles memoahybridsqlxmlapproachtometabolomicdatamanagementforfunctionalgenomics AT tsengandy memoahybridsqlxmlapproachtometabolomicdatamanagementforfunctionalgenomics AT jenkinshelen memoahybridsqlxmlapproachtometabolomicdatamanagementforfunctionalgenomics AT hardynigel memoahybridsqlxmlapproachtometabolomicdatamanagementforfunctionalgenomics AT oliverstepheng memoahybridsqlxmlapproachtometabolomicdatamanagementforfunctionalgenomics AT kelldouglasb memoahybridsqlxmlapproachtometabolomicdatamanagementforfunctionalgenomics |