Towards Interoperability in Genome Databases: The MAtDB (MIPS Arabidopsis Thaliana Database) Experience
Increasing numbers of whole-genome sequences are available, but to interpret them fully requires more than listing all genes. Genome databases are faced with the challenges of integrating heterogenous data and enabling data mining. In comparison to a data warehousing approach, where integration is a...
Main author: | Schoof, Heiko |
---|---|
Format: | Text |
Language: | English |
Published: | Hindawi Publishing Corporation, 2003 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2447410/ https://www.ncbi.nlm.nih.gov/pubmed/18629123 http://dx.doi.org/10.1002/cfg.278 |
_version_ | 1782156932228841472 |
---|---|
author | Schoof, Heiko |
collection | PubMed |
description | Increasing numbers of whole-genome sequences are available, but to interpret them fully requires more than listing all genes. Genome databases are faced with the challenges of integrating heterogenous data and enabling data mining. In comparison to a data warehousing approach, where integration is achieved through replication of all relevant data in a unified schema, distributed approaches provide greater flexibility and maintainability. These are important in a field where new data is generated rapidly and our understanding of the data changes. Interoperability between distributed data sources allows data maintenance to be separated from integration and analysis. Simple ways to access the data can facilitate the development of new data mining tools and the transition from model genome analysis to comparative genomics. With the MIPS Arabidopsis thaliana genome database (MAtDB, http://mips.gsf.de/proj/thal/db) our aim is to go beyond a data repository towards creating an integrated knowledge resource. To this end, the Arabidopsis genome has been a backbone against which to structure and integrate heterogenous data. The challenges to be met are continuous updating of data, the design of flexible data models that can evolve with new data, the integration of heterogenous data, e.g. through the use of ontologies, comprehensive views and visualization of complex information, simple interfaces for application access locally or via the Internet, and knowledge transfer across species. |
format | Text |
id | pubmed-2447410 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2003 |
publisher | Hindawi Publishing Corporation |
record_format | MEDLINE/PubMed |
spelling | pubmed-2447410 2008-07-14 Schoof, Heiko Comp Funct Genomics Research Article Hindawi Publishing Corporation 2003-04 /pmc/articles/PMC2447410/ /pubmed/18629123 http://dx.doi.org/10.1002/cfg.278 Text en Copyright © 2003 Hindawi Publishing Corporation. http://creativecommons.org/licenses/by/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
title | Towards Interoperability in Genome Databases: The MAtDB (MIPS Arabidopsis Thaliana Database) Experience |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2447410/ https://www.ncbi.nlm.nih.gov/pubmed/18629123 http://dx.doi.org/10.1002/cfg.278 |