Cargando…

Applying GRID Technologies to XML Based OLAP Cube Construction

On-Line Analytical Processing (OLAP) is a powerful method for analysing large data warehouse data. Typically, the data for an OLAP database is collected from a set of data repositories such as e.g. operational databases. This data set is often huge, and it may not be known in advance what data is re...

Descripción completa

Detalles Bibliográficos
Autores principales: Niemi, Tapio Petteri, Niinimäki, M, Nummenmaa, J, Thanisch, P
Lenguaje:eng
Publicado: 2002
Materias:
Acceso en línea:http://cds.cern.ch/record/600552
_version_ 1780899942853771264
author Niemi, Tapio Petteri
Niinimäki, M
Nummenmaa, J
Thanisch, P
author_facet Niemi, Tapio Petteri
Niinimäki, M
Nummenmaa, J
Thanisch, P
author_sort Niemi, Tapio Petteri
collection CERN
description On-Line Analytical Processing (OLAP) is a powerful method for analysing large data warehouse data. Typically, the data for an OLAP database is collected from a set of data repositories such as e.g. operational databases. This data set is often huge, and it may not be known in advance what data is required and when to perform the desired data analysis tasks. Sometimes it may happen that some parts of the data are only needed occasionally. Therefore, storing all data to the OLAP database and keeping this database constantly up-to-date is not only a highly demanding task but it also may be overkill in practice. This suggests that in some applications it would be more feasible to form the OLAP cubes only when they are actually needed. However, the OLAP cube construction can be a slow process. Thus, we present a system that applies Grid technologies to distribute the computation. As the data sources may well be heterogeneous, we propose an XML language for data collection. The user's definition for a OLAP new cube often includes selecting and aggregating the data. In our system this computation is distributed to the computers that store the original data. This reduces the network traffic and speeds up the computation that is now performed in parallel. The sub results are sent back to the 'collecting server'. Usually, the results do not arrive simultaneously. However, the collecting server starts to process a sub result immediately after it has arrived. Therefore, there is no need to wait that all sub result are received. We have implemented a prototype for the system. The implementation applies Spitfire software and Mobile Analyzer technology. They both are Grid based products applying Grid Security Infrastructure.
id cern-600552
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2002
record_format invenio
spelling cern-6005522019-09-30T06:29:59Zhttp://cds.cern.ch/record/600552engNiemi, Tapio PetteriNiinimäki, MNummenmaa, JThanisch, PApplying GRID Technologies to XML Based OLAP Cube ConstructionComputing and ComputersOn-Line Analytical Processing (OLAP) is a powerful method for analysing large data warehouse data. Typically, the data for an OLAP database is collected from a set of data repositories such as e.g. operational databases. This data set is often huge, and it may not be known in advance what data is required and when to perform the desired data analysis tasks. Sometimes it may happen that some parts of the data are only needed occasionally. Therefore, storing all data to the OLAP database and keeping this database constantly up-to-date is not only a highly demanding task but it also may be overkill in practice. This suggests that in some applications it would be more feasible to form the OLAP cubes only when they are actually needed. However, the OLAP cube construction can be a slow process. Thus, we present a system that applies Grid technologies to distribute the computation. As the data sources may well be heterogeneous, we propose an XML language for data collection. The user's definition for a OLAP new cube often includes selecting and aggregating the data. In our system this computation is distributed to the computers that store the original data. This reduces the network traffic and speeds up the computation that is now performed in parallel. The sub results are sent back to the 'collecting server'. Usually, the results do not arrive simultaneously. However, the collecting server starts to process a sub result immediately after it has arrived. Therefore, there is no need to wait that all sub result are received. We have implemented a prototype for the system. The implementation applies Spitfire software and Mobile Analyzer technology. They both are Grid based products applying Grid Security Infrastructure.CERN-OPEN-2003-004oai:cds.cern.ch:6005522002-12-17
spellingShingle Computing and Computers
Niemi, Tapio Petteri
Niinimäki, M
Nummenmaa, J
Thanisch, P
Applying GRID Technologies to XML Based OLAP Cube Construction
title Applying GRID Technologies to XML Based OLAP Cube Construction
title_full Applying GRID Technologies to XML Based OLAP Cube Construction
title_fullStr Applying GRID Technologies to XML Based OLAP Cube Construction
title_full_unstemmed Applying GRID Technologies to XML Based OLAP Cube Construction
title_short Applying GRID Technologies to XML Based OLAP Cube Construction
title_sort applying grid technologies to xml based olap cube construction
topic Computing and Computers
url http://cds.cern.ch/record/600552
work_keys_str_mv AT niemitapiopetteri applyinggridtechnologiestoxmlbasedolapcubeconstruction
AT niinimakim applyinggridtechnologiestoxmlbasedolapcubeconstruction
AT nummenmaaj applyinggridtechnologiestoxmlbasedolapcubeconstruction
AT thanischp applyinggridtechnologiestoxmlbasedolapcubeconstruction