Cargando…

Multi-Dimensional Bitmap Indices for Optimising Data Access within Object Oriented Databases at CERN

Efficient query processing in high-dimensional search spaces is an important requirement for many analysis tools. In the literature on index data structures one can find a wide range of methods for optimising database access. In particular, bitmap indices have recently gained substantial popularity...

Descripción completa

Detalles Bibliográficos
Autor principal: Stockinger, K
Lenguaje:eng
Publicado: CERN 2001
Materias:
Acceso en línea:http://cds.cern.ch/record/528073
_version_ 1780897950332878848
author Stockinger, K
author_facet Stockinger, K
author_sort Stockinger, K
collection CERN
description Efficient query processing in high-dimensional search spaces is an important requirement for many analysis tools. In the literature on index data structures one can find a wide range of methods for optimising database access. In particular, bitmap indices have recently gained substantial popularity in data warehouse applications with large amounts of read mostly data. Bitmap indices are implemented in various commercial database products and are used for querying typical business applications. However, scientific data that is mostly characterised by non-discrete attribute values cannot be queried efficiently by the techniques currently supported. In this thesis we propose a novel access method based on bitmap indices that efficiently handles multi-dimensional queries against typical scientific data. The algorithm is called GenericRangeEval and is an extension of a bitmap index for discrete attribute values. By means of a cost model we study the performance of queries with various selectivities against uniformly distributed and independent data values. Experimentally we verify our analytical findings and demonstrate that for certain query selectivities the proposed bitmap index shows a significant performance improvement over traditional access methods. Apart from evaluating bitmap indices for synthetic data, we also evaluate this access method based on real data taken from High Energy Physics and Astronomy applications. We thus demonstrate that our approach is not only of theoretical value but also improves the performance of practical applications. In this sense this thesis is a successful proof that multi-dimensional access methods can significantly speed up typical high-dimensional end-user analysis.
id cern-528073
institution Organización Europea para la Investigación Nuclear
language eng
publishDate 2001
publisher CERN
record_format invenio
spelling cern-5280732019-09-30T06:29:59Zhttp://cds.cern.ch/record/528073engStockinger, KMulti-Dimensional Bitmap Indices for Optimising Data Access within Object Oriented Databases at CERNComputing and ComputersEfficient query processing in high-dimensional search spaces is an important requirement for many analysis tools. In the literature on index data structures one can find a wide range of methods for optimising database access. In particular, bitmap indices have recently gained substantial popularity in data warehouse applications with large amounts of read mostly data. Bitmap indices are implemented in various commercial database products and are used for querying typical business applications. However, scientific data that is mostly characterised by non-discrete attribute values cannot be queried efficiently by the techniques currently supported. In this thesis we propose a novel access method based on bitmap indices that efficiently handles multi-dimensional queries against typical scientific data. The algorithm is called GenericRangeEval and is an extension of a bitmap index for discrete attribute values. By means of a cost model we study the performance of queries with various selectivities against uniformly distributed and independent data values. Experimentally we verify our analytical findings and demonstrate that for certain query selectivities the proposed bitmap index shows a significant performance improvement over traditional access methods. Apart from evaluating bitmap indices for synthetic data, we also evaluate this access method based on real data taken from High Energy Physics and Astronomy applications. We thus demonstrate that our approach is not only of theoretical value but also improves the performance of practical applications. In this sense this thesis is a successful proof that multi-dimensional access methods can significantly speed up typical high-dimensional end-user analysis.CERNCERN-THESIS-2001-026oai:cds.cern.ch:5280732001
spellingShingle Computing and Computers
Stockinger, K
Multi-Dimensional Bitmap Indices for Optimising Data Access within Object Oriented Databases at CERN
title Multi-Dimensional Bitmap Indices for Optimising Data Access within Object Oriented Databases at CERN
title_full Multi-Dimensional Bitmap Indices for Optimising Data Access within Object Oriented Databases at CERN
title_fullStr Multi-Dimensional Bitmap Indices for Optimising Data Access within Object Oriented Databases at CERN
title_full_unstemmed Multi-Dimensional Bitmap Indices for Optimising Data Access within Object Oriented Databases at CERN
title_short Multi-Dimensional Bitmap Indices for Optimising Data Access within Object Oriented Databases at CERN
title_sort multi-dimensional bitmap indices for optimising data access within object oriented databases at cern
topic Computing and Computers
url http://cds.cern.ch/record/528073
work_keys_str_mv AT stockingerk multidimensionalbitmapindicesforoptimisingdataaccesswithinobjectorienteddatabasesatcern