Data Mining the SDSS SkyServer Database

An earlier paper (Szalay et al. "Designing and Mining MultiTerabyte Astronomy Archives: The Sloan Digital Sky Survey," ACM SIGMOD 2000) described the Sloan Digital Sky Survey's (SDSS) data management needs by defining twenty database queries and twelve data visualization tasks that a...

Bibliographic Details
Main authors: Gray, Jim; Slutz, Don; Szalay, Alex S.; Thakar, Ani R.; vandenBerg, Jan; Kunszt, Peter Z.; Stoughton, Christopher
Language: eng
Published: 2002
Subjects: Computing and Computers
Online access: http://cds.cern.ch/record/538376
_version_ 1780898276224008192
author Gray, Jim
Slutz, Don
Szalay, Alex S.
Thakar, Ani R.
vandenBerg, Jan
Kunszt, Peter Z.
Stoughton, Christopher
author_facet Gray, Jim
Slutz, Don
Szalay, Alex S.
Thakar, Ani R.
vandenBerg, Jan
Kunszt, Peter Z.
Stoughton, Christopher
author_sort Gray, Jim
collection CERN
description An earlier paper (Szalay et al. "Designing and Mining MultiTerabyte Astronomy Archives: The Sloan Digital Sky Survey," ACM SIGMOD 2000) described the Sloan Digital Sky Survey's (SDSS) data management needs by defining twenty database queries and twelve data visualization tasks that a good data management system should support. We built a database and interfaces to support both the query load and also a website for ad-hoc access. This paper reports on the database design, describes the data loading pipeline, and reports on the query implementation and performance. The queries typically translated to a single SQL statement. Most queries run in less than 20 seconds, allowing scientists to interactively explore the database. This paper is an in-depth tour of those queries. Readers should first have studied the companion overview paper Szalay et al. "The SDSS SkyServer, Public Access to the Sloan Digital Sky Server Data" ACM SIGMOD 2002.
id cern-538376
institution European Organization for Nuclear Research
language eng
publishDate 2002
record_format invenio
spelling cern-538376; 2023-03-15T19:10:58Z; http://cds.cern.ch/record/538376; eng; Gray, Jim; Slutz, Don; Szalay, Alex S.; Thakar, Ani R.; vandenBerg, Jan; Kunszt, Peter Z.; Stoughton, Christopher; Data Mining the SDSS SkyServer Database; Computing and Computers; MICROSOFT-TECH-REPORT-MSR-TR-02-01; FERMILAB-PUB-02-450-AE; cs/0202014; MSR-TR-2002-01; oai:cds.cern.ch:538376; 2002-02-12
spellingShingle Computing and Computers
Gray, Jim
Slutz, Don
Szalay, Alex S.
Thakar, Ani R.
vandenBerg, Jan
Kunszt, Peter Z.
Stoughton, Christopher
Data Mining the SDSS SkyServer Database
title Data Mining the SDSS SkyServer Database
title_full Data Mining the SDSS SkyServer Database
title_fullStr Data Mining the SDSS SkyServer Database
title_full_unstemmed Data Mining the SDSS SkyServer Database
title_short Data Mining the SDSS SkyServer Database
title_sort data mining the sdss skyserver database
topic Computing and Computers
url http://cds.cern.ch/record/538376
work_keys_str_mv AT grayjim dataminingthesdssskyserverdatabase
AT slutzdon dataminingthesdssskyserverdatabase
AT szalayalexs dataminingthesdssskyserverdatabase
AT thakaranir dataminingthesdssskyserverdatabase
AT vandenbergjan dataminingthesdssskyserverdatabase
AT kunsztpeterz dataminingthesdssskyserverdatabase
AT stoughtonchristopher dataminingthesdssskyserverdatabase