Cargando…

DataSpread: Unifying Databases and Spreadsheets

Spreadsheet software is often the tool of choice for ad-hoc tabular data management, processing, and visualization, especially on tiny data sets. On the other hand, relational database systems offer significant power, expressivity, and efficiency over spreadsheet software for data management, while...

Descripción completa

Detalles Bibliográficos
Autores principales: Bendre, Mangesh, Sun, Bofan, Zhang, Ding, Zhou, Xinyan, Chang, Kevin ChenChuan, Parameswaran, Aditya
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4756475/
https://www.ncbi.nlm.nih.gov/pubmed/26900487
_version_ 1782416339730694144
author Bendre, Mangesh
Sun, Bofan
Zhang, Ding
Zhou, Xinyan
Chang, Kevin ChenChuan
Parameswaran, Aditya
author_facet Bendre, Mangesh
Sun, Bofan
Zhang, Ding
Zhou, Xinyan
Chang, Kevin ChenChuan
Parameswaran, Aditya
author_sort Bendre, Mangesh
collection PubMed
description Spreadsheet software is often the tool of choice for ad-hoc tabular data management, processing, and visualization, especially on tiny data sets. On the other hand, relational database systems offer significant power, expressivity, and efficiency over spreadsheet software for data management, while lacking in the ease of use and ad-hoc analysis capabilities. We demonstrate DataSpread, a data exploration tool that holistically unifies databases and spreadsheets. It continues to offer a Microsoft Excel-based spreadsheet front-end, while in parallel managing all the data in a back-end database, specifically, PostgreSQL. DataSpread retains all the advantages of spreadsheets, including ease of use, ad-hoc analysis and visualization capabilities, and a schema-free nature, while also adding the advantages of traditional relational databases, such as scalability and the ability to use arbitrary SQL to import, filter, or join external or internal tables and have the results appear in the spreadsheet. DataSpread needs to reason about and reconcile differences in the notions of schema, addressing of cells and tuples, and the current “pane” (which exists in spreadsheets but not in traditional databases), and support data modifications at both the front-end and the back-end. Our demonstration will center on our first and early prototype of the DataSpread, and will give the attendees a sense for the enormous data exploration capabilities offered by unifying spreadsheets and databases.
format Online
Article
Text
id pubmed-4756475
institution National Center for Biotechnology Information
language English
publishDate 2015
record_format MEDLINE/PubMed
spelling pubmed-47564752016-02-17 DataSpread: Unifying Databases and Spreadsheets Bendre, Mangesh Sun, Bofan Zhang, Ding Zhou, Xinyan Chang, Kevin ChenChuan Parameswaran, Aditya Proceedings VLDB Endowment Article Spreadsheet software is often the tool of choice for ad-hoc tabular data management, processing, and visualization, especially on tiny data sets. On the other hand, relational database systems offer significant power, expressivity, and efficiency over spreadsheet software for data management, while lacking in the ease of use and ad-hoc analysis capabilities. We demonstrate DataSpread, a data exploration tool that holistically unifies databases and spreadsheets. It continues to offer a Microsoft Excel-based spreadsheet front-end, while in parallel managing all the data in a back-end database, specifically, PostgreSQL. DataSpread retains all the advantages of spreadsheets, including ease of use, ad-hoc analysis and visualization capabilities, and a schema-free nature, while also adding the advantages of traditional relational databases, such as scalability and the ability to use arbitrary SQL to import, filter, or join external or internal tables and have the results appear in the spreadsheet. DataSpread needs to reason about and reconcile differences in the notions of schema, addressing of cells and tuples, and the current “pane” (which exists in spreadsheets but not in traditional databases), and support data modifications at both the front-end and the back-end. Our demonstration will center on our first and early prototype of the DataSpread, and will give the attendees a sense for the enormous data exploration capabilities offered by unifying spreadsheets and databases. 2015-08 /pmc/articles/PMC4756475/ /pubmed/26900487 Text en This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivs 3.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-nd/3.0/.
spellingShingle Article
Bendre, Mangesh
Sun, Bofan
Zhang, Ding
Zhou, Xinyan
Chang, Kevin ChenChuan
Parameswaran, Aditya
DataSpread: Unifying Databases and Spreadsheets
title DataSpread: Unifying Databases and Spreadsheets
title_full DataSpread: Unifying Databases and Spreadsheets
title_fullStr DataSpread: Unifying Databases and Spreadsheets
title_full_unstemmed DataSpread: Unifying Databases and Spreadsheets
title_short DataSpread: Unifying Databases and Spreadsheets
title_sort dataspread: unifying databases and spreadsheets
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4756475/
https://www.ncbi.nlm.nih.gov/pubmed/26900487
work_keys_str_mv AT bendremangesh dataspreadunifyingdatabasesandspreadsheets
AT sunbofan dataspreadunifyingdatabasesandspreadsheets
AT zhangding dataspreadunifyingdatabasesandspreadsheets
AT zhouxinyan dataspreadunifyingdatabasesandspreadsheets
AT changkevinchenchuan dataspreadunifyingdatabasesandspreadsheets
AT parameswaranaditya dataspreadunifyingdatabasesandspreadsheets