pvsR: An Open Source Interface to Big Data on the American Political Sphere

Digital data from the political sphere is abundant, omnipresent, and more and more directly accessible through the Internet. Project Vote Smart (PVS) is a prominent example of this big public data and covers various aspects of U.S. politics in astonishing detail. Despite the vast potential of PVS’ d...

Bibliographic Details
Main authors: Matter, Ulrich; Stutzer, Alois
Format: Online Article Text
Language: English
Published: Public Library of Science 2015
Subjects: Research Article
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4488489/
https://www.ncbi.nlm.nih.gov/pubmed/26132154
http://dx.doi.org/10.1371/journal.pone.0130501
_version_ 1782379167643336704
author Matter, Ulrich
Stutzer, Alois
author_facet Matter, Ulrich
Stutzer, Alois
author_sort Matter, Ulrich
collection PubMed
description Digital data from the political sphere is abundant, omnipresent, and more and more directly accessible through the Internet. Project Vote Smart (PVS) is a prominent example of this big public data and covers various aspects of U.S. politics in astonishing detail. Despite the vast potential of PVS’ data for political science, economics, and sociology, it is hardly used in empirical research. The systematic compilation of semi-structured data can be complicated and time consuming as the data format is not designed for conventional scientific research. This paper presents a new tool that makes the data easily accessible to a broad scientific community. We provide the software called pvsR as an add-on to the R programming environment for statistical computing. This open source interface (OSI) serves as a direct link between a statistical analysis and the large PVS database. The free and open code is expected to substantially reduce the cost of research with PVS’ new big public data in a vast variety of possible applications. We discuss its advantages vis-à-vis traditional methods of data generation as well as already existing interfaces. The validity of the library is documented based on an illustration involving female representation in local politics. In addition, pvsR facilitates the replication of research with PVS data at low costs, including the pre-processing of data. Similar OSIs are recommended for other big public databases.
format Online
Article
Text
id pubmed-4488489
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-44884892015-07-14 pvsR: An Open Source Interface to Big Data on the American Political Sphere Matter, Ulrich Stutzer, Alois PLoS One Research Article Digital data from the political sphere is abundant, omnipresent, and more and more directly accessible through the Internet. Project Vote Smart (PVS) is a prominent example of this big public data and covers various aspects of U.S. politics in astonishing detail. Despite the vast potential of PVS’ data for political science, economics, and sociology, it is hardly used in empirical research. The systematic compilation of semi-structured data can be complicated and time consuming as the data format is not designed for conventional scientific research. This paper presents a new tool that makes the data easily accessible to a broad scientific community. We provide the software called pvsR as an add-on to the R programming environment for statistical computing. This open source interface (OSI) serves as a direct link between a statistical analysis and the large PVS database. The free and open code is expected to substantially reduce the cost of research with PVS’ new big public data in a vast variety of possible applications. We discuss its advantages vis-à-vis traditional methods of data generation as well as already existing interfaces. The validity of the library is documented based on an illustration involving female representation in local politics. In addition, pvsR facilitates the replication of research with PVS data at low costs, including the pre-processing of data. Similar OSIs are recommended for other big public databases. 
Public Library of Science 2015-07-01 /pmc/articles/PMC4488489/ /pubmed/26132154 http://dx.doi.org/10.1371/journal.pone.0130501 Text en © 2015 Matter, Stutzer http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Matter, Ulrich
Stutzer, Alois
pvsR: An Open Source Interface to Big Data on the American Political Sphere
title pvsR: An Open Source Interface to Big Data on the American Political Sphere
title_full pvsR: An Open Source Interface to Big Data on the American Political Sphere
title_fullStr pvsR: An Open Source Interface to Big Data on the American Political Sphere
title_full_unstemmed pvsR: An Open Source Interface to Big Data on the American Political Sphere
title_short pvsR: An Open Source Interface to Big Data on the American Political Sphere
title_sort pvsr: an open source interface to big data on the american political sphere
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4488489/
https://www.ncbi.nlm.nih.gov/pubmed/26132154
http://dx.doi.org/10.1371/journal.pone.0130501
work_keys_str_mv AT matterulrich pvsranopensourceinterfacetobigdataontheamericanpoliticalsphere
AT stutzeralois pvsranopensourceinterfacetobigdataontheamericanpoliticalsphere