Use of Solr and Xapian in the Invenio document repository software

Invenio is a free comprehensive web-based document repository and digital library software suite originally developed at CERN. It can serve a variety of use cases from an institutional repository or digital library to a web journal. In order to fully use full-text documents for efficient search and...

Bibliographic Details
Main authors: Glauner, Patrick O., Iwaszkiewicz, Jan, Le Meur, Jean-Yves, Simko, Tibor
Language: eng
Published: 2013
Subjects:
Online access: http://cds.cern.ch/record/1634050
_version_ 1780934437789237248
author Glauner, Patrick O.
Iwaszkiewicz, Jan
Le Meur, Jean-Yves
Simko, Tibor
collection CERN
description Invenio is a free comprehensive web-based document repository and digital library software suite originally developed at CERN. It can serve a variety of use cases from an institutional repository or digital library to a web journal. In order to fully use full-text documents for efficient search and ranking, Solr was integrated into Invenio through a generic bridge. Solr indexes extracted full-texts and most relevant metadata. Consequently, Invenio takes advantage of Solr’s efficient search and word similarity ranking capabilities. In this paper, we first give an overview of Invenio, its capabilities and features. We then present our open source Solr integration as well as scalability challenges that arose for an Invenio-based multi-million record repository: the CERN Document Server. We also compare our Solr adapter to an alternative Xapian adapter using the same generic bridge. Both integrations are distributed with the Invenio package and ready to be used by the institutions using or adopting Invenio.
id cern-1634050
institution European Organization for Nuclear Research
language eng
publishDate 2013
record_format invenio
spelling cern-1634050; 2023-03-14T16:33:37Z; http://cds.cern.ch/record/1634050; eng; Glauner, Patrick O.; Iwaszkiewicz, Jan; Le Meur, Jean-Yves; Simko, Tibor; Use of Solr and Xapian in the Invenio document repository software; Computing and Computers; CERN-IT-2013-006; arXiv:1310.0250; oai:cds.cern.ch:1634050; 2013-02-21
title Use of Solr and Xapian in the Invenio document repository software
topic Computing and Computers
url http://cds.cern.ch/record/1634050