Cargando…

Compression Algorithms for the Storage of Bibliographic Information (First of two parts)

BIBLIOGRAPHICAL DATA BASES WHICH HAVE BEEN DEVELOPED IN PC COMPUTERS HAVE BEEN LIMITED REGARDING TOTAL NUMBER OF FICHES AND THEIR PERFORMANCE DUE TO THE SIZE OF THE PLATFORM IN WHICH THOSE DATA BASES ARE DEVELOPED AND INSTALLED. COMMERCIAL PC DATA BASE MANAGEMENT SOFTWARE HAVE BEEN CONSTRUCTED WITH...

Descripción completa

Detalles Bibliográficos
Autores principales: RUÍZ VELASCO Y ROMO, MIGUEL AGUSTÍN, Voutssás Márquez, Juan
Formato: Online Artículo
Lenguaje:spa
Publicado: Instituto de Investigaciones Bibliotecológicas y de la Información 2001
Materias:
Acceso en línea:http://rev-ib.unam.mx/ib/index.php/ib/article/view/3975
https://dx.doi.org/10.22201/iibi.0187358xp.2001.31.3975
_version_ 1780761117116596224
author RUÍZ VELASCO Y ROMO, MIGUEL AGUSTÍN
Voutssás Márquez, Juan
author_facet RUÍZ VELASCO Y ROMO, MIGUEL AGUSTÍN
Voutssás Márquez, Juan
author_sort RUÍZ VELASCO Y ROMO, MIGUEL AGUSTÍN
collection Investigación Bibliotecológica: archivonomía, bibliotecología e información
description BIBLIOGRAPHICAL DATA BASES WHICH HAVE BEEN DEVELOPED IN PC COMPUTERS HAVE BEEN LIMITED REGARDING TOTAL NUMBER OF FICHES AND THEIR PERFORMANCE DUE TO THE SIZE OF THE PLATFORM IN WHICH THOSE DATA BASES ARE DEVELOPED AND INSTALLED. COMMERCIAL PC DATA BASE MANAGEMENT SOFTWARE HAVE BEEN CONSTRUCTED WITH A GENERAL APPROACH, THINKING IN STANDARD APPLICATIONS, AND DO NOT CONSIDER THE PARTICULAR FEATURE OF THE BIBLIOGRAPHIC INFORMATION. THUS, THEY DECREASE IN PERFORMANCE EXPONENTIALLY IN RELATION WITH THE SIZE OF THE DATA BASE. IN THIS FIRST PART OF THE SURVEY, THE PROBLEMS ARE DISCUSSED, AS WELL AS THOSE TYPICAL FEATURES OF THE BIBLIOGRAPHIC INFORMATION REGARDING TO ITS INCLUSION IN COMPUTARIZED DATA BASES. A BIBLIOGRAPHIC DATA COMPRESSION MODEL IS INTRODUCED AS AN ALGORITHM, ALLOWING BETWEEN 40 AND 70 % OF COMPRESSION RATE WITHOUT LOSING INFORMATION QUALITY. IN THE SECOND DOCUMENT, PROCEDURES FOR CREATION AND COMPRESSION OF PRECONSTRUCTED INDEXES WILL BE PRESENTED, AS WELL AS RETRIEVAL FILES FOR WORD FREE-SEARCHING. SOME TECHNIQUES FOR CREATION AND RETRIEVAL OF BOTH ACCESS PATHS WILL BE FULLY DISCUSSED. IN THAT PART THE FINAL CONCLUSION SHOWS THAT DATA BASES WITH SEVERAL HUNDREDS OF THOUSANDS OF RECORDS OWNING SEVERAL MILLIONS OF RETRIEVAL WORDS CAN BE COMPRESSED TO THE AVAILABLE SPACE OF A CD- ROM (650 MB), AND EVEN EXPANDED TO GREATER FIGURES.
format Online
Article
id oai_unam-bibliotecologica-article-3975
institution Universidad Nacional Autónoma de México
language spa
publishDate 2001
publisher Instituto de Investigaciones Bibliotecológicas y de la Información
record_format ojs
spelling oai_unam-bibliotecologica-article-39752018-01-31T14:24:52Z Compression Algorithms for the Storage of Bibliographic Information (First of two parts) Algoritmo de compresión para el almacenamiento de información bibliográfica (primera de dos partes) RUÍZ VELASCO Y ROMO, MIGUEL AGUSTÍN Voutssás Márquez, Juan DATABASES INDEX AUTOMATION ALGORITHMS BASES DE DATOS AUTOMATIZACION DE INDICES ALGORITMOS BIBLIOGRAPHICAL DATA BASES WHICH HAVE BEEN DEVELOPED IN PC COMPUTERS HAVE BEEN LIMITED REGARDING TOTAL NUMBER OF FICHES AND THEIR PERFORMANCE DUE TO THE SIZE OF THE PLATFORM IN WHICH THOSE DATA BASES ARE DEVELOPED AND INSTALLED. COMMERCIAL PC DATA BASE MANAGEMENT SOFTWARE HAVE BEEN CONSTRUCTED WITH A GENERAL APPROACH, THINKING IN STANDARD APPLICATIONS, AND DO NOT CONSIDER THE PARTICULAR FEATURE OF THE BIBLIOGRAPHIC INFORMATION. THUS, THEY DECREASE IN PERFORMANCE EXPONENTIALLY IN RELATION WITH THE SIZE OF THE DATA BASE. IN THIS FIRST PART OF THE SURVEY, THE PROBLEMS ARE DISCUSSED, AS WELL AS THOSE TYPICAL FEATURES OF THE BIBLIOGRAPHIC INFORMATION REGARDING TO ITS INCLUSION IN COMPUTARIZED DATA BASES. A BIBLIOGRAPHIC DATA COMPRESSION MODEL IS INTRODUCED AS AN ALGORITHM, ALLOWING BETWEEN 40 AND 70 % OF COMPRESSION RATE WITHOUT LOSING INFORMATION QUALITY. IN THE SECOND DOCUMENT, PROCEDURES FOR CREATION AND COMPRESSION OF PRECONSTRUCTED INDEXES WILL BE PRESENTED, AS WELL AS RETRIEVAL FILES FOR WORD FREE-SEARCHING. SOME TECHNIQUES FOR CREATION AND RETRIEVAL OF BOTH ACCESS PATHS WILL BE FULLY DISCUSSED. IN THAT PART THE FINAL CONCLUSION SHOWS THAT DATA BASES WITH SEVERAL HUNDREDS OF THOUSANDS OF RECORDS OWNING SEVERAL MILLIONS OF RETRIEVAL WORDS CAN BE COMPRESSED TO THE AVAILABLE SPACE OF A CD- ROM (650 MB), AND EVEN EXPANDED TO GREATER FIGURES. LOS BANCOS DE DATOS BIBLIOGRÁFICOS DESARROLLADOS EN COMPUTADORAS DE TIPO PERSONAL SE HAN VISTO LIMITADOS EN CUANTO AL NUMERO DE FICHAS Y POR SU RENDIMIENTO EN FUNCIÓN DEL TAMAÑO DE LA PLATAFORMA EN DONDE SE DESARROLLAN E INSTALAN. LOS MANEJADORES COMERCIALES DE BASE DE DATOS PARA ESTOS EQUIPOS HAN SIDO CONSTRUIDOS DE ACUERDO CON NECESIDADES DE TIPO GENERAL EN EL MERCADO Y NO CONTEMPLAN LAS CARACTERÍSTICAS PROPIAS DE LA INFORMACIÓN BIBLIOGRÁFICA, POR LO QUE DECRECE SU RENDIMIENTO RÁPIDAMENTE EN FUNCIÓN AL TAMAÑO DEL BANCO DE DATOS. EN ESTA PRIMERA PARTE DEL DOCUMENTO SE ANALIZA ESA PROBLEMÁTICA Y LAS CARACTERÍSTICAS PROPIAS DE LA INFORMACIÓN BIBLIOGRÁFICA EN LO TOCANTE A SU INCLUSIÓN EN BANCOS DE DATOS ELECTRÓNICOS, Y SE PRESENTA UN MODELO DE COMPRESIÓN DE DATOS BIBLIOGRÁFICOS EN FORMA DE ALGORITMO QUE PERMITE ENTRE UN 40% Y 70% DE COMPRESIÓN SIN MENOSCABAR LAS CARACTERÍSTICAS PROPIAS DE LA INFORMACIÓN BIBLIOGRÁFICA. EN LA SEGUNDA PARTE DEL DOCUMENTO SE PRESENTAN LAS TÉCNICAS PARA CREAR Y COMPRIMIR ÍNDICES PRECONSTRUIDOS DE RECUPERACIÓN Y ARCHIVOS DE RECUPERACIÓN POR PALABRAS EN BÚSQUEDA LIBRE, ASÍ COMO LAS TÉCNICAS PARA ACCEDERLOS Y SERLE PRESENTADOS AL USUARIO FINAL. EN ESA PARTE SE CONCLUYE QUE BANCOS DE DATOS DE CIENTOS DE MILES DE FICHAS Y MILLONES DE PALABRAS DE RECUPERACIÓN PUEDEN COMPRIMIRSE EN EL ESPACIO DE UN CD-ROM (650MEGABYTES), Y AUN EXTRAPOLARSE ESTOS VALORES A COSTAS MUCHOS MAYORES. Instituto de Investigaciones Bibliotecológicas y de la Información 2001-07-01 info:eu-repo/semantics/article info:eu-repo/semantics/publishedVersion application/pdf http://rev-ib.unam.mx/ib/index.php/ib/article/view/3975 10.22201/iibi.0187358xp.2001.31.3975 Investigación Bibliotecológica. Archivonomía, bibliotecología e información; Vol. 15 No. 31 (2001) Investigación Bibliotecológica: archivonomía, bibliotecología e información; Vol. 15 Núm. 31 (2001) Investigación Bibliotecológica: archivonomía, bibliotecología e información; v. 15 n. 31 (2001) 2448-8321 0187-358X 10.22201/iibi.0187358xp.2001.31 spa http://rev-ib.unam.mx/ib/index.php/ib/article/view/3975/3527 Derechos de autor 2001 Investigación Bibliotecológica: archivonomía, bibliotecología e información
spellingShingle DATABASES
INDEX AUTOMATION
ALGORITHMS
BASES DE DATOS
AUTOMATIZACION DE INDICES
ALGORITMOS
RUÍZ VELASCO Y ROMO, MIGUEL AGUSTÍN
Voutssás Márquez, Juan
Compression Algorithms for the Storage of Bibliographic Information (First of two parts)
title Compression Algorithms for the Storage of Bibliographic Information (First of two parts)
title_alt Algoritmo de compresión para el almacenamiento de información bibliográfica (primera de dos partes)
title_full Compression Algorithms for the Storage of Bibliographic Information (First of two parts)
title_fullStr Compression Algorithms for the Storage of Bibliographic Information (First of two parts)
title_full_unstemmed Compression Algorithms for the Storage of Bibliographic Information (First of two parts)
title_short Compression Algorithms for the Storage of Bibliographic Information (First of two parts)
title_sort compression algorithms for the storage of bibliographic information (first of two parts)
topic DATABASES
INDEX AUTOMATION
ALGORITHMS
BASES DE DATOS
AUTOMATIZACION DE INDICES
ALGORITMOS
topic_facet DATABASES
INDEX AUTOMATION
ALGORITHMS
BASES DE DATOS
AUTOMATIZACION DE INDICES
ALGORITMOS
url http://rev-ib.unam.mx/ib/index.php/ib/article/view/3975
https://dx.doi.org/10.22201/iibi.0187358xp.2001.31.3975
work_keys_str_mv AT ruizvelascoyromomiguelagustin compressionalgorithmsforthestorageofbibliographicinformationfirstoftwoparts
AT voutssasmarquezjuan compressionalgorithmsforthestorageofbibliographicinformationfirstoftwoparts
AT ruizvelascoyromomiguelagustin algoritmodecompresionparaelalmacenamientodeinformacionbibliograficaprimeradedospartes
AT voutssasmarquezjuan algoritmodecompresionparaelalmacenamientodeinformacionbibliograficaprimeradedospartes