Cargando…
Compression Algorithms for the Storage of Bibliographic Information (First of two parts)
BIBLIOGRAPHICAL DATA BASES WHICH HAVE BEEN DEVELOPED IN PC COMPUTERS HAVE BEEN LIMITED REGARDING TOTAL NUMBER OF FICHES AND THEIR PERFORMANCE DUE TO THE SIZE OF THE PLATFORM IN WHICH THOSE DATA BASES ARE DEVELOPED AND INSTALLED. COMMERCIAL PC DATA BASE MANAGEMENT SOFTWARE HAVE BEEN CONSTRUCTED WITH...
Autores principales: | , |
---|---|
Formato: | Online Artículo |
Lenguaje: | spa |
Publicado: |
Instituto de Investigaciones Bibliotecológicas y de la Información
2001
|
Materias: | |
Acceso en línea: | http://rev-ib.unam.mx/ib/index.php/ib/article/view/3975 https://dx.doi.org/10.22201/iibi.0187358xp.2001.31.3975 |
_version_ | 1780761117116596224 |
---|---|
author | RUÍZ VELASCO Y ROMO, MIGUEL AGUSTÍN Voutssás Márquez, Juan |
author_facet | RUÍZ VELASCO Y ROMO, MIGUEL AGUSTÍN Voutssás Márquez, Juan |
author_sort | RUÍZ VELASCO Y ROMO, MIGUEL AGUSTÍN |
collection | Investigación Bibliotecológica: archivonomía, bibliotecología e información |
description | BIBLIOGRAPHICAL DATA BASES WHICH HAVE BEEN DEVELOPED IN PC COMPUTERS HAVE BEEN LIMITED REGARDING TOTAL NUMBER OF FICHES AND THEIR PERFORMANCE DUE TO THE SIZE OF THE PLATFORM IN WHICH THOSE DATA BASES ARE DEVELOPED AND INSTALLED. COMMERCIAL PC DATA BASE MANAGEMENT SOFTWARE HAVE BEEN CONSTRUCTED WITH A GENERAL APPROACH, THINKING IN STANDARD APPLICATIONS, AND DO NOT CONSIDER THE PARTICULAR FEATURE OF THE BIBLIOGRAPHIC INFORMATION. THUS, THEY DECREASE IN PERFORMANCE EXPONENTIALLY IN RELATION WITH THE SIZE OF THE DATA BASE. IN THIS FIRST PART OF THE SURVEY, THE PROBLEMS ARE DISCUSSED, AS WELL AS THOSE TYPICAL FEATURES OF THE BIBLIOGRAPHIC INFORMATION REGARDING TO ITS INCLUSION IN COMPUTARIZED DATA BASES. A BIBLIOGRAPHIC DATA COMPRESSION MODEL IS INTRODUCED AS AN ALGORITHM, ALLOWING BETWEEN 40 AND 70 % OF COMPRESSION RATE WITHOUT LOSING INFORMATION QUALITY. IN THE SECOND DOCUMENT, PROCEDURES FOR CREATION AND COMPRESSION OF PRECONSTRUCTED INDEXES WILL BE PRESENTED, AS WELL AS RETRIEVAL FILES FOR WORD FREE-SEARCHING. SOME TECHNIQUES FOR CREATION AND RETRIEVAL OF BOTH ACCESS PATHS WILL BE FULLY DISCUSSED. IN THAT PART THE FINAL CONCLUSION SHOWS THAT DATA BASES WITH SEVERAL HUNDREDS OF THOUSANDS OF RECORDS OWNING SEVERAL MILLIONS OF RETRIEVAL WORDS CAN BE COMPRESSED TO THE AVAILABLE SPACE OF A CD- ROM (650 MB), AND EVEN EXPANDED TO GREATER FIGURES. |
format | Online Article |
id | oai_unam-bibliotecologica-article-3975 |
institution | Universidad Nacional Autónoma de México |
language | spa |
publishDate | 2001 |
publisher | Instituto de Investigaciones Bibliotecológicas y de la Información |
record_format | ojs |
spelling | oai_unam-bibliotecologica-article-39752018-01-31T14:24:52Z Compression Algorithms for the Storage of Bibliographic Information (First of two parts) Algoritmo de compresión para el almacenamiento de información bibliográfica (primera de dos partes) RUÍZ VELASCO Y ROMO, MIGUEL AGUSTÍN Voutssás Márquez, Juan DATABASES INDEX AUTOMATION ALGORITHMS BASES DE DATOS AUTOMATIZACION DE INDICES ALGORITMOS BIBLIOGRAPHICAL DATA BASES WHICH HAVE BEEN DEVELOPED IN PC COMPUTERS HAVE BEEN LIMITED REGARDING TOTAL NUMBER OF FICHES AND THEIR PERFORMANCE DUE TO THE SIZE OF THE PLATFORM IN WHICH THOSE DATA BASES ARE DEVELOPED AND INSTALLED. COMMERCIAL PC DATA BASE MANAGEMENT SOFTWARE HAVE BEEN CONSTRUCTED WITH A GENERAL APPROACH, THINKING IN STANDARD APPLICATIONS, AND DO NOT CONSIDER THE PARTICULAR FEATURE OF THE BIBLIOGRAPHIC INFORMATION. THUS, THEY DECREASE IN PERFORMANCE EXPONENTIALLY IN RELATION WITH THE SIZE OF THE DATA BASE. IN THIS FIRST PART OF THE SURVEY, THE PROBLEMS ARE DISCUSSED, AS WELL AS THOSE TYPICAL FEATURES OF THE BIBLIOGRAPHIC INFORMATION REGARDING TO ITS INCLUSION IN COMPUTARIZED DATA BASES. A BIBLIOGRAPHIC DATA COMPRESSION MODEL IS INTRODUCED AS AN ALGORITHM, ALLOWING BETWEEN 40 AND 70 % OF COMPRESSION RATE WITHOUT LOSING INFORMATION QUALITY. IN THE SECOND DOCUMENT, PROCEDURES FOR CREATION AND COMPRESSION OF PRECONSTRUCTED INDEXES WILL BE PRESENTED, AS WELL AS RETRIEVAL FILES FOR WORD FREE-SEARCHING. SOME TECHNIQUES FOR CREATION AND RETRIEVAL OF BOTH ACCESS PATHS WILL BE FULLY DISCUSSED. IN THAT PART THE FINAL CONCLUSION SHOWS THAT DATA BASES WITH SEVERAL HUNDREDS OF THOUSANDS OF RECORDS OWNING SEVERAL MILLIONS OF RETRIEVAL WORDS CAN BE COMPRESSED TO THE AVAILABLE SPACE OF A CD- ROM (650 MB), AND EVEN EXPANDED TO GREATER FIGURES. LOS BANCOS DE DATOS BIBLIOGRÁFICOS DESARROLLADOS EN COMPUTADORAS DE TIPO PERSONAL SE HAN VISTO LIMITADOS EN CUANTO AL NUMERO DE FICHAS Y POR SU RENDIMIENTO EN FUNCIÓN DEL TAMAÑO DE LA PLATAFORMA EN DONDE SE DESARROLLAN E INSTALAN. LOS MANEJADORES COMERCIALES DE BASE DE DATOS PARA ESTOS EQUIPOS HAN SIDO CONSTRUIDOS DE ACUERDO CON NECESIDADES DE TIPO GENERAL EN EL MERCADO Y NO CONTEMPLAN LAS CARACTERÍSTICAS PROPIAS DE LA INFORMACIÓN BIBLIOGRÁFICA, POR LO QUE DECRECE SU RENDIMIENTO RÁPIDAMENTE EN FUNCIÓN AL TAMAÑO DEL BANCO DE DATOS. EN ESTA PRIMERA PARTE DEL DOCUMENTO SE ANALIZA ESA PROBLEMÁTICA Y LAS CARACTERÍSTICAS PROPIAS DE LA INFORMACIÓN BIBLIOGRÁFICA EN LO TOCANTE A SU INCLUSIÓN EN BANCOS DE DATOS ELECTRÓNICOS, Y SE PRESENTA UN MODELO DE COMPRESIÓN DE DATOS BIBLIOGRÁFICOS EN FORMA DE ALGORITMO QUE PERMITE ENTRE UN 40% Y 70% DE COMPRESIÓN SIN MENOSCABAR LAS CARACTERÍSTICAS PROPIAS DE LA INFORMACIÓN BIBLIOGRÁFICA. EN LA SEGUNDA PARTE DEL DOCUMENTO SE PRESENTAN LAS TÉCNICAS PARA CREAR Y COMPRIMIR ÍNDICES PRECONSTRUIDOS DE RECUPERACIÓN Y ARCHIVOS DE RECUPERACIÓN POR PALABRAS EN BÚSQUEDA LIBRE, ASÍ COMO LAS TÉCNICAS PARA ACCEDERLOS Y SERLE PRESENTADOS AL USUARIO FINAL. EN ESA PARTE SE CONCLUYE QUE BANCOS DE DATOS DE CIENTOS DE MILES DE FICHAS Y MILLONES DE PALABRAS DE RECUPERACIÓN PUEDEN COMPRIMIRSE EN EL ESPACIO DE UN CD-ROM (650MEGABYTES), Y AUN EXTRAPOLARSE ESTOS VALORES A COSTAS MUCHOS MAYORES. Instituto de Investigaciones Bibliotecológicas y de la Información 2001-07-01 info:eu-repo/semantics/article info:eu-repo/semantics/publishedVersion application/pdf http://rev-ib.unam.mx/ib/index.php/ib/article/view/3975 10.22201/iibi.0187358xp.2001.31.3975 Investigación Bibliotecológica. Archivonomía, bibliotecología e información; Vol. 15 No. 31 (2001) Investigación Bibliotecológica: archivonomía, bibliotecología e información; Vol. 15 Núm. 31 (2001) Investigación Bibliotecológica: archivonomía, bibliotecología e información; v. 15 n. 31 (2001) 2448-8321 0187-358X 10.22201/iibi.0187358xp.2001.31 spa http://rev-ib.unam.mx/ib/index.php/ib/article/view/3975/3527 Derechos de autor 2001 Investigación Bibliotecológica: archivonomía, bibliotecología e información |
spellingShingle | DATABASES INDEX AUTOMATION ALGORITHMS BASES DE DATOS AUTOMATIZACION DE INDICES ALGORITMOS RUÍZ VELASCO Y ROMO, MIGUEL AGUSTÍN Voutssás Márquez, Juan Compression Algorithms for the Storage of Bibliographic Information (First of two parts) |
title | Compression Algorithms for the Storage of Bibliographic Information (First of two parts) |
title_alt | Algoritmo de compresión para el almacenamiento de información bibliográfica (primera de dos partes) |
title_full | Compression Algorithms for the Storage of Bibliographic Information (First of two parts) |
title_fullStr | Compression Algorithms for the Storage of Bibliographic Information (First of two parts) |
title_full_unstemmed | Compression Algorithms for the Storage of Bibliographic Information (First of two parts) |
title_short | Compression Algorithms for the Storage of Bibliographic Information (First of two parts) |
title_sort | compression algorithms for the storage of bibliographic information (first of two parts) |
topic | DATABASES INDEX AUTOMATION ALGORITHMS BASES DE DATOS AUTOMATIZACION DE INDICES ALGORITMOS |
topic_facet | DATABASES INDEX AUTOMATION ALGORITHMS BASES DE DATOS AUTOMATIZACION DE INDICES ALGORITMOS |
url | http://rev-ib.unam.mx/ib/index.php/ib/article/view/3975 https://dx.doi.org/10.22201/iibi.0187358xp.2001.31.3975 |
work_keys_str_mv | AT ruizvelascoyromomiguelagustin compressionalgorithmsforthestorageofbibliographicinformationfirstoftwoparts AT voutssasmarquezjuan compressionalgorithmsforthestorageofbibliographicinformationfirstoftwoparts AT ruizvelascoyromomiguelagustin algoritmodecompresionparaelalmacenamientodeinformacionbibliograficaprimeradedospartes AT voutssasmarquezjuan algoritmodecompresionparaelalmacenamientodeinformacionbibliograficaprimeradedospartes |