
FDup: a framework for general-purpose and efficient entity deduplication of record collections

Deduplication is a technique aiming at identifying and resolving duplicate metadata records in a collection. This article describes FDup (Flat Collections Deduper), a general-purpose software framework supporting a complete deduplication workflow to manage big data record collections: metadata recor...


Bibliographic Details
Main authors: De Bonis, Michele; Manghi, Paolo; Atzori, Claudio
Format: Online Article Text
Language: English
Published: PeerJ Inc. 2022
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9575841/
https://www.ncbi.nlm.nih.gov/pubmed/36262137
http://dx.doi.org/10.7717/peerj-cs.1058