Cargando…
FDup: a framework for general-purpose and efficient entity deduplication of record collections
Deduplication is a technique aiming at identifying and resolving duplicate metadata records in a collection. This article describes FDup (Flat Collections Deduper), a general-purpose software framework supporting a complete deduplication workflow to manage big data record collections: metadata recor...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9575841/ https://www.ncbi.nlm.nih.gov/pubmed/36262137 http://dx.doi.org/10.7717/peerj-cs.1058 |