Cargando…
FDup: a framework for general-purpose and efficient entity deduplication of record collections
Deduplication is a technique aiming at identifying and resolving duplicate metadata records in a collection. This article describes FDup (Flat Collections Deduper), a general-purpose software framework supporting a complete deduplication workflow to manage big data record collections: metadata recor...
Autores principales: | De Bonis, Michele, Manghi, Paolo, Atzori, Claudio |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9575841/ https://www.ncbi.nlm.nih.gov/pubmed/36262137 http://dx.doi.org/10.7717/peerj-cs.1058 |
Ejemplares similares
-
Rule-based deduplication of article records from bibliographic databases
por: Jiang, Yu, et al.
Publicado: (2014) -
Trends in cleaning relational data: consistency and deduplication
por: Ilyas, Ihab F, et al.
Publicado: (2015) -
Data deduplication for data optimization for storage and network systems
por: Kim, Daehee, et al.
Publicado: (2016) -
Static Memory Deduplication for Performance Optimization in Cloud Computing
por: Jia, Gangyong, et al.
Publicado: (2017) -
UMIc: A Preprocessing Method for UMI Deduplication and Reads Correction
por: Tsagiopoulou, Maria, et al.
Publicado: (2021)