Cargando…
Dataset construction method of cross-lingual summarization based on filtering and text augmentation
Existing cross-lingual summarization (CLS) datasets consist of inconsistent sample quality and low scale. To address these problems, we propose a method that jointly supervises quality and scale to build CLS datasets. In terms of quality supervision, the method adopts a multi-strategy filtering algo...
Autores principales: | Pan, Hangyu, Xi, Yaoyi, Wang, Ling, Nan, Yu, Su, Zhizhong, Cao, Rong |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10280405/ https://www.ncbi.nlm.nih.gov/pubmed/37346668 http://dx.doi.org/10.7717/peerj-cs.1299 |
Ejemplares similares
-
Reaching for upper bound ROUGE score of extractive summarization methods
por: Akhmetov, Iskander, et al.
Publicado: (2022) -
Syntactic- and morphology-based text augmentation framework for Arabic sentiment analysis
por: Duwairi, Rehab, et al.
Publicado: (2021) -
Generative adversarial network based adaptive data augmentation for handwritten Arabic text recognition
por: Eltay, Mohamed, et al.
Publicado: (2022) -
Small facial image dataset augmentation using conditional GANs based on incomplete edge feature input
por: Hung, Shih-Kai, et al.
Publicado: (2021) -
Introducing DynaPTI–constructing a dynamic patent technology indicator using text mining and machine learning
por: Freunek, Michael, et al.
Publicado: (2023)