
Pea-KD: Parameter-efficient and accurate Knowledge Distillation on BERT

Knowledge Distillation (KD) is a widely used method for model compression. In essence, KD trains a smaller student model under the guidance of a larger teacher model, aiming to retain as much of the teacher's performance as possible. However, existing KD methods suffer from the following...
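For context, the snippet below is a minimal sketch of the generic soft-target KD objective the abstract refers to (temperature-scaled teacher and student logits matched with KL divergence, combined with the usual hard-label loss). It assumes a PyTorch setup with hypothetical student_logits, teacher_logits, and labels tensors, and illustrates plain KD, not the Pea-KD method proposed in the article.

import torch
import torch.nn.functional as F

def kd_loss(student_logits, teacher_logits, labels, temperature=2.0, alpha=0.5):
    # Soften both distributions with the temperature, then match them via KL divergence.
    soft_teacher = F.log_softmax(teacher_logits / temperature, dim=-1)
    soft_student = F.log_softmax(student_logits / temperature, dim=-1)
    distill = F.kl_div(soft_student, soft_teacher, log_target=True,
                       reduction="batchmean") * (temperature ** 2)
    # Standard supervised cross-entropy on the hard labels.
    hard = F.cross_entropy(student_logits, labels)
    # Weighted sum of the distillation and supervised terms.
    return alpha * distill + (1.0 - alpha) * hard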


Bibliographic Details
Main Authors: Cho, Ikhyun; Kang, U
Format: Online Article Text
Language: English
Published: Public Library of Science, 2022
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8856529/
https://www.ncbi.nlm.nih.gov/pubmed/35180258
http://dx.doi.org/10.1371/journal.pone.0263592

Similar Items