PET: Parameter-efficient Knowledge Distillation on Transformer

Given a large Transformer model, how can we obtain a small and computationally efficient model that maintains the performance of the original? Transformers have shown significant performance improvements for many NLP tasks in recent years. However, their large size and expensive computational cost...

Bibliographic Details
Main Authors: Jeon, Hyojin, Park, Seungcheol, Kim, Jin-Gee, Kang, U.
Format: Online Article Text
Language: English
Published: Public Library of Science 2023
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10325108/
https://www.ncbi.nlm.nih.gov/pubmed/37410716
http://dx.doi.org/10.1371/journal.pone.0288060

Similar Items