Cargando…
EnViTSA: Ensemble of Vision Transformer with SpecAugment for Acoustic Event Classification
Recent successes in deep learning have inspired researchers to apply deep neural networks to Acoustic Event Classification (AEC). While deep learning methods can train effective AEC models, they are susceptible to overfitting due to the models’ high complexity. In this paper, we introduce EnViTSA, a...
Autores principales: | Lim, Kian Ming, Lee, Chin Poo, Lee, Zhi Yang, Alqahtani, Ali |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10674441/ https://www.ncbi.nlm.nih.gov/pubmed/38005472 http://dx.doi.org/10.3390/s23229084 |
Ejemplares similares
-
Plant-CNN-ViT: Plant Classification with Ensemble of Convolutional Neural Networks and Vision Transformer
por: Lee, Chin Poo, et al.
Publicado: (2023) -
HGR-ViT: Hand Gesture Recognition with Vision Transformer
por: Tan, Chun Keat, et al.
Publicado: (2023) -
Gait-ViT: Gait Recognition with Vision Transformer
por: Mogan, Jashila Nair, et al.
Publicado: (2022) -
Gait-CNN-ViT: Multi-Model Gait Recognition with Convolutional Neural Networks and Vision Transformer
por: Mogan, Jashila Nair, et al.
Publicado: (2023) -
ViTT: Vision Transformer Tracker
por: Zhu, Xiaoning, et al.
Publicado: (2021)