JOURNAL
Algorithms

TITLE

MKD: Mixup-Based Knowledge Distillation for Mandarin End-to-End Speech Recognition

Xing Wu

Yifan Jin

Jianjia Wang

Quan Qian and Yike Guo

Abstract

Large-scale automatic speech recognition (ASR) models have achieved impressive performance. However, training an ASR model requires huge computational resources and a massive amount of data. Knowledge distillation is a prevalent model compression method that transfers knowledge from a large model to a small one. To improve the efficiency of knowledge distillation for end-to-end speech recognition, especially in the low-resource setting, a Mixup-based Knowledge Distillation (MKD) method is proposed that combines Mixup, a data-agnostic data augmentation method, with softmax-level knowledge distillation. A loss-level mixture is presented to address the problem caused by the non-linearity of the label in the KL-divergence when applying Mixup to the teacher-student framework. It is shown mathematically that optimizing the mixture of loss functions is equivalent to optimizing an upper bound of the original knowledge distillation loss. The proposed MKD takes advantage of Mixup and brings robustness to the model even with a small amount of training data. Experiments on Aishell-1 show that MKD obtains relative improvements of 15.6% and 3.3% on two student models with different parameter scales compared with existing methods. Experiments on data efficiency demonstrate that MKD achieves similar results with only half of the original dataset.
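The upper-bound claim can be illustrated numerically. The sketch below is one plausible reading of the abstract, not the authors' formulation: it assumes the "original" distillation loss is the KL-divergence between a lambda-weighted mixture of two teacher distributions and the student distribution, and that the loss-level mixture is the same lambda-weighted combination of the two individual KL terms. All function names are hypothetical, and the distributions are random stand-ins for teacher and student softmax outputs. Under these assumptions the bound follows from the convexity of the KL-divergence in its first argument.

```python
import math
import random


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def kl_div(p, q):
    """KL(p || q) for two discrete distributions with q > 0 everywhere."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def mixed_label_kd_loss(p_a, p_b, q, lam):
    """Assumed 'original' loss: KL between the mixed teacher label and the student."""
    p_mix = [lam * a + (1.0 - lam) * b for a, b in zip(p_a, p_b)]
    return kl_div(p_mix, q)


def loss_level_kd_loss(p_a, p_b, q, lam):
    """Assumed loss-level mixture: mix the two KL terms instead of the labels."""
    return lam * kl_div(p_a, q) + (1.0 - lam) * kl_div(p_b, q)


def random_dist(rng, n):
    """Random stand-in for a softmax output over n classes."""
    return softmax([rng.gauss(0.0, 1.0) for _ in range(n)])


rng = random.Random(0)
p_a = random_dist(rng, 5)  # teacher output for the first utterance (stand-in)
p_b = random_dist(rng, 5)  # teacher output for the second utterance (stand-in)
q = random_dist(rng, 5)    # student output on the mixed input (stand-in)
lam = 0.3                  # Mixup coefficient, chosen arbitrarily here

label_level = mixed_label_kd_loss(p_a, p_b, q, lam)
loss_level = loss_level_kd_loss(p_a, p_b, q, lam)
print(f"label-level KL: {label_level:.6f}")
print(f"loss-level mix: {loss_level:.6f}")
```

In standard Mixup the student input would be the corresponding mixture of the two inputs, lam * x_a + (1 - lam) * x_b; here q simply stands in for the student's prediction on that mixed input.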

Keywords

end-to-end speech recognition - knowledge distillation - model compression - data efficiency - mixup

ISSUE

Volume: 15 Issue: 5 (2022)

SUBJECTS

CIVIL ENGINEERING AND CONSTRUCTION
TECHNOLOGY

DOI

https://doi.org/10.3390/a15050160
