Inicio  /  Applied Sciences  /  Vol: 13 Par: 14 (2023)  /  Artículo
ARTÍCULO
TITULO

Voice Deepfake Detection Using the Self-Supervised Pre-Training Model HuBERT

Lanting Li    
Tianliang Lu    
Xingbang Ma    
Mengjiao Yuan and Da Wan    

Resumen

In recent years, voice deepfake technology has developed rapidly, but current detection methods have the problems of insufficient detection generalization and insufficient feature extraction for unknown attacks. This paper presents a forged speech detection method (HuRawNet2_modified) based on a self-supervised pre-trained model (HuBERT) to improve detection (and address the above problems). A combination of impulsive signal-dependent additive noise and additive white Gaussian noise was adopted for data boosting and augmentation, and the HuBERT model was fine-tuned on different language databases. On this basis, the size of the extracted feature maps was modified independently by the a-feature map scaling (a-FMS) method, with a modified end-to-end method using the RawNet2 model as the backbone structure. The results showed that the HuBERT model could extract features more comprehensively and accurately. The best evaluation indicators were an equal error rate (EER) of 2.89% and a minimum tandem detection cost function (min t-DCF) of 0.2182 on the database of the ASVspoof2021 LA challenge, which verified the effectiveness of the detection method proposed in this paper. Compared with the baseline systems in databases of the ASVspoof 2021 LA challenge and the FMFCC-A, the values of EER and min t-DCF decreased. The results also showed that the self-supervised pre-trained model with fine-tuning can extract acoustic features across languages. And the detection can be slightly improved when the languages of the pre-trained database, and the fine-tuned and tested database are the same.

 Artículos similares

       
 
Christogonus U. Onukwube, Daniel O. Aikhuele and Shahryar Sorooshian    
Water distribution networks are complex systems that aid in the delivery of water to residential and non-residential areas. However, the networks can be affected by different types of faults, which could lead to the wastage of treated water. As such, the... ver más
Revista: Applied Sciences

 
Baobao Liu, Heying Wang, Zifan Cao, Yu Wang, Lu Tao, Jingjing Yang and Kaibing Zhang    
Defect detection holds significant importance in improving the overall quality of fabric manufacturing. To improve the effectiveness and accuracy of fabric defect detection, we propose the PRC-Light YOLO model for fabric defect detection and establish a ... ver más
Revista: Applied Sciences

 
Zhou Fang, Xiaoyong Wang, Liang Zhang and Bo Jiang    
Currently, deep learning is extensively utilized for ship target detection; however, achieving accurate and real-time detection of multi-scale targets remains a significant challenge. Considering the diverse scenes, varied scales, and complex backgrounds... ver más

 
Rong Wang, Xinyang Zhou, Yi Liu, Dongqi Liu, Yu Lu and Miao Su    
To ensure the safety and durability of concrete structures, timely detection and classification of concrete cracks using a low-cost and high-efficiency method is necessary. In this study, a concrete surface crack damage detection method based on the ResN... ver más
Revista: Applied Sciences

 
Abdul Rahaman Wahab Sait and Ali Mohammad Alorsan Bani Awad    
Coronary artery disease (CAD) is the most prevalent form of cardiovascular disease that may result in myocardial infarction. Annually, it leads to millions of fatalities and causes billions of dollars in global economic losses. Limited resources and comp... ver más
Revista: Applied Sciences