Inicio  /  Information  /  Vol: 14 Par: 10 (2023)  /  Artículo
ARTÍCULO
TITULO

Comparative Analysis of Deep Learning Architectures and Vision Transformers for Musical Key Estimation

Manav Garg    
Pranshav Gajjar    
Pooja Shah    
Madhu Shukla    
Biswaranjan Acharya    
Vassilis C. Gerogiannis and Andreas Kanavos    

Resumen

The musical key serves as a crucial element in a piece, offering vital insights into the tonal center, harmonic structure, and chord progressions while enabling tasks such as transposition and arrangement. Moreover, accurate key estimation finds practical applications in music recommendation systems and automatic music transcription, making it relevant across academic and industrial domains. This paper presents a comprehensive comparison between standard deep learning architectures and emerging vision transformers, leveraging their success in various domains. We evaluate their performance on a specific subset of the GTZAN dataset, analyzing six different deep learning models. Our results demonstrate that DenseNet, a conventional deep learning architecture, achieves remarkable accuracy of 91.64%, outperforming vision transformers. However, we delve deeper into the analysis to shed light on the temporal characteristics of each deep learning model. Notably, the vision transformer and SWIN transformer exhibit a slight decrease in overall performance (1.82% and 2.29%, respectively), yet they demonstrate superior performance in temporal metrics compared to the DenseNet architecture. The significance of our findings lies in their contribution to the field of musical key estimation, where accurate and efficient algorithms play a pivotal role. By examining the strengths and weaknesses of deep learning architectures and vision transformers, we can gain valuable insights for practical implementations, particularly in music recommendation systems and automatic music transcription. Our research provides a foundation for future advancements and encourages further exploration in this area.

 Artículos similares

       
 
George Westergaard, Utku Erden, Omar Abdallah Mateo, Sullaiman Musah Lampo, Tahir Cetin Akinci and Oguzhan Topsakal    
Automated Machine Learning (AutoML) tools are revolutionizing the field of machine learning by significantly reducing the need for deep computer science expertise. Designed to make ML more accessible, they enable users to build high-performing models wit... ver más
Revista: Information

 
Hamed Taherdoost and Mitra Madanchian    
Blockchain technology has become a powerful disruptive force that upends established ideas in several industries. A fascinating point of convergence is that of blockchain technology and Business Process Management (BPM), where the distributed and immutab... ver más
Revista: Information

 
Marcin Klosok, Daria Gendosz de Carrillo, Piotr Laszczyca, Tomasz Plociniczak, Halina Jedrzejowska-Szypulka and Tomasz Sawczyn    
Revista: Applied Sciences

 
Siarhei Autsou, Karolina Kudelina, Toomas Vaimann, Anton Rassõlkin and Ants Kallaste    
Servomotors have found widespread application in many areas, such as manufacturing, robotics, automation, and others. Thus, the control of servomotors is divided into various principles and methods, leading to a high diversity of control systems. This ar... ver más
Revista: Applied Sciences

 
Carolina Bona-Sánchez, Heidi Salokangas and Kaisa Sorsa    
This study explores the complexities of cost behavior in the textile industry, conducting a comparative analysis between firms in the Nordic countries and Spain. Our main goal is to examine how distinct economic and corporate governance models impact the... ver más
Revista: Applied Sciences