Relational Action Bank with Semantic?Visual Attention for Few-Shot Action Recognition

Haoming Liang

Jinze Du

Hongchen Zhang

Bing Han and Yan Ma

Resumen

Recently, few-shot learning has attracted significant attention in the field of video action recognition, owing to its data-efficient learning paradigm. Despite the encouraging progress, identifying ways to further improve the few-shot learning performance by exploring additional or auxiliary information for video action recognition remains an ongoing challenge. To address this problem, in this paper we make the first attempt to propose a relational action bank with semantic?visual attention for few-shot action recognition. Specifically, we introduce a relational action bank as the auxiliary library to assist the network in understanding the actions in novel classes. Meanwhile, the semantic?visual attention is devised to adaptively capture the connections to the foregone actions via both semantic correlation and visual similarity. We extensively evaluate our approach via two backbone models (ResNet-50 and C3D) on HMDB and Kinetics datasets, and demonstrate that the proposed model can obtain significantly better performance compared against state-of-the-art methods. Notably, our results demonstrate an average improvement of about 6.2% when compared to the second-best method on the Kinetics dataset.

Palabras claves

semantic attention - visual attention - relational action bank - few-shot learning - action recognition

Acceso

P�GINAS

pp. 0 - 0

N�MERO

Volumen: 15 Parte: 3 (2023)

MATERIAS

INFRAESTRUCTURA

REVISTAS SIMILARES

Future Internet
Australasian Journal of Construction Economics and Building
Construction Economics and Building

DOI