Global Contextual Dependency Network for Object Detection

Junda Li, Chunxu Zhang and Bo Yang

Abstract

Current two-stage object detectors extract local visual features from Regions of Interest (RoIs) for object recognition and bounding-box regression. However, relying on local visual features alone discards global contextual dependencies, which help to recognize objects with featureless appearances and to suppress false detections. To address this problem, we present a simple framework, the Global Contextual Dependency Network (GCDN), that enhances the classification ability of two-stage detectors. GCDN consists mainly of two components: the Context Representation Module (CRM) and the Context Dependency Module (CDM). The CRM constructs multi-scale context representations, so that contextual information can be explored at different scales. The CDM captures global contextual dependencies, and GCDN includes multiple CDMs. Each CDM uses the local RoI features and a single-scale context representation to generate single-scale contextual RoI features via the attention mechanism. Finally, the contextual RoI features generated independently by the parallel CDMs are combined with the original RoI features to aid classification. Experiments on the MS-COCO 2017 benchmark dataset show that our approach brings consistent improvements to two-stage detectors.
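The attention step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the attention form, any learned projections, or how the features are combined. The sketch assumes plain scaled dot-product attention with RoI features as queries and context positions as keys and values, and it assumes element-wise addition as the combination step. The function names `context_dependency` and `fuse` are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def context_dependency(roi_feats, context_feats):
    """Hypothetical single-scale CDM: each RoI attends over all context positions.

    roi_feats:     N vectors of length d (used as queries)
    context_feats: M vectors of length d (used as keys and values, one scale)
    Returns N contextual RoI vectors of length d.
    Assumption: no learned projections, plain scaled dot-product attention.
    """
    d = len(roi_feats[0])
    contextual = []
    for query in roi_feats:
        # Similarity between this RoI and every context position.
        scores = [
            sum(q * k for q, k in zip(query, key)) / math.sqrt(d)
            for key in context_feats
        ]
        weights = softmax(scores)
        # Attention-weighted sum of the context vectors.
        contextual.append(
            [sum(w * v[j] for w, v in zip(weights, context_feats)) for j in range(d)]
        )
    return contextual


def fuse(roi_feats, multi_scale_context):
    """Apply one CDM per scale and add each result to the original RoI features.

    Assumption: element-wise addition stands in for the unspecified
    combination step mentioned in the abstract.
    """
    fused = [list(row) for row in roi_feats]
    for context_feats in multi_scale_context:
        contextual = context_dependency(roi_feats, context_feats)
        for i, row in enumerate(contextual):
            for j, value in enumerate(row):
                fused[i][j] += value
    return fused


# Two RoIs with 2-dimensional features and two context scales (toy values).
rois = [[1.0, 0.0], [0.0, 1.0]]
contexts = [
    [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]],  # finer scale: 3 positions
    [[0.5, 0.5]],                          # coarser scale: 1 position
]
fused = fuse(rois, contexts)
```

Each scale is processed independently, which mirrors the parallel CDMs in the abstract; the fused vectors keep the shape of the original RoI features, so a downstream classifier head would need no change under these assumptions.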

Keywords

object detection - global contextual dependency - multi-scale representations - attention mechanism

Issue

Volume 14, Issue 1 (2022)

Subjects

Infrastructure

DOI

https://doi.org/10.3390/fi14010027
