REVISTA
Applied Sciences

TODAS

Redirigiendo al acceso original de articulo en 17 segundos...

Inicio / Applied Sciences / Vol: 13 Par: 18 (2023) / Art�culo

ART�CULO

TITULO

Applying Object Detection and Embedding Techniques to One-Shot Class-Incremental Multi-Label Image Classification

Youngki Park and Youhyun Shin

Resumen

In this paper, we introduce an efficient approach to multi-label image classification that is particularly suited for scenarios requiring rapid adaptation to new classes with minimal training data. Unlike conventional methods that rely solely on neural networks trained on known classes, our model integrates object detection and embedding techniques to allow for the fast and accurate classification of novel classes based on as few as one sample image. During training, we use either Convolutional Neural Network (CNN)- or Vision Transformer-based algorithms to convert the provided sample images of new classes into feature vectors. At inference, a multi-object image is analyzed using low-threshold object detection algorithms, such as YOLOS or CutLER, identifying virtually all object-containing regions. These regions are subsequently converted into candidate vectors using embedding techniques. The k-nearest neighbors are identified for each candidate vector, and labels are assigned accordingly. Our empirical evaluation, using custom multi-label datasets featuring random objects and backgrounds, reveals that our approach substantially outperforms traditional methods lacking object detection. Notably, unsupervised object detection exhibited higher speed and accuracy than its supervised counterpart. Furthermore, lightweight CNN-based embeddings were found to be both faster and more accurate than Vision Transformer-based methods. Our approach holds significant promise for applications where classes are either rarely represented or continuously evolving.

Palabras claves

multi-label image classification - one-shot learning - object detection - embedding

Acceso

P�GINAS

pp. 0 - 0

N�MERO

Volumen: 13 Parte: 18 (2023)

MATERIAS

INGENIER�A Y CONSTRUCCI�N CIVIL
TECNOLOG�A

REVISTAS SIMILARES

Applied Sciences
Algorithms
Journal of Marine Science and Engineering

DOI

https://doi.org/10.3390/app131810468

Art�culos similares

Detection of Small Objects in Side-Scan Sonar Images Using an Enhanced YOLOv7-Based Approach

Acceso

Feihu Zhang, Wei Zhang, Chensheng Cheng, Xujia Hou and Chun Cao

Deep learning-based object detection methods have demonstrated remarkable effectiveness across various domains. Recently, there has been growing interest in applying these techniques to underwater environments. Conventional optical imaging methods face s... ver m�s

Revista: Journal of Marine Science and Engineering

Radio-Frequency Energy Harvesting Using Rapid 3D Plastronics Protoyping Approach: A Case Study

Acceso

Xuan Viet Linh Nguyen, Tony Gerges, Pascal Bevilacqua, Jean-Marc Duchamp, Philippe Benech, Jacques Verdier, Philippe Lombard, Pangsui Usifu Linge, Fabien Mieyeville, Michel Cabrera and Bruno Allard

Harvesting of ambient radio-frequency energy is largely covered in the literature. The RF energy harvester is considered most of the time as a standalone board. There is an interest to add the RF harvesting function on an already-designed object. Polymer... ver m�s

Revista: Journal of Low Power Electronics and Applications

Self-Supervised Noise Reduction in Low-Dose Cone Beam Computed Tomography (CBCT) Using the Randomly Dropped Projection Strategy

Acceso

Young-Joo Han and Ha-Jin Yu

Deep learning-based denoising methods have proved efficient for medical imaging. Obtaining a three-dimensional representation of a scanned object is essential, such as in the computed tomography (CT) system. A sufficient radiation dose needs to be irradi... ver m�s

Revista: Applied Sciences

Design and Acceleration of Field Programmable Gate Array-Based Deep Learning for Empty-Dish Recycling Robots

Acceso

Zhichen Wang, Hengyi Li, Xuebin Yue and Lin Meng

As the proportion of the working population decreases worldwide, robots with artificial intelligence have been a good choice to help humans. At the same time, field programmable gate array (FPGA) is generally used on edge devices including robots, and it... ver m�s

Revista: Applied Sciences

GenericConv: A Generic Model for Image Scene Classification Using Few-Shot Learning

Acceso

Mohamed Soudy, Yasmine M. Afify and Nagwa Badr

Scene classification is one of the most complex tasks in computer-vision. The accuracy of scene classification is dependent on other subtasks such as object detection and object classification. Accurate results may be accomplished by employing object det... ver m�s

Revista: Information

Revistas destacadas

Acceso directo a los n�meros publicados en la revista Infrastructures

Infrastructures

Acceso directo a los n�meros publicados en la revista Informed Infraestructure

Informed Infraestructure

Acceso directo a los n�meros publicados en la revista BiT

Acceso directo a los n�meros publicados en la revista Revista de la Construcci�n

Revista de la Construcci�n

Ver todas las revistas disponibles