Computers, Vol. 10, Issue 9 (2021)
Title

Fine-Grained Cross-Modal Retrieval for Cultural Items with Focal Attention and Hierarchical Encodings

Shurong Sheng, Katrien Laenen, Luc Van Gool and Marie-Francine Moens

Abstract

In this paper, we target two tasks of fine-grained image-text alignment and cross-modal retrieval in the cultural heritage domain: (1) given an image fragment of an artwork, we retrieve the noun phrases that describe it; (2) given a noun phrase that describes an artifact attribute, we retrieve the image fragment it specifies. To this end, we propose a weakly supervised alignment model: the correspondence between individual visual and textual fragments in the training input is not known, but the full image and the full text that refer to the same artwork are treated as a positive pair. The model first projects fragments from both modalities into a shared semantic space and then uses attention mechanisms to exploit the latent alignment between them; it is trained by increasing the image-text similarity of the positive pair in that common space. During this process, we encode the model inputs with hierarchical encodings and remove irrelevant fragments with different indicator functions. We also study techniques for augmenting the limited training data with synthetic relevant textual fragments and transformed image fragments. The model is later fine-tuned on a limited set of small-scale image-text fragment pairs. At test time, we rank image fragments and noun phrases by their intermodal similarity in the learned common space. Extensive experiments on two benchmark datasets demonstrate that our proposed models outperform two state-of-the-art methods adapted to fine-grained cross-modal retrieval of cultural items.
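The retrieval step described in the abstract (project both modalities into a common space, then rank by similarity) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the function names, the toy feature vectors, the projection matrices `W_IMG` and `W_TXT`, the use of cosine similarity, and the margin-based hinge loss, which is one common way of "increasing the similarity of the positive pair". The paper's attention mechanisms, hierarchical encodings and indicator functions are not reproduced here.

```python
import math

def normalize(v):
    """Scale a vector to unit length (returned unchanged if it is all zeros)."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v] if norm > 0 else list(v)

def project(features, weight):
    """Map a feature vector into the shared space: normalize(weight @ features)."""
    return normalize([sum(w * x for w, x in zip(row, features)) for row in weight])

def cosine(u, v):
    """Dot product of two unit-length vectors, i.e. their cosine similarity."""
    return sum(a * b for a, b in zip(u, v))

def rank(query, candidates):
    """Return candidate indices ordered from most to least similar, plus scores."""
    scores = [cosine(query, c) for c in candidates]
    order = sorted(range(len(candidates)), key=lambda i: -scores[i])
    return order, scores

def hinge_ranking_loss(pos_sim, neg_sim, margin=0.2):
    """Assumed training signal: zero once the positive pair wins by `margin`."""
    return max(0.0, margin - pos_sim + neg_sim)

# Hypothetical projection matrices (learned in practice, fixed here).
W_IMG = [[1.0, 0.0, 0.0], [0.0, 1.0, 1.0]]   # 3-d image features -> 2-d space
W_TXT = [[1.0, 0.0], [0.0, 1.0]]             # 2-d text features  -> 2-d space

# Toy inputs: one image fragment and three candidate noun phrases.
image_fragment = project([0.9, 0.1, 0.1], W_IMG)
phrases = ["gold necklace", "stone vase", "blue pendant"]
phrase_vectors = [project(f, W_TXT) for f in ([1.0, 0.2], [0.1, 1.0], [0.7, 0.7])]

# Task (1): retrieve noun phrases for an image fragment.
order, scores = rank(image_fragment, phrase_vectors)
ranked_phrases = [phrases[i] for i in order]
print(ranked_phrases)
```

With these toy vectors the phrases come back in the order "gold necklace", "blue pendant", "stone vase". Task (2), retrieving an image fragment for a noun phrase, uses the same `rank` function with the roles of query and candidates swapped.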
