Inicio  /  Informatics  /  Vol: 6 Par: 2 (2019)  /  Artículo
ARTÍCULO
TITULO

Improving Semantic Similarity with Cross-Lingual Resources: A Study in Bangla?A Low Resourced Language

Rajat Pandit    
Saptarshi Sengupta    
Sudip Kumar Naskar    
Niladri Sekhar Dash and Mohini Mohan Sardar    

Resumen

Semantic similarity is a long-standing problem in natural language processing (NLP). It is a topic of great interest as its understanding can provide a look into how human beings comprehend meaning and make associations between words. However, when this problem is looked at from the viewpoint of machine understanding, particularly for under resourced languages, it poses a different problem altogether. In this paper, semantic similarity is explored in Bangla, a less resourced language. For ameliorating the situation in such languages, the most rudimentary method (path-based) and the latest state-of-the-art method (Word2Vec) for semantic similarity calculation were augmented using cross-lingual resources in English and the results obtained are truly astonishing. In the presented paper, two semantic similarity approaches have been explored in Bangla, namely the path-based and distributional model and their cross-lingual counterparts were synthesized in light of the English WordNet and Corpora. The proposed methods were evaluated on a dataset comprising of 162 Bangla word pairs, which were annotated by five expert raters. The correlation scores obtained between the four metrics and human evaluation scores demonstrate a marked enhancement that the cross-lingual approach brings into the process of semantic similarity calculation for Bangla.

 Artículos similares

       
 
Alexandra Nunes and Aníbal Matos    
Nowadays, semantic segmentation is used increasingly often in exploration by underwater robots. For example, it is used in autonomous navigation so that the robot can recognise the elements of its environment during the mission to avoid collisions. Other... ver más

 
Songnan Chen, Mengxia Tang, Ruifang Dong and Jiangming Kan    
The semantic segmentation of outdoor images is the cornerstone of scene understanding and plays a crucial role in the autonomous navigation of robots. Although RGB?D images can provide additional depth information for improving the performance of semanti... ver más
Revista: Applied Sciences

 
Li He, Qian Zhang, Jianyong Duan and Hao Wang    
Open-domain event extraction is a fundamental task that aims to extract non-predefined types of events from news clusters. Some researchers have noticed that its performance can be enhanced by improving dependency relationships. Recently, graphical convo... ver más
Revista: Applied Sciences

 
Xiao Chen, Mujiahui Yuan, Qi Yang, Haiyang Yao and Haiyan Wang    
Underwater target detection using optical images is a challenging yet promising area that has witnessed significant progress. However, fuzzy distortions and irregular light absorption in the underwater environment often lead to image blur and color bias,... ver más

 
Tao Peng, Kun She, Yimin Shen, Xiangliang Xu and Yue Yu    
Requirement traceability links are an essential part of requirement management software and are a basic prerequisite for software artifact changes. The manual establishment of requirement traceability links is time-consuming. When faced with large projec... ver más
Revista: Information