Inicio  /  Information  /  Vol: 10 Par: 11 (2019)  /  Artículo
ARTÍCULO
TITULO

Improving Basic Natural Language Processing Tools for the Ainu Language

Karol Nowakowski    
Michal Ptaszynski    
Fumito Masui and Yoshio Momouchi    

Resumen

Ainu is a critically endangered language spoken by the native inhabitants of northern Japan. This paper describes our research aimed at the development of technology for automatic processing of text in Ainu. In particular, we improved the existing tools for normalizing old transcriptions, word segmentation, and part-of-speech tagging. In the experiments we applied two Ainu language dictionaries from different domains (literary and colloquial) and created a new data set by combining them. The experiments revealed that expanding the lexicon had a positive impact on the overall performance of our tools, especially with test data unrelated to any of the training sets used.

 Artículos similares

       
 
Dipima Buragohain, Grisana Punpeng, Sureenate Jaratjarungkiat and Sushank Chaudhary    
Recent technology implementation in learning has inspired language educators to employ various e-learning techniques, strategies, and applications in their pedagogical practices while aiming at improving specific learning efficiencies of students. The cu... ver más
Revista: Informatics

 
Pau Fonseca i Casas, Iza Romanowska and Joan Garcia i Subirana    
Specification and Description Language (SDL) is a language that can represent the behavior and structure of a model completely and unambiguously. It allows the creation of frameworks that can run a model without the need to code it in a specific programm... ver más
Revista: Computers

 
Wenhua Yu, Mayire Ibrayim and Askar Hamdulla    
Text recognition is an important research topic in computer vision. Scene text, which refers to the text in real scenes, sometimes needs to meet the requirement of attracting attention, and there is the situation such as deformation. At the same time, th... ver más
Revista: Information

 
Wiem Chebil, Mohammad Wedyan, Moutaz Alazab, Ryan Alturki and Omar Elshaweesh    
This research proposes a new approach to improve information retrieval systems based on a multinomial naive Bayes classifier (MNBC), Bayesian networks (BNs), and a multi-terminology which includes MeSH thesaurus (Medical Subject Headings) and SNOMED CT (... ver más
Revista: Information

 
Tao Peng, Kun She, Yimin Shen, Xiangliang Xu and Yue Yu    
Requirement traceability links are an essential part of requirement management software and are a basic prerequisite for software artifact changes. The manual establishment of requirement traceability links is time-consuming. When faced with large projec... ver más
Revista: Information