Redirigiendo al acceso original de articulo en 23 segundos...
Inicio  /  Applied Sciences  /  Vol: 13 Par: 5 (2023)  /  Artículo
ARTÍCULO
TITULO

Prompt-Based Word-Level Information Injection BERT for Chinese Named Entity Recognition

Qiang He    
Guowei Chen    
Wenchao Song and Pengzhou Zhang    

Resumen

Named entity recognition (NER) is a subfield of natural language processing (NLP) that identifies and classifies entities from plain text, such as people, organizations, locations, and other types. NER is a fundamental task in information extraction, information retrieval, and text summarization, as it helps to organize the relevant information in a structured way. The current approaches to Chinese named entity recognition do not consider the category information of matched Chinese words, which limits their ability to capture the correlation between words. This makes Chinese NER more challenging than English NER, which already has well-defined word boundaries. To improve Chinese NER, it is necessary to develop new approaches that take into account category features of matched Chinese words, and the category information would help to effectively capture the relationship between words. This paper proposes a Prompt-based Word-level Information Injection BERT (PWII-BERT) to integrate prompt-guided lexicon information into a pre-trained language model. Specifically, we engineer a Word-level Information Injection Adapter (WIIA) through the original Transformer encoder and prompt-guided Transformer layers. Thus, the ability of PWII-BERT to explicitly obtain fine-grained character-to-word relevant information according to the category prompt is one of its key advantages. In experiments on four benchmark datasets, PWII-BERT outperforms the baselines, demonstrating the significance of fully utilizing the advantages of fusing the category information and lexicon feature to implement Chinese NER.

 Artículos similares

       
 
Yang Lu, Xiaochun Wang, Yijun He, Jiping Liu, Jiangbo Jin, Jian Cao, Juanxiong He, Yongqiang Yu, Xin Gao, Mirong Song and Yiming Zhang    
Two coupled climate models that participated in the CMIP6 project (Coupled Model Intercomparison Project Phase 6), the Earth System Model of Chinese Academy of Sciences version 2 (CAS-ESM2-0), and the Nanjing University of Information Science and Technol... ver más

 
Shifeng Chen, Jialin Wang and Ketai He    
The popularization of the internet and the widespread use of smartphones have led to a rapid growth in the number of social media users. While information technology has brought convenience to people, it has also given rise to cyberbullying, which has a ... ver más
Revista: Information

 
Shiqian Guo, Yansun Huang, Baohua Huang, Linda Yang and Cong Zhou    
This paper proposed a method for improving the XLNet model to address the shortcomings of segmentation algorithm for processing Chinese language, such as long sub-word lengths, long word lists and incomplete word list coverage. To address these issues, w... ver más
Revista: Applied Sciences

 
Shuiyan Li, Rongzhi Qi and Shengnan Zhang    
Compared with English named entity recognition (NER), Chinese NER faces significant challenges due to the flexible, non-standard word formation and vague word boundaries, which cause a lot of boundary ambiguity and reduce the accuracy of entity identific... ver más
Revista: Applied Sciences

 
Miao Zhang and Ling Lu    
Revista: Applied Sciences