ARTÍCULO
TITULO

Chinese Text Classification Model Based on Deep Learning

Yue Li    
Xutao Wang and Pengjian Xu    

Resumen

Text classification is of importance in natural language processing, as the massive text information containing huge amounts of value needs to be classified into different categories for further use. In order to better classify text, our paper tries to build a deep learning model which achieves better classification results in Chinese text than those of other researchers? models. After comparing different methods, long short-term memory (LSTM) and convolutional neural network (CNN) methods were selected as deep learning methods to classify Chinese text. LSTM is a special kind of recurrent neural network (RNN), which is capable of processing serialized information through its recurrent structure. By contrast, CNN has shown its ability to extract features from visual imagery. Therefore, two layers of LSTM and one layer of CNN were integrated to our new model: the BLSTM-C model (BLSTM stands for bi-directional long short-term memory while C stands for CNN.) LSTM was responsible for obtaining a sequence output based on past and future contexts, which was then input to the convolutional layer for extracting features. In our experiments, the proposed BLSTM-C model was evaluated in several ways. In the results, the model exhibited remarkable performance in text classification, especially in Chinese texts.

 Artículos similares

       
 
Jianzhuo Yan, Lihong Chen, Yongchuan Yu, Hongxia Xu, Qingcai Gao, Kunpeng Cao and Jianhui Chen    
With the rapid development of the internet and social media, extracting emergency events from online news reports has become an urgent need for public safety. However, current studies on the text mining of emergency information mainly focus on text class... ver más

 
Andrey Bogdanchikov, Dauren Ayazbayev and Iraklis Varlamis    
The rapid development of natural language processing and deep learning techniques has boosted the performance of related algorithms in several linguistic and text mining tasks. Consequently, applications such as opinion mining, fake news detection or doc... ver más

 
Xiaohui He, Chuan Liu, Lili Wu, Yongji Wang and Zhihui Tian    
Local chronicles are a kind of historical record in China that are written in detail and play an important role in the transmission of local history and culture. Due to the single-text-carrier form of local chronicles, people have limited access to infor... ver más

 
Xinlu Li, Yuanyuan Lei and Shengwei Ji    
Sentiment analysis of online Chinese buzzwords (OCBs) is important for healthy development of platforms, such as games and social networking, which can avoid transmission of negative emotions through prediction of users? sentiment tendencies. Buzzwords h... ver más
Revista: Future Internet

 
Jingbo Wang, Yu Xia and Yuting Wu    
The distribution and sentiment characteristics of tourists directly reflect the state of tourism development, and are an important reference for tourists to choose scenic areas. Sensing the tourist distributions and their sentiment variations can provide... ver más