A Short Text Classification Method Based on N-Gram and CNN

来源 :电子学报(英文) | 被引量 : 0次 | 上传用户:czfczfc
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
Text classification is a fundamental task in Nature language process (NLP) application.Most existing research work relied on either explicate or implicit text representation to settle this kind of problems,while these techniques work well for sentence and can not simply apply to short text because of its shortness and sparseness feature.Given these facts that obtaining the simple word vector feature and ignoring the important feature by utilizing the traditional multi-size filter Convolution neural network (CNN) during the course of text classification task,we offer a kind of short text classification model by CNN,which can obtain the abundant text feature by adopting none linear sliding method and N-gram language model,and picks out the key features by using the concentration mechanism,in addition employing the pooling operation can preserve the text features at the most certain as far as possible.The experiment shows that this method we offered,comparing the traditional machine leing algorithm and convolutional neural network,can markedly improve the classification result during the short text classification.
其他文献
Text similarity measurements are the basis for measuring the degree of matching between two or more texts.Traditional large-scale similarity detection methods b
期刊
期刊
To efficiently handle high-dimensional continuous optimization problems,a Modified tree-seed algorithm(MTSA) is proposed by coupling a newly introduced control
Erhai Lake is the seventh largest freshwater lake in China where has been encountered with water pollution by the algae.One of important indicator of the water
As the abstraction and equivalent tech-nologies,simulation and bisimulation have been applied to the simplifications of some classical and uncertain models stru
期刊
With the rapid development of cloud storage,an increasing number of data owners prefer to outsource their data to the cloud server,which can greatly reduce the
The weighted traversal patt is important in software system for a better understanding of the intal structure and behavior of software.To mine important patts o
In this work,a novel high overload Ka-band power sensor with a Micro-electro-mechanical system (MEMS) cantilever beam is investigated in order to improve the me