Information on the Internet is growing at an unprecedented rate, which has made information filtering an active research topic. This paper compares how different text splitting methods, mechanical (dictionary-based) word segmentation methods, and feature extraction thresholds affect the results of Chinese information filtering. The comparison offers guidance on choosing a suitable feature extraction method for Chinese information filtering, and leads to the conclusion that the N-gram method is comparable to mechanical word segmentation.
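To make the compared feature extraction options concrete, the following is a minimal Python sketch rather than the paper's implementation. It assumes that the N-gram method means overlapping character N-grams, that mechanical segmentation is done by forward maximum matching against a word list, and that the extraction threshold is a minimum frequency count. The abstract specifies none of these details, and the function names, the toy lexicon, and the parameter values are all hypothetical.

```python
from collections import Counter


def char_ngrams(text, n=2):
    """Split text into overlapping character n-grams (no dictionary needed)."""
    return [text[i:i + n] for i in range(len(text) - n + 1)]


def forward_max_match(text, lexicon, max_len=4):
    """Mechanical segmentation sketch: forward maximum matching.

    Assumption: at each position, take the longest lexicon entry that
    matches; fall back to a single character when nothing matches.
    """
    tokens = []
    i = 0
    while i < len(text):
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            if size == 1 or piece in lexicon:
                tokens.append(piece)
                i += size
                break
    return tokens


def select_features(tokens, min_count=2):
    """Keep only tokens whose frequency reaches the (assumed) threshold."""
    counts = Counter(tokens)
    return {tok: c for tok, c in counts.items() if c >= min_count}


if __name__ == "__main__":
    sample = "信息过滤研究信息过滤"
    toy_lexicon = {"信息", "过滤", "信息过滤", "研究"}  # hypothetical word list
    print(select_features(char_ngrams(sample, 2)))
    print(select_features(forward_max_match(sample, toy_lexicon)))
```

Both extractors feed the same thresholding step, so the effect of the segmentation choice and of the threshold can be varied independently, which is the kind of comparison the abstract describes.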