【摘 要】
:
To avoid the curse of dimensionality, text categorization (TC) algorithms based on machine learning (ML) have to use an feature selection (FS) method to reduce
【机 构】
:
Information Engineering School,China State Information Center
论文部分内容阅读
To avoid the curse of dimensionality, text categorization (TC) algorithms based on machine learning (ML) have to use an feature selection (FS) method to reduce the dimensionality of feature space. Although having been widely used, FS process will generally cause information losing and then have much side-effect on the whole performance of TC algorithms. On the basis of the sparsity characteristic of text vectors, a new TC algorithm based on lazy feature selection (LFS) is presented. As a new type of embedded feature selection approach, the LFS method can greatly reduce the dimension of features without any information losing, which can improve both efficiency and performance of algorithms greatly. The experiments show the new algorithm can simultaneously achieve much higher both performance and efficiency than some of other classical TC algorithms.
其他文献
The effects of four parameters, gas flow, rotational speed, refining time, and stewing time, on the rotary impeller refinement of 7075 Al were studied. The effe
Motion estimation is an important and intensive task in video coding applications. Since the complex-ity of integer pixel search has been greatly reduced by the
By using the concept of finite-part integral, a set of hypersingular integro-differential equations for multiple interracial cracks in a three-dimensional infin
An embedded cryptosystem needs higher reconfiguration capability and security. After analyzing the newly emerging side-channel attacks on elliptic curve cryptos
Radio frequency identification (RFID) has prominent advantages compared with other auto-identification technologies. Combining RFID with network technology, phy
To improve the agility, dynamics, composability, reusability, and development efficiency restricted by monolithic federation object model (FOM), a modular FOM i
一年级新生刚开学时,总会出现一些似曾相识的场景:教师站在教室门口迎接新同学,隐隐约约听见一阵哭声,一探头,看见一位家长和孩子在教室门口拉拉扯扯,孩子一个劲地哭,家长好不容易把
The existing form and grain refining effects of small zirconium addition in pure Mg, Mg-Yb and Mg-Zn binary alloys, and Mg-Zn-Yb ternary alloy (ZK60-Yb) were in
西方美学观念最重要的一条,美必须是和谐的,科学技术首先要和人类生存的环境和谐,但现如今,各国在追求科技进步的同时却和环境保护这一主题越来越远,科学技术和环境不和谐的
In-situ Al2O3/TiAl composites were successfully synthesized from the starting powders of Ti, Al, TiO2 and Nb2O5. The oxidation behavior of the composites at 900