[Affiliation]: Department of Computer Science and Technology
[Funding]: National High Technology Research and Development Program of China (863 Program); the Basic Research Foundation of Tsinghua National Laboratory for Information Science and Technology (TNList); HP Labs Innovation Research Program
Excerpt from the paper
This paper presents an effective keyword search method for data-centric extensible markup language (XML) documents. The method divides an XML document into compact, connected, integral subtrees, called self-integral trees (SI-Trees), to capture the structural information in the document. The SI-Trees are generated based on a schema guide. Meaningful self-integral trees (MSI-Trees), which contain all or some of the input keywords, are then identified for the keyword search. Indexing is used to accelerate the retrieval of the MSI-Trees related to the input keywords. The MSI-Trees are ranked to identify the top-k results with the highest ranks. Extensive tests show that the method answers a keyword query in 10-100 ms and outperforms existing approaches by 1-2 orders of magnitude.
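The pipeline the abstract outlines (split the document into subtrees, index them by keyword, rank the matches, return the top k) can be illustrated with a minimal sketch. This is not the paper's algorithm: the sample document, the `UNIT_TAGS` set standing in for a schema guide, and the ranking by number of matched keywords are all assumptions made for illustration, since the abstract does not specify how SI-Trees are derived or how MSI-Trees are scored.

```python
import heapq
import xml.etree.ElementTree as ET
from collections import defaultdict

# Hypothetical sample document (not from the paper).
XML_DOC = """
<bib>
  <paper><title>XML keyword search</title><author>Li</author></paper>
  <paper><title>Sensor networks</title><author>Wang</author></paper>
  <book><title>XML in practice</title><author>Li</author></book>
</bib>
"""

# Assumption: a stand-in for the schema guide, naming the tags whose
# subtrees are treated as self-contained units.
UNIT_TAGS = {"paper", "book"}


def split_into_units(root, unit_tags):
    """Return, in document order, the subtrees rooted at a tag in unit_tags."""
    return [el for el in root.iter() if el.tag in unit_tags]


def tokens(unit):
    """Return the set of lowercased words found in the subtree's text."""
    return {w.lower() for text in unit.itertext() for w in text.split()}


def build_index(units):
    """Build an inverted index mapping each word to the ids of units containing it."""
    index = defaultdict(set)
    for uid, unit in enumerate(units):
        for word in tokens(unit):
            index[word].add(uid)
    return index


def top_k(index, keywords, k):
    """Return up to k (unit id, score) pairs; score = distinct keywords matched.

    Assumption: a simple match count replaces the paper's ranking function.
    Units matching only some keywords still qualify; ties go to the earlier unit.
    """
    scores = defaultdict(int)
    for kw in {w.lower() for w in keywords}:
        for uid in index.get(kw, ()):
            scores[uid] += 1
    return heapq.nlargest(k, scores.items(), key=lambda item: (item[1], -item[0]))


if __name__ == "__main__":
    units = split_into_units(ET.fromstring(XML_DOC), UNIT_TAGS)
    index = build_index(units)
    for uid, score in top_k(index, ["XML", "Li"], k=2):
        print(uid, score, units[uid].findtext("title"))
```

Because the index is built once, each query only touches the posting sets of its own keywords rather than rescanning the document, which is the general reason indexing speeds up retrieval.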