Keyword Searches in Data-Centric XML Documents Using Tree Partitioning

来源 :清华大学学报(英文版) | 被引量 : 0次 | 上传用户:zwx2738
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
This paper presents an effective keyword search method for data-centric extensive markup language (XML) documents.The method divides an XML document into compact connected integral subtrees,called self-integral trees (Si-Trees),to capture the structural information in the XML document.The Si-Trees are generated based on a schema guide.Meaningful self-integral trees (MSI-Trees) are identified,which contain all or some of the input keywords for the keyword search in the XML documents.Indexing is used to accelerate the retrieval of MSI-Trees related to the input keywords.The MSI-Trees are ranked to identify the top-k results with the highest ranks.Extensive tests demonstrate that this method costs 10-100 ms to answer a keyword query,and outperforms existing approaches by 1-2 orders of magnitude.
其他文献
总结改革开放30年来.武汉市的城乡规划在城市发展战略选择、城市空问拓辰和城市重点地区开发、重大项目建设中的重要作用.其中,对城市战略发展的意义在于,在国家和省市政府的
This paper describes a distributed estimation scheme (DES) for a bandwidth constrained ad hoc sensor network.The DES is universal in the sense that operations o
Based on the text orientation classification, a new measurement approach to semantic orientation of words was proposed. According to the integrated and detailed
A ZSM-5/MOR co-crystalline zeolite was synthesized without using the template. The physico-chemical proper-ties of the zeolite were characterized by XRD, FT-IR,
This paper presents a model predictive control(MPC) scheme for the retrieval of an electrodynamic tethered sub-satellite in an inclined orbit.The scheme account
Aimed at the generation of high-quality test set in the shortest possible time, the test generation for combinational circuits (CC) based on the chaotic particl
Hydraulic excavator is one type of the most widely applied construction equipment for various applications mainly because of its versatility and mobility. Among
The strong stiction of adjacent surfaces with meniscus is a major design concern in the devices with a micro-sized interface.Today, more and more research works
Because of different system capacities of base station (BS) or access point (AP) and ununiformity of traffic distribution in different cells, quantities of new
本文通过对荣华二采区10