论文部分内容阅读
设计化学主题数据库,实现中国科学院化学化工领域的数据库集成,方法是比较3种主流的数据集成法(数据仓库、联邦数据库集成模式),归纳出联邦数据库集成模式,其优势在于保留了成员数据子库的独立性,各子库可独立地进行维护和更新,它适用于数据类型差别较大,分布异构而且不便统一更新的中国科学院化学化工数据子库实现数据集成。针对中国科学院化学化工领域数据子库的特征,在传统的联邦数据库集成模式上增加数据集成模型作为扩展,以便将数据资源组织起来,构成一个基于化合物唯一标识的相互联系的数据集成平台。在数据集成模型的设计上,比较了以学科分类为根节点和以化合物为根节点2种不同的模型建立方式,其中以化合物为根节点的概念树模型(数据集成模型)能够明显简化数据库用户的检索步骤,有利于化学化工数据库的集成与表达。在用户接口方面,本文着重设计了统一检索入口和可视化显示界面,前者解决了用户在不同的专业数据库之间跳转的问题,后者将来自不同数据源的检索结果按照预设数据模型,分层级分节点的显示给用户。
The design of the chemical subject database, the realization of the Chinese Academy of Sciences, chemical and chemical database integration, the method is to compare the three kinds of mainstream data integration (data warehouse, federal database integration model), summarized the federal database integration model, the advantage is to retain the member data The independence of the library, each sub-library can be independently maintained and updated, it applies to data types with different data types, heterogeneous heterogeneous and inconvenient to update the unified unified database of chemical and chemical data of the Chinese Academy of Sciences to achieve data integration. According to the characteristics of the data sub-database of Chinese Academy of Sciences in the field of chemical and chemical engineering, the data integration model is added as an extension to the traditional federated database integration mode so as to organize the data resources to form an interconnected data integration platform based on the compound unique identification. In the design of data integration model, we compared two different model establishment models based on subject classification as root node and compound as root node, in which the concept tree model with compound as the root node (data integration model) can significantly simplify the database users The retrieval step is conducive to chemical and chemical database integration and expression. In the aspect of user interface, this paper focuses on designing a unified search portal and visual display interface. The former solves the problem of users jumping between different professional databases. The latter combines the retrieval results from different data sources according to the preset data model, Hierarchical sub-node display to the user.