论文部分内容阅读
《说文解字》含有关于先秦文献语言和文字的多方面的资料。深入的研究它,是研究汉民族语言文字学史的重要内容。现有电子版《说文》大多只是《说文》原文文本的电子化,应用价值有限。要想更好地应用计算机研究《说文》,就必须对其文本进行详细标注,而标注规范的合理性和可用性,决定了标注的价值。本文选择XML作为标记语言,在详细分析《说文》内容结构的基础上,设计了符合其特点的XML标注规范(Schema);并在对《说文》全文进行XML标注基础上开发了更加符合用户需求的全文检索工具。
“Shuowen Jiezi” contains a lot of information about the language and script of the pre-Qin literature. Studying it in depth is an important part of studying the history of Han nationality language and literature. The existing electronic versions of “Shuo Wen” are mostly electronic versions of “Shuo Wen” and have limited application value. In order to better apply the computer research “Shuo Wen”, we must make a detailed annotation of the text, and the rationality and usability of the annotation specification determine the annotation value. This paper chooses XML as the markup language, based on a detailed analysis of the content structure of “Shuo Wen”, we design a XML Schema that conforms to its characteristics. On the basis of XML annotation of the full text of Shuo Wen, Full-text search tool for user needs.