Journal of Guangxi Normal University (Natural Science Edition), 2011, Vol. 29, Issue 3: 157-162.


An Improved Query Method for XML Documents Based on Text Nodes

WANG Qi (王齐), LIU Quan-ming (刘全明)

  1. School of Computer and Information Technology, Shanxi University, Taiyuan, Shanxi 030006, China
  • Received: 2011-05-15 Online: 2011-08-20 Published: 2018-12-03
  • Corresponding author: LIU Quan-ming (born 1973), male, from Taiyuan, Shanxi; associate professor at Shanxi University, Ph.D. E-mail: liuqm@sxu.edu.cn
  • Supported by:
    National Natural Science Foundation of China (60970053)

Abstract: With the continued adoption of XML-related standards and technologies, a large amount of XML data has appeared on the network, and XML has become the standard for representing and exchanging network data. Precise and fast querying over massive collections of XML documents has become a main direction of current XML research. Because XML documents are semi-structured, querying the data stored in them requires building a suitable index and designing a dedicated query algorithm. This paper improves an XML index based on text nodes by adding a data type identifier for each text node to the original index, which yields a new query method. Using the data type identifier in the index, the method supports not only the string matching queries of the original algorithm but also arithmetic queries and comparison queries. Because queries combine a relational database with the original XML documents, the new method trades a small increase in external storage for a large reduction in memory usage and CPU time. Compared with the original method, query efficiency improves significantly.

Key words: XML, text node index, arithmetic query, query engine

CLC number: TP392
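
The abstract does not give the index schema or the query algorithm, so the following is only a minimal sketch of the general idea it describes: text nodes are stored in a relational table together with a data type tag, and that tag decides whether a value may take part in numeric comparison or arithmetic. Everything named here is an assumption made for illustration and is not taken from the paper: the table `text_node`, its columns, the two tags `'num'` and `'str'`, the helper functions, and the sample document.

```python
import sqlite3
import xml.etree.ElementTree as ET

# Hypothetical sample document, used only for illustration.
XML_DOC = """<books>
  <book><title>XML Basics</title><price>30</price></book>
  <book><title>Query Processing</title><price>45.5</price></book>
  <book><title>Indexing</title><price>12</price></book>
</books>"""


def classify(text):
    """Return an assumed data type tag for a text node: 'num' or 'str'."""
    try:
        float(text)
        return "num"
    except ValueError:
        return "str"


def build_index(xml_text):
    """Store every non-empty text node as (path, value, dtype) in SQLite.

    Only element text is indexed; tail text of mixed content is ignored.
    """
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE text_node (path TEXT, value TEXT, dtype TEXT)")
    root = ET.fromstring(xml_text)

    def walk(elem, path):
        text = (elem.text or "").strip()
        if text:
            conn.execute(
                "INSERT INTO text_node VALUES (?, ?, ?)",
                (path, text, classify(text)),
            )
        for child in elem:
            walk(child, path + "/" + child.tag)

    walk(root, "/" + root.tag)
    return conn


def match_query(conn, keyword):
    """String matching query over text nodes tagged 'str'."""
    rows = conn.execute(
        "SELECT path, value FROM text_node "
        "WHERE dtype = 'str' AND value LIKE ? ORDER BY rowid",
        ("%" + keyword + "%",),
    )
    return rows.fetchall()


def compare_query(conn, path, threshold):
    """Comparison query: numeric values at `path` greater than `threshold`."""
    rows = conn.execute(
        "SELECT value FROM text_node "
        "WHERE path = ? AND dtype = 'num' AND CAST(value AS REAL) > ? "
        "ORDER BY rowid",
        (path, threshold),
    )
    return [float(value) for (value,) in rows]


def sum_query(conn, path):
    """Arithmetic query: sum of the numeric values at `path`."""
    (total,) = conn.execute(
        "SELECT SUM(CAST(value AS REAL)) FROM text_node "
        "WHERE path = ? AND dtype = 'num'",
        (path,),
    ).fetchone()
    return total


conn = build_index(XML_DOC)
print(match_query(conn, "Query"))
print(compare_query(conn, "/books/book/price", 20))
print(sum_query(conn, "/books/book/price"))
```

Without the type tag, a condition such as `price > 20` would have to fall back on string comparison or on re-parsing values at query time. Keeping the tag in the index lets the relational side filter to numeric rows first, which is one plausible reading of how the tag enables arithmetic and comparison queries.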