To read this content please select one of the options below:

Fast incremental indexing with effective and efficient searching in XML element retrieval

Atsushi Keyaki (Information Science, Nara Institute of Science and Technology, Nara, Japan)
Jun Miyazaki (Information Science, Nara Institute of Science and Technology, Nara, Japan)
Kenji Hatano (Faculty of Culture and Information Science, Doshisha University, Kyoto, Japan)
Goshiro Yamamoto (Information Science, Nara Institute of Science and Technology, Nara, Japan)
Takafumi Taketomi (Information Science, Nara Institute of Science and Technology, Nara, Japan)
Hirokazu Kato (Information Science, Nara Institute of Science and Technology, Nara, Japan)

International Journal of Web Information Systems

ISSN: 1744-0084

Article publication date: 14 June 2013

212

Abstract

Purpose

The purpose of this paper is to propose methods for fast incremental indexing with effective and efficient query processing in XML element retrieval. The effectiveness of a search system becomes lower if document updates are not handled when these occur frequently on the Web. The search accuracy is also reduced if drastic changes in document statistics are not managed. However, existing studies of XML element retrieval do not consider document updates, although these studies have attained both effectiveness and efficiency in query processing. Thus, the authors add a function for handling document updates to the existing techniques for XML element retrieval.

Design/methodology/approach

Though it will be important to enable fast updates of indices, preliminary experiments have shown that a simple incremental update approach has two problems: some kinds of statistics are inaccurate, and it takes a long time to update indices. Therefore, two methods are proposed: one to approximate term weights accurately with a small number of documents, even for dynamically changing statistics; and the other to eliminate unnecessary update targets.

Findings

Experimental results show that this proposed system can update indices up to 32 per cent faster than the simple incremental updates while the search accuracy improved by 4 per cent compared with the simple approach. The proposed methods can also be fast and accurate in query processing, even if document statistics change drastically.

Originality/value

The paper shows that there could be a more practical XML element search engine, which can access the latest XML documents accurately and efficiently.

Keywords

Citation

Keyaki, A., Miyazaki, J., Hatano, K., Yamamoto, G., Taketomi, T. and Kato, H. (2013), "Fast incremental indexing with effective and efficient searching in XML element retrieval", International Journal of Web Information Systems, Vol. 9 No. 2, pp. 142-164. https://doi.org/10.1108/IJWIS-04-2013-0008

Publisher

:

Emerald Group Publishing Limited

Copyright © 2013, Emerald Group Publishing Limited

Related articles