An XML-enabled data extraction toolkit for web sources

L Liu, C Pu, W Han - Information Systems, 2001 - Elsevier
The amount of useful semi-structured data on the web continues to grow at a stunning pace.
Often interesting web data are not in database systems but in HTML pages, XML pages, or
text files. Data in these formats are not directly usable by standard SQL-like query
processing engines that support sophisticated querying and reporting beyond keyword-based
retrieval. Hence, web users and applications need a smart way of extracting data
from these web sources. One of the popular approaches is to write wrappers around the …