Half-Xor: A Fully-Dynamic Sketch for Estimating the Number of Distinct Values in Big Tables. Abstract: Calculating the number of distinct values (i.e., NDV) in a column of a big table is costly yet fundamental to a variety of database applications such as data compression and profiling.
Jan 29, 2024
Our Half-Xor sketch consists of a compact bit matrix and a small counter array, and it needs to set a few bits and update a counter when handling a data ...
Jan 29, 2024 · Calculating the number of distinct values (i.e., NDV) in a column of a big table is costly yet fundamental to a variety of database ...
Jul 30, 2024 · To solve this issue, we propose a novel sketch method, Half-Xor . Our Half-Xor sketch consists of a compact bit matrix and a small counter array ...
This repository includes our realization of Half-Xor, which can be used to estimate the number of distinct values (i.e. NDV) in a column of big tables when ...
Half-Xor: A Fully-Dynamic Sketch for Estimating the Number of Distinct Values in Big Tables. P Wang, D Xie, J Zhao, J Li, Z Li, R Li, Y Ren. IEEE Transactions ...
Half-Xor: A Fully-Dynamic Sketch for Estimating the Number of Distinct Values in Big Tables. IEEE Transactions on Knowledge and Data Engineering.
This paper proposes a novel sketch-based distributed method that achieves sub-linear communication costs for distributed sampling-based NDV estimation under ...
Half-Xor: A Fully-Dynamic Sketch for Estimating the Number of Distinct Values in Big Tables ... Our Half-Xor sketch consists of a compact bit matrix and a ...
Sep 4, 2024 · Half-Xor: A Fully-Dynamic Sketch for Estimating the Number of Distinct Values in Big Tables. IEEE Trans. Knowl. Data Eng. 36(7): 3111-3125 ...