PSH: A probabilistic signature hash method with hash neighborhood candidate generation for fast edit-distance string comparison on big data | IEEE Conference Publication | IEEE Xplore