Modeling for text compression

T Bell, IH Witten, JG Cleary - ACM Computing Surveys (CSUR), 1989 - dl.acm.org
Models are best formed adaptively, based on the text seen so far. This paper surveys
successful strategies for adaptive modeling that are suitable for use in practical text compression

Compression of deep learning models for text: A survey

M Gupta, P Agrawal - ACM Transactions on Knowledge Discovery from …, 2022 - dl.acm.org
… of distributed training of the models. Fortunately, there are a … communities on compression
of large deep learning models. In … organization of model compression methods for text. In this …

Using compression-based language models for text categorization

WJ Teahan, DJ Harper - Language modeling for information retrieval, 2003 - Springer
… We wish to use language models developed for text compression as the basis of a text
The motivation for using these models is that they are weH grounded in information theory, and …

[PDF][PDF] Text categorization using compression models

E Frank, C Chui, IH Witten - 2000 - researchcommons.waikato.ac.nz
… Then, given a test document (different from the training documents), we compress it according
to each model and calculate the gain in per-symbol compression obtained by using M a …

[HTML][HTML] Large text compression benchmark

M Mahoney - 2011 - mattmahoney.net
… A fundamental problem in both NLP and text compression is modeling: the ability to
distinguish between high probability strings like recognize speech and low probability strings like …

[PDF][PDF] Fast Text Compression with Neural Networks.

MV Mahoney - FLAIRS, 2000 - cdn.aaai.org
… level n-gram models now in use, but have usually been avoided because they are too slow
to be practical. We introduce a model that produces better compression than popular Limpel-…

The zero-frequency problem: Estimating the probabilities of novel events in adaptive text compression

IH Witten, TC Bell - Ieee transactions on information theory, 1991 - ieeexplore.ieee.org
… taken to the problem in adaptive text compression. Although several methods have been …
a well-founded model. We propose the application of a Poisson process model of novelty. Its …

Model compression

C Buciluǎ, R Caruana, A Niculescu-Mizil - Proceedings of the 12th ACM …, 2006 - dl.acm.org
compress the function that is learned by a complex model into a much smaller, faster model
… The key difficulty when compressing complex ensembles into simpler models this way is the …

A compression-based toolkit for modelling and processing natural language text

WJ Teahan - Information, 2018 - mdpi.com
… As an alternative, we wish to adopt language models based on well performing text compression
techniques such as PPM which have already been found to be highly effective in many …

Context modeling for text compression

DS Hirschberg, DA Lelewer - Image and Text Compression, 1992 - Springer
… -l model did not provide satisfactory compression performance and the order-2-1-and-0
model produces compression … The order-2-and-0 model allows faster encoding and decoding …