Lossless Text Image Compression using Two Dimensional Run Length Encoding

DOI:

https://doi.org/10.15575/join.v4i2.330

Keywords:

Lossless compression, Huffman Encoding, Run-length Encoding, Text Image

Abstract

Text images are used in many types of conventional data communication where text is not represented directly by digital characters such as ASCII but by an image, for instance facsimile files or scanned documents. We propose a combination of Run Length Encoding (RLE) and Huffman coding for two-dimensional binary image compression, which we call 2DRLE. First, the rows of the image are read sequentially; each run of consecutive identical rows is kept once, and the number of occurrences is stored. Second, the same procedure is applied column-wise to the image produced by the first stage, yielding an image with no consecutive repeated rows or columns. The image from this last stage is then compressed using Huffman coding. The experiment shows that 2DRLE achieves a higher compression ratio than conventional Huffman coding of the image, reaching a compression ratio of more than 8:1 without any distortion.
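
The two run-length stages described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names (collapse_rows, collapse_2d, expand_2d) and the list-of-lists image representation are assumptions made for illustration, and the final Huffman coding stage is omitted.

```python
from typing import List, Tuple

Image = List[List[int]]  # assumed representation: rows of 0/1 pixels


def collapse_rows(image: Image) -> Tuple[Image, List[int]]:
    """Keep each run of identical consecutive rows once, with its run length."""
    kept: Image = []
    counts: List[int] = []
    for row in image:
        if kept and kept[-1] == row:
            counts[-1] += 1          # same as previous row: extend the run
        else:
            kept.append(list(row))   # new row pattern: start a new run
            counts.append(1)
    return kept, counts


def transpose(image: Image) -> Image:
    """Swap rows and columns so columns can be processed as rows."""
    return [list(col) for col in zip(*image)]


def collapse_2d(image: Image) -> Tuple[Image, List[int], List[int]]:
    """Stage 1: collapse repeated rows. Stage 2: collapse repeated columns."""
    rows, row_counts = collapse_rows(image)
    cols, col_counts = collapse_rows(transpose(rows))
    return transpose(cols), row_counts, col_counts


def expand_2d(core: Image, row_counts: List[int], col_counts: List[int]) -> Image:
    """Invert collapse_2d: repeat columns, then rows, by the stored counts."""
    wide = [[p for p, n in zip(row, col_counts) for _ in range(n)] for row in core]
    return [list(row) for row, n in zip(wide, row_counts) for _ in range(n)]


image = [
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [1, 1, 0, 0],
]
core, row_counts, col_counts = collapse_2d(image)
restored = expand_2d(core, row_counts, col_counts)
```

In this sketch the reduced image and the two count lists together are enough to rebuild the input exactly, which is what makes the scheme lossless; in the method described above, the reduced image would then be passed to a Huffman coder.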

Published

2020-02-14

Section

Article