Hybrid Lossless Compression Techniques for English Text

Authors

  • Jannat Tariq, Electrical Engineering Technical College, Middle Technical University, Baghdad, Iraq
  • Mahmood F. Mosleh, Electrical Engineering Technical College, Middle Technical University, Baghdad, Iraq
  • Maha Abdulameer, Middle Technical University, Baghdad, Iraq
  • Huthaifa A. Obeidat, Jerash University, Jerash, Jordan
  • Omar A. Obeidat, Wayne State University, Detroit, Michigan, MI 48202, USA

DOI:

https://doi.org/10.51173/jt.v5i1.1059

Keywords:

Compression, LCT, CR, SR

Abstract

The demand for data transfer and storage keeps growing, and data sent in its original form takes a long time to transmit and receive. Compression is therefore important for digital communication systems, where it plays an important role in reducing complexity and power requirements. The goal of lossless compression is to reduce file size without compromising the quality of the information, which saves storage capacity and reduces the bandwidth a communication system requires. This paper proposes a hybrid system that concatenates two lossless techniques, Huffman coding and LZ4, to improve on the traditional techniques. The results show that the proposed combination reduces file size significantly, achieving an average saving ratio (SR) between 73.649% and 79.708%. Such a scheme offers a credible and cost-effective lossless encoding approach for electronic communication systems.
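To make the reported metric concrete, the sketch below computes a compression ratio (CR) and a saving ratio (SR) for a Huffman-coded payload. It is an illustration only, not the authors' implementation. The abstract does not define SR, so the common definition SR = (1 − compressed size / original size) × 100 is assumed here, along with CR = original size / compressed size. The function names are hypothetical, the size of the Huffman code table is ignored, and the LZ4 stage is left out because LZ4 is not part of the Python standard library.

```python
import heapq
from collections import Counter


def huffman_code_lengths(data: bytes) -> dict[int, int]:
    """Return the Huffman code length (in bits) for each byte value in data."""
    freq = Counter(data)
    if not freq:
        return {}
    if len(freq) == 1:
        # A single distinct symbol still needs one bit per occurrence.
        return {next(iter(freq)): 1}
    # Heap entries: (weight, unique tie-breaker, {symbol: depth so far}).
    heap = [(w, i, {sym: 0}) for i, (sym, w) in enumerate(freq.items())]
    heapq.heapify(heap)
    tie = len(heap)
    while len(heap) > 1:
        w1, _, left = heapq.heappop(heap)
        w2, _, right = heapq.heappop(heap)
        # Merging two subtrees pushes every contained symbol one level deeper.
        merged = {s: d + 1 for s, d in left.items()}
        merged.update({s: d + 1 for s, d in right.items()})
        heapq.heappush(heap, (w1 + w2, tie, merged))
        tie += 1
    return heap[0][2]


def huffman_payload_bytes(data: bytes) -> int:
    """Size of the Huffman-coded payload in bytes (code table NOT included)."""
    lengths = huffman_code_lengths(data)
    freq = Counter(data)
    bits = sum(freq[s] * lengths[s] for s in freq)
    return (bits + 7) // 8  # round up to whole bytes


def compression_ratio(original_size: int, compressed_size: int) -> float:
    """Assumed definition: CR = original size / compressed size."""
    return original_size / compressed_size


def saving_ratio(original_size: int, compressed_size: int) -> float:
    """Assumed definition: SR = (1 - compressed / original) * 100, in percent."""
    return (1 - compressed_size / original_size) * 100


if __name__ == "__main__":
    text = b"this is an example of english text for huffman coding"
    packed = huffman_payload_bytes(text)
    print(f"original={len(text)} B, huffman payload={packed} B")
    print(f"CR={compression_ratio(len(text), packed):.3f}")
    print(f"SR={saving_ratio(len(text), packed):.3f} %")
```

Under this assumed definition, an SR of 75% means the compressed file is one quarter of the original size. A full hybrid would pass the data through both a Huffman coder and an LZ4 codec, and measure SR on the final output.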

Author Biographies

Mahmood F. Mosleh, Electrical Engineering Technical College, Middle Technical University, Baghdad, Iraq.

Department of Computer Engineering Techniques

Maha Abdulameer, Middle Technical University, Baghdad, Iraq

Electrical Engineering Technical College

Huthaifa A. Obeidat, Jerash University, Jerash, Jordan

Department of Communications and Electronics Engineering

Omar A. Obeidat, Wayne State University, Detroit, Michigan, MI 48202, USA

College of Engineering

Figure: General block diagram of a data compression (DC) procedure

Published

2023-04-01

How to Cite

Jannat Tariq, Mahmood F. Mosleh, Maha Abdulameer, Huthaifa A. Obeidat, & Omar A. Obeidat. (2023). Hybrid Lossless Compression Techniques for English Text. Journal of Techniques, 5(1), 52–57. https://doi.org/10.51173/jt.v5i1.1059

Section

Engineering