Computer Science ›› 2021, Vol. 48 ›› Issue (3): 1-8.doi: 10.11896/jsjkx.201100134

Special Issue: Advances on Multimedia Technology

• Advances on Multimedia Technology • Previous Articles     Next Articles

Advances in End-to-End Optimized Image Compression Technologies

LIU Dong, WANG Ye-fei, LIN Jian-ping, MA Hai-chuan, YANG Run-yu   

  1. Department of Electronic Engineering and Information Science,University of Science and Technology of China,Hefei 230027,China
  • Received:2020-11-19 Revised:2020-12-02 Online:2021-03-15 Published:2021-03-05
  • About author:LIU Dong,born in 1983,Ph.D,professor,is a senior member of China Computer Federation.His main research interests include multimedia signal processing and so on.
  • Supported by:
    National Natural Science Foundation of China (61772483).

Abstract: Image compression is the application of data compression technologies on digital images,aiming to reduce redundancy in image data,so as to store and transmit data with a more efficient format.In traditional image compression methods,image compression is divided into several steps,such as prediction,transform,quantization and entropy coding,and each step is optimized by manually designed algorithm separately.In recent years,end-to-end image compression methods based on deep neural networks have achieved fruitful results.Compared with the traditional methods,end-to-end image compression can be optimized jointly,which often achieves higher compression efficiency than the traditional methods.In this paper,the end-to-end image compression methods and network structures are introduced,and the key technologies of end-to-end image compression are described,including quantization technology,probability modeling and entropy coding technology,as well as encoder-side bit allocation technology.Then it introduces the research of extended applications of end-to-end image compression,including scalable coding,variable bit rate compression,visual perception and machine perception oriented compression.Finally,the compression efficiency of end-to-end image compression is compared with the traditional methods,and the compression performance is demonstrated.Experimental results show that the compression efficiency of the state-of-the-art end-to-end image compression method is much higher than that of the traditional image coding methods including JPEG,JPEG2000 and HEVC intra.Compared with the newest coding standard VVC intra,the end-to-end image compression method can save up to 48.40% of the coding rate while maintain the same MS-SSIM.

Key words: Compression efficiency, Deep neural network, End-to-end optimization, HEVC, Image compression, JPEG, JPEG2000, VVC

CLC Number: 

  • TN919.81
[1]WALLACE G K.The JPEG still picture compression standard[J].IEEE Transactions on Consumer Electronics,1992,38(1):18-34.
[2]RABBANI M.JPEG2000:Image compression fundamentals,standards and practice[J].Journal of Electronic Imaging,2002,11(2):286.
[3]BPG Image format [CP/OL].https://bellard.org/bpg,2015.
[4]SULLIVAN G J,OHM J R,HAN W J,et al.Overview of the high efficiency video coding (HEVC) standard[J].IEEE Transactions on Circuits and Systems for Video Technology,2012,22(12):1649-1668.
[5]Versatile video coding reference software version 7.1 (VTM-7.1) [CP/OL].https://vcgit.hhi.fraunhofer.de/jvet/VVCSoftware_VTM/tags/VTM-7.1.
[6]TODERICI G,O’MALLEY S M,HWANG S J,et al.Variable rate image compression with recurrent neural networks[J].ar-Xiv:1511.06085,2015.
[7]HUFFMAN D A.A method for the construction of minimum-redundancy codes[J].Proceedings of the IRE,1952,40(9):1098-1101.
[8]MARPE D,SCHWARZ H,WIEGAND T.Context-based adaptive binary arithmetic coding in the H.264/AVC video compression standard[J].IEEE Transactions on Circuits and Systems for Video Technology,2003,13(7):620-636.
[9]LEE J,CHO S,KIM M.An end-to-end joint learning scheme of image compression and quality enhancement with improved entropy minimization[J].arXiv:1912.12817,2019.
[10]HINTON G E,SALAKHUTDINOV R R.Reducing the dimensionality of data with neural networks[J].Science,2006,313(5786):504-507.
[11]TODERICI G,VINCENT D,JOHNSTON N,et al.Full resolution image compression with recurrent neural networks[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2017:5306-5314.
[12]COVER T M,THOMAS J A.Elements of information theory[M].New York,US:John Wiley & Sons,1991.
[13]BALLÉ J,LAPARRA V,SIMONCELLI E P.End-to-end optimized image compression[J].arXiv:1611.01704,2016.
[14]MINNEN D,BALLÉ J,TODERICI G D.Joint autoregressiveand hierarchical priors for learned image compression[C]//Advances in Neural Information Processing Systems.2018:10771-10780.
[15]BALLÉ J,MINNEN D,SINGH S,et al.Variational image compression with a scale hyperprior[J].arXiv:1802.01436,2018.
[16]JOHNSTON N,VINCENT D,MINNEN D,et al.Improvedlossy image compression with priming and spatially adaptive bit rates for recurrent networks[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:4385-4393.
[17]BALLÉ J,LAPARRA V,SIMONCELLI E P.End-to-end optimization of nonlinear transform codes for perceptual quality[C]//2016 Picture Coding Symposium (PCS).IEEE,2016:1-5.
[18]LEE J,CHO S,BEACK S K.Context-adaptive entropy model for end-to-end optimized image compression[J].arXiv:1809.10452,2018.
[19]LI M,ZHANG K,ZUO W,et al.Learning context-based non-local entropy modeling for image compression[J].arXiv:2005.04661,2020.
[20]MA H C,LIU D,YAN N,et al.End-to-end optimized versatile image compression with wavelet-like transform[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2020(99).
[21]LIU J,LU G,HU Z,et al.A unified end-to-end framework for efficient deep image compression[J].arXiv:2002.03370,2020.
[22]ZHONG Z,AKUTSU H,AIZAWA K.Channel-level variablequantization network for deep image compression[J].arXiv:2007.12619,2020.
[23]LI M,ZUO W,GU S,et al.Learning convolutional networks for content-weighted image compression[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:3214-3223.
[24]RIPPEL O,BOURDEV L.Real-time adaptive image compression[J].arXiv:1705.05823,2017.
[25]CHEN L H,BAMPIS C G,LI Z,et al.Perceptually optimizing deep image compression[J].arXiv:2007.02711,2020.
[26]LEE J,KIM D,KIM Y,et al.A training method for image compression networks to improve perceptual quality of reconstructions[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops.2020:144-145.
[27]HU Y,YANG S,YANG W,et al.Towards coding for human and machine vision:A scalable image coding approach[C]//2020 IEEE International Conference on Multimedia and Expo (ICME).IEEE,2020:1-6.
[28]DUAN S,CHEN H,GU J.JPAD-SE:High-level semantics for joint perception-accuracy-distortion enhancement in image compression[J].arXiv:2005.12810,2020.
[29]GUO Z,ZHANG Z,CHEN Z.Deep scalable image compression via hierarchical feature decorrelation[C]//2019 Picture Coding Symposium (PCS).IEEE,2019:1-5.
[30]AKBARI M,LIANG J,HAN J,et al.Learned variable-rate image compression with residual divisive normalization[C]//2020 IEEE International Conference on Multimedia and Expo (ICME).IEEE,2020:1-6.
[31]CHOI Y,EL-KHAMY M,LEE J.Variable rate deep imagecompression with a conditional autoencoder[C]//Proceedings of the IEEE International Conference on Computer Vision.2019:3146-3154.
[32]YANG F,HERRANZ L,VAN DE WEIJER J,et al.Variable rate deep image compression with modulated autoencoder[J].IEEE Signal Processing Letters,2020,27:331-335.
[33]GUO T,WANG J,CUI Z,et al.Variable rate image compression with content adaptive optimization[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops.2020:122-123.
[34]CHRISTOPOULOS C,ASKELOF J,LARSSON M.Efficientmethods for encoding regions of interest in the upcoming JPEG2000 still image coding standard[J].IEEE Signal Processing Letters,2000,7(9):247-249.
[35]MENTZER F,AGUSTSSON E,TSCHANNEN M,et al.Conditional probability models for deep image compression[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:4394-4402.
[36]ALEXANDRE D,CHANG C P,PENG W H,et al.Learned image compression with soft bit-based rate-distortion optimization[C]//2019 IEEE International Conference on Image Processing (ICIP).IEEE,2019:1715-1719.
[37]LI M,ZUO W M,GU S H,et al.Learning content-weighteddeep image compression[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2020,PP(99).
[38]CAMPOS J,MEIERHANS S,DJELOUAH A,et al.Contentadaptive optimization for neural image compression[J].arXiv:1906.01223,2019.
[39]CUI Z,WANG J,BAI B,et al.G-VAE:A continuously variable rate deep image compression framework[J].arXiv:2003.02012,2020.
[40]AKBARI M,LIANG J,HAN J,et al.Learned variable-rate ima-ge compression with residual divisive normalization[C]//2020 IEEE International Conference on Multimedia and Expo (ICME).IEEE,2020:1-6.
[41]CHEN T,MA Z.Variable bitrate image compression with quality scaling factors[C]//2020 IEEE International Conference on Acoustics,Speech and Signal Processing (ICASSP).IEEE,2020:2163-2167.
[42]ZHOU J,NAKAGAWA A,KATO K,et al.Variable rate image compression method with dead-zone quantizer[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops.2020:162-163.
[43]CAI J,ZHANG L.Deep image compression with iterative non-uniform quantization[C]//2018 25th IEEE International Conference on Image Processing (ICIP).IEEE,2018:451-455.
[44]CHENG Z,SUN H,KATTO J.Low bitrate image compression with discretized Gaussian mixture likelihoods[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops.2020:126-127.
[45]CHENG Z,SUN H,TAKEUCHI M,et al.Learned image compression with discretized Gaussian mixture likelihoods and attention modules[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2020:7939-7948.
[46]WEN S,ZHOU J,NAKAGAWA A,et al.Variational autoencoder based image compression with pyramidal features and context entropy model[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops.2019:1-4.
[47]LADUNE T,PHILIPPE P,HAMIDOUCHE W,et al.Binaryprobability model for learning based image compression[C]//2020 IEEE International Conference on Acoustics,Speech and Signal Processing (ICASSP).IEEE,2020:2168-2172.
[48]GUO Z,WU Y,FENG R,et al.3-D context entropy model for improved practical image compression[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops.2020:116-117.
[49]LIU H,CHEN T,GUO P,et al.Gated context model with embedded priors for deep image compression[J].arXiv:1902.10480,2019.
[50]LIU H,CHEN T,GUO P,et al.Non-local attention optimized deep image compression[J].arXiv:1904.09757,2019.
[51]LI M,MA K,YOU J,et al.Efficient and effective context-based convolutional entropy modeling for image compression[J].IEEE Transactions on Image Processing,2020,29:5900-5911.
[52]HU Y,YANG W,LIU J.Coarse-to-fine hyper-prior modeling for learned image compression[C]//AAAI.2020:11013-11020.
[53]MINNEN D,SINGH S.Channel-wise autoregressive entropymodels for learned image compression[J].arXiv:2007.08739,2020.
[54]LIU H,CHEN T,SHEN Q,et al.Practical stacked non-local attention modules for image compression[C]//CVPR Workshops.2019:1-4.
[55]CHEN T,LIU H,MA Z,et al.Neural image compression via non-local attention optimization and improved context modeling[J].arXiv:1910.06244,2019.
[56]WU L,HUANG K,SHEN H.A GAN-based tunable imagecompression system[C]//The IEEE Winter Conference on Applications of Computer Vision.2020:2334-2342.
[57]AKUTSU H,NARUKO T.End-to-end learned ROI image compression[C]//CVPR Workshops.2019:1-5.
[58]AKUTSU H,SUZUKI A,ZHONG Z,et al.Ultra low bitrate learned image compression by selective detail decoding[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops.2020:118-119.
[59]CAI C,CHEN L,ZHANG X,et al.End-to-end optimized ROI image compression[J].IEEE Transactions on Image Processing,2019,29:3442-3457.
[60]WANG C,HAN Y,WANG W.An end-to-end deep learning image compression framework based on semantic analysis[J].Applied Sciences,2019,9(17):3580.
[61]XIA Q,LIU H,MA Z.Object-based image coding:A learning-driven revisit[C]//2020 IEEE International Conference on Multimedia and Expo (ICME).IEEE,2020:1-6.
[62]CHEN L C,PAPANDREOU G,KOKKINOS I,et al.DeepLab:Semantic image segmentation with deep convolutional nets,atrous convolution,and fully connected CRFs[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2017,40(4):834-848.
[63]JIA C,LIU Z,WANG Y,et al.Layered image compression using scalable auto-encoder[C]//2019 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR).IEEE,2019:431-436.
[64]ZHANG Z,CHEN Z,LIN J,et al.Learned scalable image compression with bidirectional context disentanglement network[C]//2019 IEEE International Conference on Multimedia and Expo (ICME).IEEE,2019:1438-1443.
[65]AGUSTSSON E,TSCHANNEN M,MENTZER F,et al.Gen-erative adversarial networks for extreme learned image compression[C]//Proceedings of the IEEE International Conference on Computer Vision.2019:221-231.
[66]DASH S,KUMARAVELU G,NAGANOOR V,et al.Com-pressNet:Generative compression at extremely low bitrates[C]//2020 IEEE Winter Conference on Applications of Computer Vision (WACV).IEEE,2020:2314-2322.
[67]PATEL Y,APPALARAJU S,MANMATHA R.Deep perceptual compression[J].arXiv:1907.08310,2019.
[68]KUDO S,ORIHASHI S,TANIDA R,et al.GAN-based image compression using mutual information maximizing regularization[C]//2019 Picture Coding Symposium (PCS).IEEE,2019:1-5.
[69]MENTZER F,TODERICI G,TSCHANNEN M,et al.High-fidelity generative image compression[J].arXiv:2006.09965,2020.
[70]BLAU Y,MICHAELI T.Rethinking lossy compression:Therate-distortion-perception tradeoff[J].arXiv:1901.07821,2019.
[71]LUO J,LI S,DAI W,et al.Noise-to-compression variational autoencoder for efficient end-to-end optimized image coding[C]//2020 Data Compression Conference (DCC).IEEE,2020:33-42.
[72]KODAK E.Kodak lossless true color image suite (1993) [DB/OL].http://r0k.us/graphics/kodak/.
[73]ASUNI N,GIACHETTI A.TESTIMAGES:A large-scale ar-chive for testing visual devices and basic image processing algorithms[C]//Smart Tools and Apps for Graphics - Eurographics Italian Chapter Conference.2014:63-70.
[74]WANG Z,SIMONCELLI E P,BOVIK A C.Multiscale structural similarity for image quality assessment[C]//The Thrity-Seventh Asilomar Conference on Signals,Systems & Computers,2003.IEEE,2003,2:1398-1402.
[1] WEI Hui, CHEN Ze-mao, ZHANG Li-qiang. Anomaly Detection Framework of System Call Trace Based on Sequence and Frequency Patterns [J]. Computer Science, 2022, 49(6): 350-355.
[2] JIAO Xiang, WEI Xiang-lin, XUE Yu, WANG Chao, DUAN Qiang. Automatic Modulation Recognition Based on Deep Learning [J]. Computer Science, 2022, 49(5): 266-278.
[3] GAO Jie, LIU Sha, HUANG Ze-qiang, ZHENG Tian-yu, LIU Xin, QI Feng-bin. Deep Neural Network Operator Acceleration Library Optimization Based on Domestic Many-core Processor [J]. Computer Science, 2022, 49(5): 355-362.
[4] FAN Hong-jie, LI Xue-dong, YE Song-tao. Aided Disease Diagnosis Method for EMR Semantic Analysis [J]. Computer Science, 2022, 49(1): 153-158.
[5] ZHOU Xin, LIU Shuo-di, PAN Wei, CHEN Yuan-yuan. Vehicle Color Recognition in Natural Traffic Scene [J]. Computer Science, 2021, 48(6A): 15-20.
[6] XU Yi-fei, XIONG Shu-hua, SUN Wei-heng, HE Xiao-hai, CHEN Hong-gang. HEVC Post-processing Algorithm Based on Non-local Low-rank and Adaptive Quantization Constraint Prior [J]. Computer Science, 2021, 48(5): 155-162.
[7] MA Lin, WANG Yun-xiao, ZHAO Li-na, HAN Xing-wang, NI Jin-chao, ZHANG Jie. Network Intrusion Detection System Based on Multi-model Ensemble [J]. Computer Science, 2021, 48(11A): 592-596.
[8] PAN Yu, ZOU Jun-hua, WANG Shuai-hui, HU Gu-yu, PAN Zhi-song. Deep Community Detection Algorithm Based on Network Representation Learning [J]. Computer Science, 2021, 48(11A): 198-203.
[9] LIU Tian-xing, LI Wei, XU Zheng, ZHANG Li-hua, QI Xiao-ya, GAN Zhong-xue. Monte Carlo Tree Search for High-dimensional Continuous Control Space [J]. Computer Science, 2021, 48(10): 30-36.
[10] ZHANG Yan-mei, LOU Yin-cheng. Deep Neural Network Based Ponzi Scheme Contract Detection Method [J]. Computer Science, 2021, 48(1): 273-279.
[11] DING Zi-ang, LE Cao-wei, WU Ling-ling and FU Ming-lei. PM2.5 Concentration Prediction Method Based on CEEMD-Pearson and Deep LSTM Hybrid Model [J]. Computer Science, 2020, 47(6A): 444-449.
[12] SHANG Jin-yue, BI Xiu-li, XIAO Bin, LI Wei-sheng. Image Forgery Detection Based on DCT Coefficients Hashing [J]. Computer Science, 2020, 47(6): 310-315.
[13] SHANG Jun-yuan, YANG Le-han, HE Kun. Analyzing Latent Representation of Deep Neural Networks Based on Feature Visualization [J]. Computer Science, 2020, 47(5): 190-197.
[14] TANG Guo-qiang,GAO Da-qi,RUAN Tong,YE Qi,WANG Qi. Clinical Electronic Medical Record Named Entity Recognition Incorporating Language Model and Attention Mechanism [J]. Computer Science, 2020, 47(3): 211-216.
[15] CAI Yu-han,XIONG Shu-hua,SUN Wei-heng,Karn PRADEEP,HE Xiao-hai. Video Compression Algorithm Combining Frame Rate Up-conversion with HEVC Standard Based on Motion Vector Refinement [J]. Computer Science, 2020, 47(2): 76-82.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!