基于卷积神经网络的虚拟现实视频帧内预测编码

doi:10.11896/jsjkx.211100179

Computer Science, 2022, Vol. 49, Issue (7): 127-131. doi: 10.11896/jsjkx.211100179

• Computer Graphics & Multimedia •

Virtual Reality Video Intraframe Prediction Coding Based on Convolutional Neural Network

LIU Yue-hong¹, NIU Shao-hua², SHEN Xian-hao¹

1. College of Information Science and Engineering, Guilin University of Technology, Guilin, Guangxi 541004, China
2. School of Mechanical and Electrical Engineering, Beijing Institute of Technology, Beijing 100081, China

Received: 2021-11-17 Revised: 2022-03-15 Online: 2022-07-15 Published: 2022-07-12
About author: LIU Yue-hong, born in 1980, master. Her main research interests include fiber optic communication, intelligent hardware, and virtual reality.
SHEN Xian-hao, born in 1980, Ph.D., professor. His main research interests include deep learning and virtual testing.
Supported by:
National Natural Science Foundation of China (61961010), Science Foundation of Guangxi Province (2018GXNSFBA050029, 2020GXNSFAA297255) and Guangxi Science and Technology Major Special Project (Gui Ke AA19046004).

Abstract

Abstract: To improve the performance of intraframe prediction coding for virtual reality video, a convolutional neural network (CNN) is used to select the coding units (CUs) of each video frame, which reduces the complexity of video image coding. First, quantization parameters are set to obtain virtual reality video frame samples. The image coding tree is then constructed, and a CNN-based optimization model for the frame coding units is established. The luminance of the frame samples is taken as the CNN input and, combined with a threshold on the image rate-distortion cost, the optimized coding unit partition is obtained through training. With this CNN training, a coding tree unit (CTU) structure with different depths and an appropriate number of CUs can be obtained according to the intraframe coding requirements of image regions with different textures. Experiments show that, with a suitable choice of convolution kernel size and quantization parameters, the CNN algorithm achieves better image quality and shorter coding time than common video intraframe prediction coding algorithms.
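The idea of a CTU whose depth adapts to local texture can be illustrated with a minimal sketch. It is not the authors' model: the trained CNN decision is replaced here by a hypothetical variance test on the luminance block (`should_split`), and the 64x64 CTU size, 8x8 minimum CU size, and threshold value are assumptions chosen for illustration only.

```python
from statistics import pvariance

CTU_SIZE = 64     # coding tree unit size (assumption)
MIN_CU_SIZE = 8   # smallest allowed coding unit (assumption)


def should_split(block, threshold):
    """Hypothetical stand-in for the trained CNN decision.

    Returns True when the luminance variance of the block exceeds
    `threshold`, i.e. the block is textured enough to justify smaller CUs.
    """
    pixels = [p for row in block for p in row]
    return pvariance(pixels) > threshold


def partition(luma, x, y, size, threshold, depth=0):
    """Recursively partition a square luminance region into CUs.

    Returns a list of (x, y, size, depth) tuples, one per leaf CU.
    """
    block = [row[x:x + size] for row in luma[y:y + size]]
    if size > MIN_CU_SIZE and should_split(block, threshold):
        half = size // 2
        leaves = []
        for dy in (0, half):
            for dx in (0, half):
                leaves += partition(luma, x + dx, y + dy, half,
                                    threshold, depth + 1)
        return leaves
    return [(x, y, size, depth)]


# Synthetic 64x64 luminance block: flat gray everywhere except a
# checkerboard-textured 16x16 corner at the top left.
luma = [[128] * CTU_SIZE for _ in range(CTU_SIZE)]
for yy in range(16):
    for xx in range(16):
        luma[yy][xx] = 255 if (xx + yy) % 2 else 0

cus = partition(luma, 0, 0, CTU_SIZE, threshold=100.0)
for cu in cus:
    print(cu)
```

On this synthetic input the flat regions stay as large CUs at shallow depth, while the textured corner is divided down to the minimum CU size. In the method described in the abstract, the split decision would instead come from the trained CNN together with the rate-distortion cost threshold.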

Key words: Coding unit, Convolution kernel size, Convolutional neural network, Intraframe coding, Virtual reality

CLC Number:

TP317.4

LIU Yue-hong, NIU Shao-hua, SHEN Xian-hao. Virtual Reality Video Intraframe Prediction Coding Based on Convolutional Neural Network [J]. Computer Science, 2022, 49(7): 127-131.

