Computer Science ›› 2021, Vol. 48 ›› Issue (5): 170-176.doi: 10.11896/jsjkx.210100104

• Computer Graphics & Multimedia • Previous Articles     Next Articles

Motion-estimation Based Space-temporal Feature Aggregation Network for Multi-frames Rain Removal

MENG Xiang-yu1, XUE Xin-wei1,2, LI Wen-lin1, WANG Yi1,2   

  1. 1 DUT-RU International School of Information Science & Engineering of Dalian University of Technology,Dalian,Liaoning 116621,China
    2 Key Laboratory for Ubiquitous Network and Service Software of Liaoning Province,Dalian,Liaoning 116621,China
  • Received:2021-01-13 Revised:2021-03-30 Online:2021-05-15 Published:2021-05-09
  • About author:MENG Xiang-yu,born in 1998,postgraduate.His main research interests include computer vision and image processing.(lnmengxiangyu@mail.dlut.edu.cn)
    XUE Xin-wei,born in 1984,Ph.D,lecturer,graduate supervisor,is a member of China Computer Federation.Her main research interests include machine learning and computer vision.
  • Supported by:
    National Natural Science Foundation of China(61806036, 61976037) and Fundamental Research Funds for the Central Universities(DUT19TD19).

Abstract: Outdoor videos obtained under rainy weather cause visual quality degradation,which affects the processing effects of visual tasks such as object recognition and tracking.In order to enhance the quality of video and complete the effective recovery of the details in the motion video,many methods have been proposed in video rain removal.At this stage,most of the video rain removal methods based on convolutional neural networks employ single-frame enhancement and multi-frame fusion to remove rain.But the movement of some pixels between adjacent frames with direct enhancement is difficult to be completed in the temporal dimension.And the manner cannot effectively achieve end-to-end training,making the final result still relatively blurry and many detailed information losses.In order to effectively solve the above problems,this paper proposes a multi-frame fusion rain removal network based on the combination of motion estimation and space-temporal feature aggregation,ME-Derain for short.First,the optical flow estimation method is used to establish a reference frame to complete the alignment of adjacent frames,and then an encoder-decoder structure is introduced.The convolutional neural network connected by the residual connection and the time-related attention enhancement mechanism together form a multi-frame fusion network.Finally,the enhancement module related to the spatial sequence is used to obtain the rain removal video.A large number of experiments on different data sets show that the proposed method is better than most common methods at this stage and can obtain better rain removal effect.

Key words: Convolutional neural network, Optical flow, Video enhancement, Video rain removal

CLC Number: 

  • TP391
[1]YANG W,TAN R T,WANG S,et al.Single image deraining:From model-based to data-driven and beyond[J].arXiv:1912.07150,2019.
[2]WANG H,WU Y,LI M,et al.A survey on rain removal fromvideo and single image[J].arXiv:1909.08326,2019.
[3]BREWER N,LIU N.Using the shape characteristics of rain toidentify and remove rain from video[C]//Joint IAPR International Workshops on Statistical Techniques in Pattern Recognition (SPR) and Structural and Syntactic Pattern Recognition (SSPR).Berlin,Heidelberg:Springer,2008:451-458.
[4]LIU J,YANG W,YANG S,et al.Erase or Fill? Deep Joint Recurrent Rain Removal and Reconstruction in Videos[C]//Proceedings of 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.Salt Lake City:IEEE Press,2018:3233-3242.
[5]SU S,DELBRACIO M,WANG J,et al.Deep Video Deblurring for Hand-Held Cameras[C]//Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.Honolulu:IEEE Press,2017:1279-1288.
[6]LUO Y,XU Y,JI H.Removing Rain from a Single Image via Discriminative Sparse Coding[C]//Proceedings of IEEE International Conference on Computer Vision.Santiago:IEEE Press,2015:3397-3405.
[7]YANG W,TAN R T,FENG J,et al.Deep Joint Rain Detection and Removal from a Single Image[C]//Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.Honolulu:IEEE Press,2017:1685-1694.
[8]ZHANG H,PATEL V M.Density-aware single image de-rai-ning using a multi-stream dense network[C]//Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.Salt Lake City:IEEE Press,2018:695-704.
[9]FU X Y,SUN Q,HUANG Y,et al.Single Image De-raining Method Based on Deep Adjacently Connected Networks [J].Computer Science,2020,47(2):106-111.
[10]WANG M H,HE H J,LI C.Single image rain removal based on selective kernel convolution using a residual refine factor [J].Journal of Image and Graphics,2020,25(12):2484-2493.
[11]YOU S,TAN R,KAWAKAMI R,et al.Adherent RaindropModeling,Detection and Removal in Video[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2015,38(9):1721-1733.
[12]LI M,XIE Q,ZHAO Q,et al.Video rain streak removal by multi-scale convolutional sparse coding [C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.Salt Lake City:IEEE Press,2018:6644-6653.
[13]KIM J H,LEE C,SIM J Y,et al.Single-image deraining using an adaptive nonlocal means filter[C]//Proceedings of IEEE International Conference on Image Processing.Paris:IEEE Press,2014:914-917.
[14]DOSOVITSKIY A,FISCHER P,ILG E,et al.FlowNet:Lear-ning Optical Flow with Convolutional Networks[C]//Procee-dings of IEEE International Conference on Computer Vision.Santiago:IEEE Press,2015:2758-2766.
[15]JADERBERG M,SIMONYAN K,ZISSERMAN A,et al.Spatial transformer networks [C]//Proceedings of the 28th International Conference on Neural Information Processing Systems.Montreal:ACM,2015:2017-2025.
[16]XUE X,DING Y,MENG X,et al.Investigating Collaborative Layer Projection for Robust Rain Scene Modeling[J].IEEE Access,2020,8:161765-161775.
[17]WANG Z,BOVIK A C,SHEIKH H R,et al.Image quality assessment:from error visibility to structural similarity[J].IEEE Trans on Image Processing,2004,13(4):600-612.
[18]CHEN J,TAN C H,HOU J,et al.Robust Video Content Alignment and Compensation for Rain Removal in a CNN Framework[C]//Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognition.Salt Lake City:IEEE Press,2018:6286-6295.
[19]KINGMA D P,BA J.Adam:A Method for Stochastic Optimization[J].arXiv:1909.08326,2019.
[20]JIANG T X,HUANG T Z,ZhAO X L,et al.FastDeRain:A Novel Video Rain Streak Removal Method Using Directional Gradient Priors[J].IEEE Transactions on Image Processing,2018,28(4):2089-2102.
[21]HUYNH-THU Q,GHANBARI M.Scope of validity of PSNR in image/video quality assessment[J].Electronics Letters,2008,44(13):800-801.
[22]RANJAN A,BLACK M J.Optical Flow Estimation Using aSpatial Pyramid Network[C]//Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.Honolulu:IEEE Press,2017:4161-4170.
[23]SUN D,YANG X,LIU M Y,et al.PWC-Net:CNNs for Optical Flow Using Pyramid,Warping,and Cost Volume[C]//Procee-dings of IEEE Conference on Computer Vision and Pattern Re-cognition.Salt Lake City:IEEE Press,2018:8934-8943.
[24]HUI T W,TANG X,LOY C C.LiteFlowNet:A Lightweight Convolutional Neural Network for Optical Flow Estimation[C]//Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognition.Salt Lake City:IEEE Press,2018:8981-8989.
[25]MEISTER S,HUR J,ROTH S.Unflow:Unsupervised learning of optical flow with a bidirectional census loss [C]//Proceedings of the AAAI Conference on Artificial Intelligence.New Orleans:AAAI,2018,32(1):7251-7259.
[1] ZHOU Le-yuan, ZHANG Jian-hua, YUAN Tian-tian, CHEN Sheng-yong. Sequence-to-Sequence Chinese Continuous Sign Language Recognition and Translation with Multi- layer Attention Mechanism Fusion [J]. Computer Science, 2022, 49(9): 155-161.
[2] CHEN Yong-quan, JIANG Ying. Analysis Method of APP User Behavior Based on Convolutional Neural Network [J]. Computer Science, 2022, 49(8): 78-85.
[3] ZHU Cheng-zhang, HUANG Jia-er, XIAO Ya-long, WANG Han, ZOU Bei-ji. Deep Hash Retrieval Algorithm for Medical Images Based on Attention Mechanism [J]. Computer Science, 2022, 49(8): 113-119.
[4] DAI Zhao-xia, LI Jin-xin, ZHANG Xiang-dong, XU Xu, MEI Lin, ZHANG Liang. Super-resolution Reconstruction of MRI Based on DNGAN [J]. Computer Science, 2022, 49(7): 113-119.
[5] LIU Yue-hong, NIU Shao-hua, SHEN Xian-hao. Virtual Reality Video Intraframe Prediction Coding Based on Convolutional Neural Network [J]. Computer Science, 2022, 49(7): 127-131.
[6] XU Ming-ke, ZHANG Fan. Head Fusion:A Method to Improve Accuracy and Robustness of Speech Emotion Recognition [J]. Computer Science, 2022, 49(7): 132-141.
[7] WU Zi-bin, YAN Qiao. Projected Gradient Descent Algorithm with Momentum [J]. Computer Science, 2022, 49(6A): 178-183.
[8] YANG Yue, FENG Tao, LIANG Hong, YANG Yang. Image Arbitrary Style Transfer via Criss-cross Attention [J]. Computer Science, 2022, 49(6A): 345-352.
[9] YANG Jian-nan, ZHANG Fan. Classification Method for Small Crops Combining Dual Attention Mechanisms and Hierarchical Network Structure [J]. Computer Science, 2022, 49(6A): 353-357.
[10] ZHANG Jia-hao, LIU Feng, QI Jia-yin. Lightweight Micro-expression Recognition Architecture Based on Bottleneck Transformer [J]. Computer Science, 2022, 49(6A): 370-377.
[11] WANG Jian-ming, CHEN Xiang-yu, YANG Zi-zhong, SHI Chen-yang, ZHANG Yu-hang, QIAN Zheng-kun. Influence of Different Data Augmentation Methods on Model Recognition Accuracy [J]. Computer Science, 2022, 49(6A): 418-423.
[12] SUN Jie-qi, LI Ya-feng, ZHANG Wen-bo, LIU Peng-hui. Dual-field Feature Fusion Deep Convolutional Neural Network Based on Discrete Wavelet Transformation [J]. Computer Science, 2022, 49(6A): 434-440.
[13] ZHAO Zheng-peng, LI Jun-gang, PU Yuan-yuan. Low-light Image Enhancement Based on Retinex Theory by Convolutional Neural Network [J]. Computer Science, 2022, 49(6): 199-209.
[14] LIU Lin-yun, CHEN Kai-yan, LI Xiong-wei, ZHANG Yang, XIE Fang-fang. Overview of Side Channel Analysis Based on Convolutional Neural Network [J]. Computer Science, 2022, 49(5): 296-302.
[15] ZHANG Wen-xuan, WU Qin. Fine-grained Image Classification Based on Multi-branch Attention-augmentation [J]. Computer Science, 2022, 49(5): 105-112.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!