Computer Science ›› 2023, Vol. 50 ›› Issue (3): 208-215.doi: 10.11896/jsjkx.220100149

• Computer Graphics & Multimedia • Previous Articles     Next Articles

Polarized Self-attention Constrains Color Overflow in Automatic Coloring of Image

LIU Hang1, PU Yuanyuan1,2, LYU Dahua1, ZHAO Zhengpeng1, XU Dan1, QIAN Wenhua1   

  1. 1 School of Information Science and Engineering,Yunnan University,Kunming 650504,China
    2 University Key Laboratory of Internet of Things Technology and Application,Yunnan Province,Kunming 650504,China
  • Received:2022-01-16 Revised:2022-09-08 Online:2023-03-15 Published:2023-03-15
  • About author:LIU Hang,born in 1995,postgraduate.His main research interests include deep learning and image colorization.
    PU Yuanyuan,born in 1972,professor,Ph.D supervisor,is a member of China Computer Federation.Her main research interests include digital image proces-sing,non-photorealistic rendering,scientific understanding of visual arts.
  • Supported by:
    National Natural Science Foundation of China(62162068,61271361,61761046,62061049),Yunnan Science and Technology Department Project(2018FB100) and Key Program of the Applied Basic Research Programs of Yunnan(202001BB050043,2019FA044).

Abstract: Auto coloring transforms grayscale images into reasonable colored versions of natural color images,allowing the restoration of color for old photographs,black-and-white films,videos,etc.Therefore,it is widely concerned in the realms of computer vision and graphics.Nevertheless,allocating colors to grayscale images is a highly challenging mission with a color overflow pro-blem.To address the problem,a technique for automatic coloring of images with polarized self-attention constrained color overflow is proposed.At first,separating instances in the foreground from the background minimizes the coloring effect of the background against the foreground,to mitigate the color overflow among the foreground and background.Second,the polarized self-attention block splits the features into color channels and spatial locations for more accurate and specific coloring,which reduces the color overflow within the global image,instance objects.At last,the fusion module is combined to integrate the global features and instance features through different weights to accomplish the ultimate coloring.Experiment results show that the main indexes FID and LPIPS are improved by 9.7% and 10.9% respectively,and the indexes SSIM and PSNR are optimal compared with ChromaGAN and MemoGAN.

Key words: Image coloring, Deep learning, Target detection, Self attention, Color overflow

CLC Number: 

  • TP391
[1]QU Y,WONG T T,HENG P A.Manga colorization[J].ACM Transactions on Graphics,2006,25(3):1214-1220.
[2]DOU Z,WNAG N,WANG S J,et al.Sketch colorization method with drawing prior[J].Computer Science,2022,49(4):195-202.
[3]LIU X,WAN L,QU Y,et al.Intrinsic colorization[J].ACM Transactions on Graphics,2008,27(5):1-9.
[4]CHIA Y S,ZHUO S,GUPTA R K,et al.Semantic colorization with internet images[J].Acm Transactions on Graphics,2011,30(6CD):156.1-156.7.
[5]ZHANG R,ZHU J Y,ISOLA P,et al.Real-time user-guided image colorization with learned deep priors[J].Acm Transactions on Graphics,2017,36(4):1-11
[6]ZHANG R,ISOLA P,EFROS A A.Colorful Image Colorization[C]//European Conference on Computer Vision.Springer International Publishing.2016:649-666.
[7]ZHAO J,LIU L,SNOEK C,et al.Pixel-level semantics guided image colorization[J].arXiv:1808.01597,2018.
[8]ZOU A,SHEN X,ZHANG X,et al.Neutral Color CorrectionAlgorithm for Color Transfer Between Multicolor Images[C]//Advances in Graphic Communication,Printing and Packaging Technology and Materials.2021:176-182.
[9]YOO S,BAHNG H,CHUNG S,et al.Coloring With LimitedData:Few-Shot Colorization via Memory-Augmented Networks[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2019:11283-11292.
[10]VITORIA P,RAAD L,BALLESTER C.ChromaGAN:Adversarial Picture Colorization with Semantic Class Distribution[C]//Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision.2020:2445-2454.
[11]GOODFELLOW I,POUGET-ABADIE J,MIRZA M,et al.Ge-nerative Adversarial Nets.[J].arXiv:1411.1784,2014.
[12]RUSSAKOVSKY O,DENG J,SU H,et al.ImageNet large scale visual recognition challenge[J].International Journal of Computer Vision,2015,115(3):211-252.
[13]HE K,GKIOXARI G,DOLLÁR P,et al.Mask R-CNN[C]//Proceedings of the IEEE International Conference on Computer Vision.2017:2961-2969.
[14]REN S,HE K,GIRSHICK R,et al.Faster r-cnn:Towards real-time object detection with region proposal networks[J].IEEE Transactions on Pattern Analysis & Machine Intelligence,2017,39(6):1137-1149.
[15]ZHU J Y,PHILIPP K,SHECHTMAN E,et al.Generative visual manipulation on the natural image manifold[C]//European Conference on Computer Vision.Cham:Springer,2016:597-613.
[16]CAESAR H,UIJLINGS J,FERRARI V.COCO-Stuff:Thingand Stuff Classes in Context[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:1209-1218.
[17]OBUKHOV A,KRASNYANSKIY M.Quality Assessment Me-thod for GAN Based on Modified Metrics Inception Score and Fréchet Inception Distance[C]//Proceedings of the Computational Methods in Systems and Software.2020:102-114.
[18]ZHANG R,ISOLA P,EFROS A A,et al.The Unreasonable Ef-fectiveness of Deep Features as a Perceptual Metric[C]//Pro-ceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:586-595.
[19]WANG Q,WU B,ZHU P,et al.Eca-net:Efficient channel attention for deep convolutional neural networks[J].arXiv:1910.03151,2020.
[20]JIE H,LI S,SAMUEL A,et al.Squeeze-and-Excitation Networks[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2019:7132-7147.
[21]YANG Y B.SA-Net:Shuffle Attention for Deep Convolutional Neural Networks[C]//2021 IEEE International Conference on Acoustics,Speech and Signal Processing(IICASSP 2021).2021:2235-2239.
[22]WOO S,PARK J,LEE J Y,et al.CBAM:Convolutional Block Attention Module[C]//Proceedings of the European Conference on Computer Vision(ECCV).2018:3-19.
[23]LIU H,LIU F,FAN X,et al.Polarized self-attention:Towards high-quality pixel-wise regression[J].arXiv:2107.00782,2021.
[1] DONG Yongfeng, HUANG Gang, XUE Wanruo, LI Linhao. Graph Attention Deep Knowledge Tracing Model Integrated with IRT [J]. Computer Science, 2023, 50(3): 173-180.
[2] HUA Xiaofeng, FENG Na, YU Junqing, HE Yunfeng. Shooting Event Detection of Free Kick in Soccer Video Based on Rule Reasoning [J]. Computer Science, 2023, 50(3): 181-190.
[3] MEI Pengcheng, YANG Jibin, ZHANG Qiang, HUANG Xiang. Sound Event Joint Estimation Method Based on Three-dimension Convolution [J]. Computer Science, 2023, 50(3): 191-198.
[4] BAI Xuefei, MA Yanan, WANG Wenjian. Segmentation Method of Edge-guided Breast Ultrasound Images Based on Feature Fusion [J]. Computer Science, 2023, 50(3): 199-207.
[5] CHEN Liang, WANG Lu, LI Shengchun, LIU Changhong. Study on Visual Dashboard Generation Technology Based on Deep Learning [J]. Computer Science, 2023, 50(3): 238-245.
[6] ZHANG Yi, WU Qin. Crowd Counting Network Based on Feature Enhancement Loss and Foreground Attention [J]. Computer Science, 2023, 50(3): 246-253.
[7] YING Zonghao, WU Bin. Backdoor Attack on Deep Learning Models:A Survey [J]. Computer Science, 2023, 50(3): 333-350.
[8] LIANG Jiali, HUA Baojian, SU Shaobo. Tensor Instruction Generation Optimization Fusing with Loop Partitioning [J]. Computer Science, 2023, 50(2): 374-383.
[9] ZOU Yunzhu, DU Shengdong, TENG Fei, LI Tianrui. Visual Question Answering Model Based on Multi-modal Deep Feature Fusion [J]. Computer Science, 2023, 50(2): 123-129.
[10] WANG Pengyu, TAI Wenxin, LIU Fang, ZHONG Ting, LUO Xucheng, ZHOU Fan. Self-supervised Flight Trajectory Prediction Based on Data Augmentation [J]. Computer Science, 2023, 50(2): 130-137.
[11] GUO Nan, LI Jingyuan, REN Xi. Survey of Rigid Object Pose Estimation Algorithms Based on Deep Learning [J]. Computer Science, 2023, 50(2): 178-189.
[12] LI Junlin, OUYANG Zhi, DU Nisuo. Scene Text Detection with Improved Region Proposal Network [J]. Computer Science, 2023, 50(2): 201-208.
[13] HUA Jie, LIU Xueliang, ZHAO Ye. Few-shot Object Detection Based on Feature Fusion [J]. Computer Science, 2023, 50(2): 209-213.
[14] CAI Xiao, CEHN Zhihua, SHENG Bin. SPT:Swin Pyramid Transformer for Object Detection of Remote Sensing [J]. Computer Science, 2023, 50(1): 105-113.
[15] WANG Bin, LIANG Yudong, LIU Zhe, ZHANG Chao, LI Deyu. Study on Unsupervised Image Dehazing and Low-light Image Enhancement Algorithms Based on Luminance Adjustment [J]. Computer Science, 2023, 50(1): 123-130.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!