Computer Science ›› 2022, Vol. 49 ›› Issue (11): 126-133.doi: 10.11896/jsjkx.220500193

• Computer Graphics & Multimedia • Previous Articles     Next Articles

Variational Domain Adaptation Driven Semantic Segmentation of Urban Scenes

JIN Yu-jie1,2, CHU Xu1,3, WANG Ya-sha1,4, ZHAO Jun-feng1,2   

  1. 1 Key Lab of High Confidence Software Technologies(Peking University),Ministry of Education,Beijing 100871,China
    2 School of Computer Science and Technology,Peking University,Beijing 100871,China
    3 Department of Computer Science and Technology,Tsinghua University,Beijing 100084,China
    4 National Engineering Research Center for Software Engineering,Peking University,Beijing 100871,China
  • Received:2022-05-20 Revised:2022-07-22 Online:2022-11-15 Published:2022-11-03
  • About author:JIN Yu-jie,born in 1999,postgraduate.His main research interests include machine learning and data mining.
    WANG Ya-sha,born in 1975,Ph.D,professor,Ph.D supervisor,is a member of China Computer Federation.His main research interests include smart city and big data analysis.
  • Supported by:
    National Natural Science Foundation of China(62172011).

Abstract: Semantic segmentation of urban scenes aims to identify and segment persons,obstacles,roads,signs and other elements from the image,and provide information of free space on the road for vehicles.It is one of the key technologies of automatic dri-ving.High performance semantic segmentation systems rely heavily on a large number of real annotation data required for trai-ning.However,labeling each pixel in the image is costly and often difficult to achieve.One way is to collect photo-realistic synthe-tic data from video games,where pixel-level annotation can be automatically generated at a low cost,to train the machine learning model to segment the images in the real world,which corresponds to domain adaptation.Different from the current mainstream semantic segmentation domain adaptation methods based on Vapnik-Chervonenkis dimension theory or Rademacher complexity theory,our method is inspired by the target domain Gibbs risk upper bound compatible with pseudo labels based on PAC-Bayes theory,and considers the average situation of the hypothetical space rather than the worst situation,so as to avoid excessively constraining the domain discrepancy in the latent space which leads to the problem that the upper bound of target domain genera-lization error cannot be estimated and optimized effectively.Under the guidance of the above ideas,this paper proposes a varia-tional inference method for semantic segmentation adaptation(VISA).The dropout variational family is used for variational infe-rence.While solving the ideal posterior distribution in the hypothesis space,an approximate Bayes classifier can be quickly obtained,and the estimation of the upper bound of risk is more accurate by minimizing the entropy of the target domain and filtering pixels.Experiments show that the mean intersection over the union(mIoU) of VISA is 0.5% ~ 6.6% higher than that of baseline methods,and has high accuracy in pedestrian,vehicle and other urban scene elements.

Key words: Semantic segmentation, Domain adaptation, PAC-Bayes theory, Variational inference, Deep neural network

CLC Number: 

  • TP181
[1]ZHAO H,SHI J,QI X,et al.Pyramid scene parsing network[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2017:2881-2890.
[2]YU F,KOLTUNV,FUNKHOUSER T.Dilated residual net-works[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2017:472-480.
[3]CHEN L C,PAPANDREOU G,KOKKINOS I,et al.Deeplab:Semantic image segmentation with deep convolutional nets,atrous convolution,and fully connected crfs[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2017,40(4):834-848.
[4]RICHTER S R,VINEET V,ROTH S,et al.Playing for data:Ground truth from computer games[C]//European Conference on ComputerVision.Cham:Springer,2016:102-118.
[5]HOFFMAN J,WANG D,YU F,et al.Fcns in the wild:Pixel-level adversarial and constraint-based adaptation[C]//CoRR.2016.
[6]ZHANG Y,DAVID P,GONG B.Curriculum domain adaptation for semantic segmentation of urban scenes[C]//Proceedings of the IEEE International Conference on Computer Vision.2017:2020-2030.
[7]WANG M,DENG W.Deep visual domain adaptation:A survey[J].Neurocomputing,2018,312:135-153.
[8]TSAI Y H,HUNG W C,SCHULTER S,et al.Learning toadapt structured output space for semantic segmentation[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:7472-7481.
[9]VU T H,JAIN H,BUCHER M,et al.Advent:Adversarial entropy minimization for domain adaptation in semantic segmentation[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2019:2517-2526.
[10]BEN-DAVID S,BLITZER J,CRAMMER K,et al.Analysis of representations for domain adaptation[C]//Advances in Neural Information Processing Systems 19:Proceedings of the 2006 Conference.MIT Press,2006.
[11]MANSOUR Y,MOHRI M,ROSTAMIZADEH A.Domainadaptation:Learning bounds and algorithms[J].arXiv:0902.3430,2009.
[12]LIU H,LONG M,WANG J,et al.Transferable adversarialtraining:A general approach to adapting deep classifiers[C]//International Conference on Machine Learning.PMLR,2019:4013-4022.
[13]JIN C,NETRAPALLI P,JORDAN M.What is local optimality in nonconvex-nonconcave minimax optimization?[C]//International Conference on Machine Learning.PMLR,2020:4880-4889.
[14]CHU X.Feature Map Sharing towards High-dimensional Un-der-Labeled Data Analysis[D].Beijing:Peking University,2021.
[15]LONG J,SHELHAMER E,DARRELL T.Fully convolutional networks for semantic segmentation[C]//Proceedings of the IEEE Conferenceon Computer Vision and Pattern Recognition.2015:3431-3440.
[16]LONG M,CAO Y,WANG J,et al.Learning transferable fea-tures with deep adaptation networks[C]//International Confe-rence on Machine Learning.PMLR,2015:97-105.
[17]GANIN Y,USTINOVA E,AJAKAN H,et al.Domain-adversa-rial training of neural networks[J].TheJournal of Machine Learning Research,2016,17(1):2096-2030.
[18]JUDY H,ERIC T,TAESUNG P,et al.Cycada:Cycle-consistent adversarial domain adaptation[C]//Proceedings of the 35th International Conference on Machine Learning.2018.
[19]GONG R,LI W,CHEN Y,et al.Dlow:Domain flow for adaptation and generalization[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2019:2477-2486.
[20]CHEN Y,LI W,VAN GOOL L.Road:Reality oriented adaptation for semantic segmentation of urban scenes[C]//Procee-dings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:7892-7901.
[21]SUN B,SAENKO K.From Virtual to Reality:Fast Adaptation of Virtual Object Detectors to Real Domains[C]//BMVC.2014.
[22]VAZQUEZ D,LOPEZ A M,MARIN J,et al.Virtual and real world adaptation for pedestrian detection[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2013,36(4):797-809.
[23]GERMAIN P,HABRARD A,LAVIOLETTE F,et al.A PAC-Bayesian approach for domain adaptation with specialization to linear classifiers[C]//International Conference on Machine Learning.PMLR,2013:738-746.
[24]GERMAIN P,HABRARD A,LAVIOLETTE F,et al.PAC-Bayes and domain adaptation[J].Neurocomputing,2020,379:379-397.
[25]LIN T Y,GOYAL P,GIRSHICK R,et al.Focal lossfor dense object detection[C]//Proceedings of the IEEE International Conference on Computer Vision.2017:2980-2988.
[26]CHEN M,XUE H,CAI D.Domain adaptation for semantic segmentation with maximum squares loss[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision.2019:2090-2099.
[27]CORDTS M,OMRAN M,RAMOS S,et al.The cityscapes dataset for semantic urban scene understanding[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2016:3213-3223.
[28]DENG J,DONG W,SOCHER R,et al.Imagenet:A large-scale hierarchical image database[C]//2009 IEEE Conference on Computer Vision and Pattern Recognition.IEEE,2009:248-255.
[29]HE K,ZHANG X,REN S,et al.Deep residual learning forimage recognition[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2016:770-778.
[30]LUO Y,LIU P,GUAN T,et al.Significance-aware information bottleneck for domain adaptive semantic segmentation[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision.2019:6778-6787.
[1] CHENG Cheng, JIANG Ai-lian. Real-time Semantic Segmentation Method Based on Multi-path Feature Extraction [J]. Computer Science, 2022, 49(7): 120-126.
[2] WEI Hui, CHEN Ze-mao, ZHANG Li-qiang. Anomaly Detection Framework of System Call Trace Based on Sequence and Frequency Patterns [J]. Computer Science, 2022, 49(6): 350-355.
[3] HU Fu-yuan, WAN Xin-jun, SHEN Ming-fei, XU Jiang-lang, YAO Rui, TAO Zhong-ben. Survey Progress on Image Instance Segmentation Methods of Deep Convolutional Neural Network [J]. Computer Science, 2022, 49(5): 10-24.
[4] JIAO Xiang, WEI Xiang-lin, XUE Yu, WANG Chao, DUAN Qiang. Automatic Modulation Recognition Based on Deep Learning [J]. Computer Science, 2022, 49(5): 266-278.
[5] GAO Jie, LIU Sha, HUANG Ze-qiang, ZHENG Tian-yu, LIU Xin, QI Feng-bin. Deep Neural Network Operator Acceleration Library Optimization Based on Domestic Many-core Processor [J]. Computer Science, 2022, 49(5): 355-362.
[6] CHEN Kai, LIU Man, WANG Zhi-teng, MAO Shao-chen, SHEN Qiu-hui, ZHANG Hong-jun. Study on Data Filling Based on Global-attributes Attention Neural Process Model [J]. Computer Science, 2022, 49(10): 111-117.
[7] FAN Hong-jie, LI Xue-dong, YE Song-tao. Aided Disease Diagnosis Method for EMR Semantic Analysis [J]. Computer Science, 2022, 49(1): 153-158.
[8] NING Qiu-yi, SHI Xiao-jing, DUAN Xiang-yu, ZHANG Min. Unsupervised Domain Adaptation Based on Style Aware [J]. Computer Science, 2022, 49(1): 271-278.
[9] LIU Kai, ZHANG Hong-jun, CHEN Fei-qiong. Name Entity Recognition for Military Based on Domain Adaptive Embedding [J]. Computer Science, 2022, 49(1): 292-297.
[10] WANG Shi-yun, YANG Fan. Remote Sensing Image Semantic Segmentation Method Based on U-Net Feature Fusion Optimization Strategy [J]. Computer Science, 2021, 48(8): 162-168.
[11] ZHOU Xin, LIU Shuo-di, PAN Wei, CHEN Yuan-yuan. Vehicle Color Recognition in Natural Traffic Scene [J]. Computer Science, 2021, 48(6A): 15-20.
[12] WU Lan, WANG Han, LI Bin-quan. Unsupervised Domain Adaptive Method Based on Optimal Selection of Self-supervised Tasks [J]. Computer Science, 2021, 48(6A): 357-363.
[13] LIU Dong, WANG Ye-fei, LIN Jian-ping, MA Hai-chuan, YANG Run-yu. Advances in End-to-End Optimized Image Compression Technologies [J]. Computer Science, 2021, 48(3): 1-8.
[14] ZHAN Rui, LEI Yin-jie, CHEN Xun-min, YE Shu-han. Street Scene Change Detection Based on Multiple Difference Features Network [J]. Computer Science, 2021, 48(2): 142-147.
[15] MA Chuang, TIAN Qing, SUN He-yang, CAO Meng, MA Ting-huai. Unsupervised Domain Adaptation Based on Weighting Dual Biases [J]. Computer Science, 2021, 48(2): 217-223.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!