Computer Science ›› 2024, Vol. 51 ›› Issue (2): 117-134.doi: 10.11896/jsjkx.230400197
• Computer Graphics & Multimedia • Previous Articles Next Articles
CAI Jiacheng1, DONG Fangmin1, SUN Shuifa2, TANG Yongheng3
CLC Number:
[1]SAXENA A,SUN M,NG A.Make3D:learning 3D scene structure from a single still image[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2009,31(5):824-840. [2]GEIGER A,LENZ P,URTASUN R.Are we ready for autonomous driving?The KITTI vision benchmark suite[C]//Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition(CVPR).IEEE Computer Society,2012:3354-3361. [3]SNAVELY N,SEITZ S M,SZELISKI R.Photo tourism:Exploring photo collections in 3D[J].ACM Transactions on Graphics(TOG),2006,25(3):835-846. [4]ZHANG R,TSAI P S,CRYER J E,et al.Shape from Shading:A Survey[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,1999,21(8):690-706. [5]BARTOLI A,GERARD Y,CHADEBECQ F,et al.Shape-from-Template[J].IEEE Transactions Pattern Analysis and Machine Intelligence,2015,37(10):2099- 2118. [6]SAXENA A,CHUNG S H,NG A Y.Learning Depth from Single Monocular Images[C]//Proceedings of the 18th Interna-tional Conference on Neural Information Processing Systems(NIPS’05).MIT Press,2005:1161-1168. [7]SAXENA A,SCHULTE J,NG A Y.Depth estimation usingmonocular and stereo cues[C]//International Joint Conference on Artifical Intelligence.Morgan Kaufmann Publishers Inc,2007:2197-2203. [8]LIU B,GOULD S,KOLLER D.Single Image Depth Estimation From Predicted Semantic Labels[C]//The Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition.IEEE,2010:1253-1260. [9]WANG Y,WANG R,DAI Q.A Parametric Model for Describing the Correlation Between Single Color Images and Depth Maps[J].IEEE Signal Processing Letters,2014,21(7):800-803. [10]KARSCH K,LIU C,KANG S B.Depth Extraction from Video Using Non-parametric Sampling[J].IEEE Transactions on Pattern Analysis & Machine Intelligence,2012,36(11):2144. [11]KONRAD J,MENG W,ISHWAR P.2D-to-3D image conver-sion by learning depth from examples[C]//Computer Vision & Pattern Recognition Workshops.IEEE,2012:16-22. [12]LIU T,MO Y,XU G,et al.Depth estimation of monocular video using non-parametric fusion of multiple cues[J].Journal of Southeast University(Natural Science Edition),2015,45(5):834-839. [13]LIU M,SALZMANN M,HE X.Discrete-Continuous Depth Estimation from a Single Image[C]//IEEE Conference on Computer Vision & Pattern Recognition.IEEE Computer Society,2014:716-723. [14]EIGEN D,PUHRSCH C,FERGUS R.Depth map predictionfrom a single image using a multi-scale deep network[J].Advances in Neural Information Processing Systems,2014,27:2366-2374. [15]EIGEN D,FERGUS R.Predicting depth,surface normals and semantic labels with a common multi-scale convolutional architecture[C]//Proceedings of the IEEE International Conference on Computer Vision.IEEE,2015:2650-2658. [16]LIU F,SHEN C,LIN G.Deep Convolutional Neural Fields for Depth Estimation from a Single Image[C]//2015 IEEE Confe-rence on Computer Vision and Pattern Recognition(CVPR).IEEE,2015:5162-5170. [17]LI B,SHEN C,DAI Y,et al.Depth and surface normal estimation from monocular images using regression on deep features and hierarchical CRFs[C]//Computer Vision & Pattern Recognition.IEEE Computer Society,2015:1119-1127. [18]WANG Q,ZHAO H,HU Z,et al.Discrete convolutional CRF networks for depth estimation from monocular infrared images[J].International Journal of Machine Learning and Cybernetics,2021,12:187-200. [19]YUAN W,GU X,DAI Z,et al.NeW CRFs:Neural Window Fully-connected CRFs for Monocular Depth Estimation[C]//2022 IEEE/ CVF Conference on Computer Vision and Pattern Recognition(CVPR).2022:3906-3915. [20]HE K,ZHANG X,REN S,et al.Deep Residual Learning forImage Recognition[J].Deep residual learning for image recognition[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2016:770- 778. [21]LAINA I,RUPPRECHT C,BELAGIANNIS V,et al.DeeperDepth Prediction with Fully Convolutional Residual Networks[C]//2016 Fourth International Conference on 3D Vision(3DV).IEEE,2016:239-248. [22]CHEN W,FU Z,YANG D,et al.Single-Image Depth Perception in the Wild[C]//In Proceedings of the 30th International Conference on Neural Information Processing Systems(NIPS’16).Curran Associates Inc,2016:730- 738. [23]DAI J,LI Y,HE K,et al.R-FCN:Object Detection via Region-based Fully Convolutional Networks[C]//Proceedings of the 30th International Conference on Neural Information Processing Systems(NIPS’16).Curran Associates Inc,2016:379-387. [24]CAO Y,WU Z,SHEN C.Estimating Depth From MonocularImages as Classification Using Deep Fully Convolutional Resi-dual Networks[J].IEEE Transactions on Circuits & Systems for Video Technology,2018,28(11):3174-3182. [25]FU H,GONG M,WANG C,et al.Deep ordinal regression network for monocular depth estimation[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:2002-2011. [26]KUZNIETSOV Y,STUCKLER,LEIBE B.Semi-SupervisedDeep Learning for Monocular Depth Map Prediction[C]//2017 IEEE Conference on Computer Vision and Pattern Recognition(CVPR).2017:2215-2223. [27]GUIZILINI V,LI J,AMBRUS R,et al.Robust Semi-Supervised Monocular Depth Estimation with Reprojected Distances[C]//Proceedings of the Conference on Robot Learning.PMLR,2019. [28]JI R,LI K,WANG Y,et al.Semi-supervised adversarial mono-cular depth estimation[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2019,42(10):2410-2422. [29]RAMIREZ P Z,POGGI M,TOSI F,et al.Geometry meets semantics for semi-supervised monocular depth estimation[C]//Proceedings of the Asian Conlerence on Computer Vision.2018:298-313. [30]GARG R,BG V K,CARNEIRO G,et al.Unsupervised cnn for single view depth estimation:Geometry to the rescue[C]//European Conference on Computer Vision.Lecture Notes in Computer Science,2016:740-756. [31]ZHOU T,BROWN M,SNAVELY N,et al.Unsupervised lear-ning of depth and ego-motion from video[C]//2017 IEEE Conference on Computer Vision and Pattern Recognition(CVPR).IEEE,2017:1851-1858. [32]GODARD C,AODHA O M,BROSTOW G J.Unsupervised Monocular Depth Estimation with Left-Right Consistency[C]//2017 IEEE Conference on Computer Vision and Pattern Recognition(CVPR).IEEE,2017:270-279. [33]POGGI M,TOSI F,MATTOCCIA S.Learning monocular depth estimation with unsupervised trinocular assumptions[C]//2018 International Conference on 3D Vision(3DV)IEEE,2018:324-333. [34]GODARD C,AODHA O M,FIRMAN M,et al.Digging IntoSelf-Supervised Monocular Depth Estimation[C]//2019 IEEE/CVF International Conference on Computer Vision(ICCV).IEEE,2019:3827-3837. [35]MAHJOURIAN R,WICKE M,ANGELOVA A,et al.Unsu-pervised Learning of Depth and Ego-Motion from Monocular Video Using 3D Geometric Constraints[C]//2017 IEEE Confe-rence on Computer Vision and Pattern Recognition(CVPR).IEEE,2018:6612-6619. [36]VIJAYANARASIMHAN S,RICCO S,SCHMID C,et al.SfM-Net:Learning of Structure and Motion from Video[J].arXiv:1704.07804,2017. [37]BIAN J W,LI Z,WANG N,et al.Unsupervised Scale-consistent Depth and Ego-motion Learning from Monocular Video[C]//Neural Information Processing Systems.2019. [38]WANG G,WANG H,LIU Y,et al.Unsupervised learning of monocular depth and ego-motion using multiple masks[C]//2019 International Conference on Robotics and Automation(ICRA).IEEE,2019:4724-4730. [39]SUN Q,TANG Y,ZHANG C,et al.Unsupervised estimation of monocular depth and VO in dynamic environments via hybrid masks[J].IEEE Transactions on Neural Networks and Learning Systems,2021,33(5):2023-2033. [40]WANG C,BUENAPOSADA J M,ZHU R,et al.Learning depth from monocular videos using direct methods[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:2022-2030. [41]ZHAN H,GARG R,WEERASEKERA C S,et al.Unsupervised learning of monocular depth estimation and visual odometry with deep feature reconstruction[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:340-349. [42]DAI R Y,FANG Z J,GAO Y B.Unsupervised Monocular Depth Estimation by Fusing Dilated Convolutional Network and SLAM[J].Laser & Optoelectronics Progress,2020,57(6):061007. [43]WANG R,PIZER S M,FRAHM J M.Recurrent neural network for (un-)supervised learning of monocular video visual odometry and depth[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2019:5555-5564. [44]FENG T,GU D.SGANVO:Unsupervised deep visual odometry and depth estimation with stacked generative adversarial networks[J].IEEE Robotics and Automation Letters,2019,4(4):4431-4437. [45]ALMALIOGLU Y,SAPUTRA M R U,DE GUSMAO P P B,et al.Ganvo:Unsupervised deep monocular visual odometry and depth estimation with generative adversarial networks[C]//2019 International Conference on Robotics and Automation(ICRA).IEEE,2019:5474-5480. [46]YIN Z,SHI J.Geonet:Unsupervised learning of dense depth,optical flow and camera pose[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:1983-1992. [47]ZOU Y,LUO Z,HUANG J B.Df-net:Unsupervised joint lear-ning of depth and flow using cross-task consistency[C]//Proceedings of the European Conference on Computer Vision(ECCV).2018:36-53. [48]HUR J,ROTH S.Self-supervised monocular scene flow estimation[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2020:7396-7405. [49]GUIZILINI V,AMBRUS R,PILLAI S,et al.3d packing forself-supervised monocular depth estimation[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2020:2485-2494. [50]KLINGNER M,TERMÖHLEN J A,MIKOLAJCZYK J,et al.Self-supervised monocular depth estimation:Solving the dynamic object problem by semantic guidance[C]//Computer Vision-ECCV 2020:16th European Conference,Glasgow,UK,August 23-28,2020,Proceedings,Part XX 16.Springer International Publishing,2020:582-600. [51]CHEN Y,SCHMID C,SMINCHISESCU C.Self-supervisedlearning with geometric constraints in monocular video:Connecting flow,depth,and camera[C]//Proceedings of the IEEE /CVF International Conference on Computer Vision.2019:7063-7072. [52]WATSON J,MAC AODHA O,et al.The temporal opportunist:Self-supervised multi-frame monocular depth[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2021:1164- 1174. [53]GUIZILINI V,AMBRUŞ R,CHEN D,et al.Multi-frame self-supervised depth with transformers[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2022:160-170. [54]FENG Z,YANG L,JING L,et al.Disentangling Object Motion and Occlusion for Unsupervised Multi-frame Monocular Depth[C]//Computer Vision-ECCV 2022:17th European Conference,Tel Aviv,Israel,October 23-27,2022,Proceedings,Part XXXII.Cham:Springer Nature Switzerland,2022:228-244. [55]WANG X,ZHU Z,HUANG G,et al.Crafting Monocular Cues and Velocity Guidance for Self-Supervised Multi-Frame Depth Learning[J].arXiv:2208.09170,2022. [56]GOODFELLOW I,POUGET-ABADIE J,MIRZA M,et al.Ge-nerative adversarial networks[J].Communications of the ACM,2020,63(11):139-144. [57]JUNG H,KIM Y,MIN D,et al.Depth Prediction from a Single Image with Conditional Adversarial Networks[C]//2017 IEEE International Conference on Image Processing(ICIP).IEEE,2017:1717-1721. [58]LI S,XUE F,WANG X,et al.Sequential Adversarial Learning for Self-Supervised Deep Visual Odometry[C]//Proceedings of the IEEE/ CVF International Conference on Computer Vision.2019:2851-2860. [59]ALEOTTI F,TOSI F,POGGI M,et al.Generative Adversarial Networks for Unsupervised Monocular Depth Prediction[C]//Proceedings of the European Conference on Computer Vision(ECCV)Workshops.2018. [60]VANKADARI M,GARG S,MAJUMDER A,et al.Unsuper-vised Monocular Depth Estimation for Night-Time Images Using Adversarial Domain Feature Adaptation[C]//Computer Vision-ECCV 2020:16th European Conference,Glasgow,UK,August 23-28,2020,Proceedings,Part XXVIII 16.Springer International Publishing,2020:443-459. [61]LIU L,SONG X,WANG M,et al.Self-Supervised MonocularDepth Estimation for All Day Images Using Domain Separation[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision.2021:12737-12746. [62]GUPTA H,MITRA K.Unsupervised Single Image Underwater Depth Estimation[C]//2019 IEEE International Conference on Image Processing(ICIP).IEEE,2019:624-628. [63]XU Y,WANG Y,HUANG R,et al.Unsupervised Learning of Depth Estimation and Camera Pose With Multi-Scale GANs[J].IEEE Transactions on Intelligent Transportation Systems,2022,23(10):17039-17047. [64]POGGI M,ALEOTTI F,TOSI F,et al.Towards real-time unsupervised monocular depth estimation on cpu[C]//2018 IEEE/RSJ International Conference on Intelligent Robots and Systems(IROS).IEEE,2018:5848-5854. [65]POGGI M,TOSI F,ALEOTTI F,et al.Real-Time Self-Supervised Monocular Depth Estimation Without GPU[J].IEEE Transactions on Intelligent Transportation Systems,2022,23(10):17342-17353. [66]CIPOLLETTA A,PELUSO V,CALIMERA A,et al.Energy-quality scalable monocular depth estimation on low-power cpus[J].IEEE Internet of Things Journal,2021,9(1):25-36. [67]LIU J,LI Q,CAO R,et al.MiniNet:An extremely lightweight convolutional neural network for real-time unsupervised mono-cular depth estimation[J].ISPRS Journal of Photogrammetry and Remote Sensing,2020,166:255-267. [68]ZHANG Z,LATHUILIERE S,RICCI E,et al.Online depthlearning against forgetting in monocular videos[C]//Procee-dings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2020:4494-4503. [69]HUANG B,NGUYEN A,WANG S,et al.Simultaneous Depth Estimation and Surgical Tool Segmentation in Laparoscopic Images[J].IEEE Transactions on Medical Robotics and Bionics,2022,4(2):335-338. [70]XU C,HUANG B,ELSON D S.Self-Supervised MonocularDepth Estimation with 3-D Displacement Module for Laparoscopic Images[J].IEEE Transactions on Medical Robotics and Bionics,2022,4(2):331-334. [71]SHIMADA T,NISHIKAWA H,KONG X,et al.Depth Estimation from Monocular Infrared Images for Autonomous Flight of Drones[C]//2022 International Conference on Electronics,Information,and Communication(ICEIC).IEEE,2022:1-6. [72]ZHANG Y,YU Q,LOW K H,et al.A Self-Supervised Monocular Depth Estimation Approach Based on UAV Aerial Images[C]//2022 IEEE/AIAA 41st Digital Avionics Systems Confe-rence(DASC).IEEE,2022:1-8. [73]SHIMHADA T,NISHIKAWA H,KONG X,et al.MonocularDepth Estimation with Optical Flow Attention for Autonomous Drones[C]//2022 19th International SoC Design Conference(ISOCC).IEEE,2022:197-198. [74]ZHANG J H,ZHANG J,LANG X Q.Depth Estimation and Clarification Method for Monocular Images in Rainy Scenes[J/OL].(2022-10-18)[2023-10-30].http://kns.cnki.net/kcms/detail/21.1106.tp.20221018.0917.002.html. [75]QI C,LV H,MU G,et al.Vehicle Depth Estimation Based on Monocular Camera in Extreme Weather[C]//2022 6th CAA International Conference on Vehicular Control and Intelligence(CVCI).IEEE,2022:1-6. [76]LI K,MA J,LI H,et al.Discern Depth Under Foul Weather:Estimate PM2.5 for Depth Inference[J].IEEE Transactions on Industrial Informatics,2019,16(6):3918-3927. [77]SONG L,LI R,JIAO Y,et al.Seedling Plant Height Measurement Method Based on Resnext Monocular Depth Estimation[J].Journal of Agricultural Engineering,2022,38(3):155-163. [78]ZHOU Y C,DENG H B,XU T Y,et al.Unsupervised Tomato Plant Image Depth Estimation Model Based on Dense Autoencoder[J].Journal of Agricultural Engineering,2020,36(11):182-192. [79]HUANG J,JIANG Z,GUI W,et al.Depth Estimation from aSingle Image of Blast Furnace Burden Surface Based on Edge Defocus Tracking[J].IEEE Transactions on Circuits and Systems for Video Technology,2022,32(9):6044-6057. [80]FU J,WANG W,CHANG L.Visual Aided Training of Billiards Based on Depth Estimation[C]//2022 IEEE Conference on Tele-communications,Optics and Computer Science(TOCS).IEEE,2022:397-403. [81]KANEDA K,OOBA T,SHIMADA H,et al.Estimation Method of Calorie Intake by Deep Learning Using Depth Images Obtained through a Single Camera Smartphone[C]//2021 IEEE 10th Global Conference on Consumer Electronics(GCCE).IEEE,2021:874-875. [82]WANG P,HUANG X,CHENG X,et al.The ApolloScape Open Dataset for Autonomous Driving and Its Application[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2019,42(10):2702-2719. [83]CORDTS M,OMRAN M,Ramos S,et al.The Cityscapes Dataset for Semantic Urban Scene Understanding[C]//IEEE.2016:3213-3223. [84]SILBERMAN N,FERGUS R.Indoor scene segmentation using a structured light sensor[C]//2011 IEEE International Confe-rence on Computer Vision Workshops(ICCV Workshops).IEEE,2011:601-608. [85]MCCORMAC J,HANDA A,LEUTENEGGER S,et al.Scene-Net RGB-D:Can 5M Synthetic Images Beat Generic Image Net Pre-training on Indoor Segmentation?[C]//2017 IEEE International Conference on Computer Vision(ICCV).IEEE,2017:2678- 2687. [86]ROS G,SELLART L,MATERZYNSKA J,et al.The SYNTHIA Dataset:A Large Collection of Synthetic Images for Semantic Segmentation of Urban Scenes[C]//Computer Vision & Pattern Recognition.IEEE,2016:3234-3243. |
[1] | CHEN Runhuan, DAI Hua, ZHENG Guineng, LI Hui , YANG Geng. Urban Electricity Load Forecasting Method Based on Discrepancy Compensation and Short-termSampling Contrastive Loss [J]. Computer Science, 2024, 51(4): 158-164. |
[2] | LIN Binwei, YU Zhiyong, HUANG Fangwan, GUO Xianwei. Data Completion and Prediction of Street Parking Spaces Based on Transformer [J]. Computer Science, 2024, 51(4): 165-173. |
[3] | SONG Hao, MAO Kuanmin, ZHU Zhou. Algorithm of Stereo Matching Based on GAANET [J]. Computer Science, 2024, 51(4): 229-235. |
[4] |
XUE Jinqiang, WU Qin.
Progressive Multi-stage Image Denoising Algorithm Combining Convolutional Neural Network and Multi-layer Perceptron [J]. Computer Science, 2024, 51(4): 243-253. |
[5] | HUANG Kun, SUN Weiwei. Traffic Speed Forecasting Algorithm Based on Missing Data [J]. Computer Science, 2024, 51(3): 72-80. |
[6] | ZHENG Cheng, SHI Jingwei, WEI Suhua, CHENG Jiaming. Dual Feature Adaptive Fusion Network Based on Dependency Type Pruning for Aspect-basedSentiment Analysis [J]. Computer Science, 2024, 51(3): 205-213. |
[7] | CHEN Jinyin, LI Xiao, JIN Haibo, CHEN Ruoxi, ZHENG Haibin, LI Hu. CheatKD:Knowledge Distillation Backdoor Attack Method Based on Poisoned Neuronal Assimilation [J]. Computer Science, 2024, 51(3): 351-359. |
[8] | HUANG Wenke, TENG Fei, WANG Zidan, FENG Li. Image Segmentation Based on Deep Learning:A Survey [J]. Computer Science, 2024, 51(2): 107-116. |
[9] | ZHANG Feng, HUANG Shixin, HUA Qiang, DONG Chunru. Novel Image Classification Model Based on Depth-wise Convolution Neural Network andVisual Transformer [J]. Computer Science, 2024, 51(2): 196-204. |
[10] | WANG Yangmin, HU Chengyu, YAN Xuesong, ZENG Deze. Study on Deep Reinforcement Learning for Energy-aware Virtual Machine Scheduling [J]. Computer Science, 2024, 51(2): 293-299. |
[11] | HUANG Changxi, ZHAO Chengxin, JIANG Xiaoteng, LING Hefei, LIU Hui. Screen-shooting Resilient DCT Domain Watermarking Method Based on Deep Learning [J]. Computer Science, 2024, 51(2): 343-351. |
[12] | CUI Zhenyu, ZHOU Jiahuan, PENG Yuxin. Survey on Cross-modality Object Re-identification Research [J]. Computer Science, 2024, 51(1): 13-25. |
[13] | GE Huibin, WANG Dexin, ZHENG Tao, ZHANG Ting, XIONG Deyi. Study on Model Migration of Natural Language Processing for Domestic Deep Learning Platform [J]. Computer Science, 2024, 51(1): 50-59. |
[14] | JING Yeyiran, YU Zeng, SHI Yunxiao, LI Tianrui. Review of Unsupervised Domain Adaptive Person Re-identification Based on Pseudo-labels [J]. Computer Science, 2024, 51(1): 72-83. |
[15] | JIN Yu, CHEN Hongmei, LUO Chuan. Interest Capturing Recommendation Based on Knowledge Graph [J]. Computer Science, 2024, 51(1): 133-142. |
|