Computer Science ›› 2025, Vol. 52 ›› Issue (12): 141-149. doi: 10.11896/jsjkx.250400075
曹明伟, 黄宝龙, 赵海峰
CAO Mingwei, HUANG Baolong, ZHAO Haifeng
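Abstract: Neural Radiance Fields (NeRFs) have become an important foundational method for view synthesis and rendering, thanks to their efficient scene modeling and representation capabilities. In dynamic environments, however, NeRF still struggles with complex illumination changes and interference from transient objects. Varying lighting conditions produce many inconsistent appearances within a scene, which degrades view synthesis quality, and dynamic distractors reduce the realism of the synthesized images. To address these problems, this paper proposes a neural radiance field based on appearance enhancement and semantic segmentation (Appearance Enhancement and Semantic Segmentation-based Neural Radiance Fields, AS-NeRF). The method combines conical frustum sampling with an integrated positional encoding mechanism to improve the efficiency of appearance feature fusion and to strengthen the model's adaptability to changes in illumination and camera parameters, thereby improving the color consistency and realism of the rendered results. In addition, an end-to-end lightweight segmentation network predicts a transient visibility mask, which separates dynamic objects and prevents transient elements from degrading the quality of the synthesized images. To validate the method, experiments were conducted on the Photo Tourism dataset, with qualitative and quantitative comparisons against several existing methods. The results show that the proposed method outperforms existing classical methods in synthesis accuracy, and they further confirm the accuracy of the segmentation mask in separating transient objects.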
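The two mechanisms named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes the conical frustum is approximated by a Gaussian with a diagonal covariance, for which the expected sine and cosine features have a well-known closed form, and it assumes the transient visibility mask is used as a per-pixel weight on a squared color error. The function names `integrated_pos_enc` and `masked_photometric_loss`, and the exact form of the loss, are hypothetical and chosen only for illustration.

```python
import math


def integrated_pos_enc(mean, var, num_freqs):
    """Expected sin/cos features of x ~ N(mean, diag(var)), per coordinate.

    Assumption: the conical frustum has already been approximated by a
    Gaussian with per-coordinate mean and variance.
    """
    feats = []
    for level in range(num_freqs):
        scale = 2.0 ** level
        for m, v in zip(mean, var):
            # For Gaussian x: E[sin(s*x)] = sin(s*m) * exp(-0.5 * s^2 * v),
            # and likewise for cos. High frequencies are damped when the
            # variance (the frustum footprint) is large.
            damp = math.exp(-0.5 * scale * scale * v)
            feats.append(math.sin(scale * m) * damp)
            feats.append(math.cos(scale * m) * damp)
    return feats


def masked_photometric_loss(pred, target, visibility):
    """Visibility-weighted squared color error (hypothetical loss form).

    pred, target: lists of per-pixel color tuples.
    visibility: per-pixel weights in [0, 1]; 0 marks a transient pixel.
    Returns the weighted sum of per-pixel squared channel errors divided
    by the total weight.
    """
    num = 0.0
    den = 0.0
    for p, t, w in zip(pred, target, visibility):
        num += w * sum((pc - tc) ** 2 for pc, tc in zip(p, t))
        den += w
    return num / max(den, 1e-8)


if __name__ == "__main__":
    # A sample with a large footprint loses its high-frequency features.
    print(integrated_pos_enc([0.3, -0.2, 1.0], [0.5, 0.5, 0.5], num_freqs=4))
    # The first pixel is masked out as transient, so its error is ignored.
    print(masked_photometric_loss([(1, 1, 1), (0, 0, 0)],
                                  [(0, 0, 0), (0, 0, 0)],
                                  [0.0, 1.0]))
```

Under these assumptions, the damping factor is what lets one encoding cover both fine and coarse samples, while the mask weight keeps pixels flagged as transient from contributing to the color error.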