计算机科学 ›› 2021, Vol. 48 ›› Issue (6A): 107-112.doi: 10.11896/jsjkx.201100116

• 图像处理&多媒体技术 • 上一篇    下一篇

基于眼动点视觉先验与边缘优化的显著性检测

刘翔宇1, 蹇木伟1, 鲁祥伟1, 何为凯2, 李晓峰3, 尹义龙3   

  1. 1 山东财经大学计算机科学与技术学院 济南250014
    2 山东交通学院航空学院 济南250357
    3 山东建筑大学计算机科学与技术学院 济南250000
  • 出版日期:2021-06-10 发布日期:2021-06-17
  • 通讯作者: 蹇木伟(jianmuweihk@163.com)
  • 作者简介:997865855@qq.com
  • 基金资助:
    国家自然科学基金(61976123);山东省泰山青年学者计划;山东省基础研究重点发展计划(ZR2020ZD44)

Saliency Detection Based on Eye Fixation Prediction and Boundary Optimization

LIU Xiang-yu1, JIAN Mu-wei1, LU Xiang-wei1, HE Wei-kai2, LI Xiao-feng3, YIN Yi-long3   

  1. 1 School of Computer Science and Technology,Shandong University of Finance and Economics,Jinan 250014,China
    2 School of Aeronautics,Shandong Jiaotong University,Jinan 250357,China
    3 School of Computer Science and Technology,Shandong Jianzhu University,Jinan 250000,China
  • Online:2021-06-10 Published:2021-06-17
  • About author:LIU Xiang-yu,master.His main researchinterests include image processing and visual saliency detection.
    JIAN Mu-wei,professor,doctoral supervisor,CCF computer vision committee,CCF multimedia committee,CCF machine learning and pattern recognition communications committee,etc.His main research interests include image processing,pattern recognition,multimedia computing,and so on.
  • Supported by:
    National Natural Science Foundation of China(61976123),Taishan Young Scholars Program of Shandong Province and Key Development Program for Basic Research of Shandong Province(ZR2020ZD44).

摘要: 图像显著性检测是计算机视觉中的基础研究课题之一。当前基于深度学习的方法虽然能够有效提高显著性检测结果的准确性,但是在显著性目标的物体边缘细节提取方面还不能令人满意。为此,提出了一种基于眼动点预测先验的边缘细化网络用于显著性目标提取。首先,对输入图像进行眼动点预测,将生成的特征图像作为后续显著性检测的视觉先验;其次,利用多注意力机制VGG16网络进行显著性目标特征提取;最后,对特征图像进行质量优化处理,进一步提升图像显著图的质量。实验结果表明,在3个公开数据集(DUTS,ECSSD,HKU-IS)上,所提方法与其他6个主流方法相比,取得了更好的显著性检测效果。

关键词: 边缘优化, 显著性检测, 眼动点预测, 注意力机制

Abstract: Saliency detection is one of the most fundamental challenges in computer vision.Although the rapid development of deep learning has greatly improved the accuracy of saliency-detection results,the extraction of details of salient object is still unsatisfactory.Therefore,this paper proposes an edge refinement network based on eye-fixation prediction priori for salient object detection.Firstly,eye-fixation extraction is carried out on the original image and the extracted feature image is used as the visual priori of subsequent saliency detection.Secondly,the multi-attention mechanism of VGG16 network is used for feature extraction,and finally the feature image is refined to improve the quality of the saliency image.Experimental results show that,compared with other 6 state-of-the-art methods,the proposed method achieves better results in 3 open-accessed data sets(i.e.DUTS,ECSSD,HKU-IS).

Key words: Attention mechanism, Edge refinement, Eye-fixation prediction, Saliency detection

中图分类号: 

  • TP391
[1] LAURENT I,KOCH C,NIEBUR E.A model of saliency-based visual attention for rapid scene analysis[J].IEEE Transactions on Pattern Analysis & Machine Intelligence,2002,20(11):1254-1259.
[2] LE MEUR O,LE CALLET P,BARBA D,et al.A coherentcomputational approach to model bottom-up visual attention[J].IEEE TPAMI,2006,28(5):802-817.
[3] MATHE S,SMINCHISESCU C.Dynamic eye movement datasets and learnt saliency models for visual action recognition[C]//ECCV.2012.
[4] MATHE S,SMINCHISESCU C.Action from still image dataset and inverse optimal control to learn task specific visual scanpaths[C]//NIPS.2013.
[5] MATHE S,SMINCHISESCU C.Actions in the eye:Dynamic gaze datasets and learnt saliency models for visual recognition[J].IEEE TPAMI,2015,37(7):1408-1424.
[6] ZHANG L,TONG M H,MARKS T K,et al.SUN:A bayesian framework for saliency using natural statistics[J].Journal of Vision,2008,8(7):32-32.
[7] GAO D,VASCONCELOS N.Discriminant saliency for visualrecognition from cluttered scenes[C]//NIPS.2005.
[8] BRUCE N,TSOTSOS J.Saliency based on information maximization[C]//NIPS.2006.
[9] HAREL J,KOCH C,PERONA P.Graph-based visual saliency[C]//NIPS.2007.
[10] HOU X,ZHANG L.Saliency detection:A spectral residual approach[C]//CVPR.2007.
[11] JUDD T,EHINGER K,DURAND F,et al.Learning to predict where humans look[C]//ICCV.2009.
[12] KRUTHIVENTI S,GUDISA V,DHOLAKIYA J,et al.Saliency Unified:A Deep Architecture for simultaneous Eye Fixation Prediction and Salient Object Segmentation[C]//Computer Vision & Pattern Recognition.IEEE,2016.
[13] VIG E,DORR M,COX D.Large-scale optimization of hierarchical features for saliency prediction in natural images[C]//CVPR.2014.
[14] ZHOU Z,ZHANG L.AFSnet:Fixation Prediction in MovieScenes with Auxiliary Facial Saliency[C]//International Conference on Brain Inspired Cognitive Systems.Cham:Springer,2018.
[15] VOLODYMYR M,HEESS N,GRAVES A.Recurrent models of visual attention[C]//Advances in Neural Information Processing Systems.2014.
[16] ZHANG X.Progressive Attention Guided Recurrent Networkfor Salient Object Detection[C]//IEEE/CVF Conference on Computer Vision and Pattern Recognition.IEEE,2018.
[17] LIU N,HAN J,YANG M H.PiCANet:Learning Pixel-WiseContextual Attention for Saliency Detection[C]//IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).IEEE,2018.
[18] WANG W,SHEN J.Deep Visual Attention Prediction[J].IEEE Transactions onImage Processing,2017(99):1-1.
[19] CORNIA M.Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model[J].IEEE Transactions on Image Processing,2016(27):5142-5154.
[20] DANDAN Z.MAFL:Multi-scale Adversarial Feature Learning for Saliency Detection[C]//2018 International Conference.2018.
[21] LINA W.Deep Group-Wise Fully Convolutional Network forCo-Saliency Detection With Graph Propagation[J].IEEE Transactions on Image Processing,2019(99).
[22] HECHT-NIELSEN R.Theory of the backpropagation neuralnetwork[J].Neural Networks,1988,1(Supplement-1):445-448.
[23] ZAKARIA L,KANNALA J.Context Aware Query Image Representation for Particular Object Retrieval[C]//Scandinavian Conference on Image Analsysi.Cham:Springer,2017:88-99.
[24] HU J,SHEN L,SUN G.Squeeze-and-Excitation Networks[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:7132-7141.
[25] WANG L,LU H,WANG Y,et al.Learning to detect salient objects with image-level supervision[C]//Proc.IEEE Conf.Comput.Vis.Pattern Recognit.(CVPR).2017:136-145.
[26] YAN Q,XU L,SHI J,et al.Hierarchical saliency detection[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2013:1155-1162.
[27] LI G,YU Y.Visual saliency based on multiscale deep features[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2015:5455-5463.
[28] ZHANG P,WANG D,LU H,et al.Amulet:Aggregating multi-level convolutional features for salient object detection[C]//IEEE International Conference on Computer Vision(ICCV 2017).Venice,Italy,2017:202-211.
[29] LI G,YU Y.Visual saliency detection based on multiscale deep cnn features[J].IEEE Transactions on Image Processing,2016,25(11):5012-5024.
[30] LUO Z,MISHRA A,ACHKAR A,et al.Non-local deep features for salient object detection[C]//2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).IEEE,2017:6593-6601.
[31] ZHANG P,WANG D,LU H,et al.Learning uncertain convolu-tional features for accurate saliency detection[C]//IEEE Inter-national Conference on Computer Vision (ICCV 2017).Venice,
Italy,2017:212-221.
[32] LI G,YU Y.Deep contrast learning for salient object detection[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2016:478-487.
[33] LIU N,HAN J.Dhsnet:Deep hierarchical saliency network for salient object detection[C]//Proceedings of the IEEE Confe-rence on Computer Vision and Pattern Recognition.2016:678-686.
[34] SIMONYAN K,ZISSERMAN A.Very deep convolutionalnetworks for large-scale image recognition[J].arXiv:1409.1556,2014.
[35] RONNEBERGER O,FISCHER P,BROX T.Unet Convolutional networks for biomedical image segmentation[C]//International Conference on Medical Image Computingand Computer-assisted Intervention.Springer,2015:234-241.
[36] ZHAO T,WU X.Pyramid feature attention network for saliency detection[C]//IEEE Conf.Comput.Vis.Pattern Recog..2019:3085-3094.
[37] QIN X.BASNet:Boundary-Aware Salient Object Detection[C]//2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).IEEE,2019.
[38] WANG W.Salient Object Detection Driven by Fixation Prediction[C]//IEEE CVPR.IEEE,2018.
[1] 饶志双, 贾真, 张凡, 李天瑞.
基于Key-Value关联记忆网络的知识图谱问答方法
Key-Value Relational Memory Networks for Question Answering over Knowledge Graph
计算机科学, 2022, 49(9): 202-207. https://doi.org/10.11896/jsjkx.220300277
[2] 周芳泉, 成卫青.
基于全局增强图神经网络的序列推荐
Sequence Recommendation Based on Global Enhanced Graph Neural Network
计算机科学, 2022, 49(9): 55-63. https://doi.org/10.11896/jsjkx.210700085
[3] 戴禹, 许林峰.
基于文本行匹配的跨图文本阅读方法
Cross-image Text Reading Method Based on Text Line Matching
计算机科学, 2022, 49(9): 139-145. https://doi.org/10.11896/jsjkx.220600032
[4] 周乐员, 张剑华, 袁甜甜, 陈胜勇.
多层注意力机制融合的序列到序列中国连续手语识别和翻译
Sequence-to-Sequence Chinese Continuous Sign Language Recognition and Translation with Multi- layer Attention Mechanism Fusion
计算机科学, 2022, 49(9): 155-161. https://doi.org/10.11896/jsjkx.210800026
[5] 熊丽琴, 曹雷, 赖俊, 陈希亮.
基于值分解的多智能体深度强化学习综述
Overview of Multi-agent Deep Reinforcement Learning Based on Value Factorization
计算机科学, 2022, 49(9): 172-182. https://doi.org/10.11896/jsjkx.210800112
[6] 姜梦函, 李邵梅, 郑洪浩, 张建朋.
基于改进位置编码的谣言检测模型
Rumor Detection Model Based on Improved Position Embedding
计算机科学, 2022, 49(8): 330-335. https://doi.org/10.11896/jsjkx.210600046
[7] 朱承璋, 黄嘉儿, 肖亚龙, 王晗, 邹北骥.
基于注意力机制的医学影像深度哈希检索算法
Deep Hash Retrieval Algorithm for Medical Images Based on Attention Mechanism
计算机科学, 2022, 49(8): 113-119. https://doi.org/10.11896/jsjkx.210700153
[8] 孙奇, 吉根林, 张杰.
基于非局部注意力生成对抗网络的视频异常事件检测方法
Non-local Attention Based Generative Adversarial Network for Video Abnormal Event Detection
计算机科学, 2022, 49(8): 172-177. https://doi.org/10.11896/jsjkx.210600061
[9] 闫佳丹, 贾彩燕.
基于双图神经网络信息融合的文本分类方法
Text Classification Method Based on Information Fusion of Dual-graph Neural Network
计算机科学, 2022, 49(8): 230-236. https://doi.org/10.11896/jsjkx.210600042
[10] 汪鸣, 彭舰, 黄飞虎.
基于多时间尺度时空图网络的交通流量预测模型
Multi-time Scale Spatial-Temporal Graph Neural Network for Traffic Flow Prediction
计算机科学, 2022, 49(8): 40-48. https://doi.org/10.11896/jsjkx.220100188
[11] 金方焱, 王秀利.
融合RACNN和BiLSTM的金融领域事件隐式因果关系抽取
Implicit Causality Extraction of Financial Events Integrating RACNN and BiLSTM
计算机科学, 2022, 49(7): 179-186. https://doi.org/10.11896/jsjkx.210500190
[12] 熊罗庚, 郑尚, 邹海涛, 于化龙, 高尚.
融合双向门控循环单元和注意力机制的软件自承认技术债识别方法
Software Self-admitted Technical Debt Identification with Bidirectional Gate Recurrent Unit and Attention Mechanism
计算机科学, 2022, 49(7): 212-219. https://doi.org/10.11896/jsjkx.210500075
[13] 彭双, 伍江江, 陈浩, 杜春, 李军.
基于注意力神经网络的对地观测卫星星上自主任务规划方法
Satellite Onboard Observation Task Planning Based on Attention Neural Network
计算机科学, 2022, 49(7): 242-247. https://doi.org/10.11896/jsjkx.210500093
[14] 张颖涛, 张杰, 张睿, 张文强.
全局信息引导的真实图像风格迁移
Photorealistic Style Transfer Guided by Global Information
计算机科学, 2022, 49(7): 100-105. https://doi.org/10.11896/jsjkx.210600036
[15] 曾志贤, 曹建军, 翁年凤, 蒋国权, 徐滨.
基于注意力机制的细粒度语义关联视频-文本跨模态实体分辨
Fine-grained Semantic Association Video-Text Cross-modal Entity Resolution Based on Attention Mechanism
计算机科学, 2022, 49(7): 106-112. https://doi.org/10.11896/jsjkx.210500224
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!