计算机科学 ›› 2019, Vol. 46 ›› Issue (8): 332-336.doi: 10.11896/j.issn.1002-137X.2019.08.055
于洋, 李世杰, 陈亮, 刘韵婷
YU Yang, LI Shi-jie, CHEN Liang, LIU Yun-ting
摘要: 针对船舶图像目标检测中存在的小目标检测准确率低、系统鲁棒性差的问题,提出一种改进的YOLO v2算法对船舶图像目标进行检测。通过目标框维度聚类、网络结构改进、输入图像多尺度变换等方法对传统YOLO v2算法进行改进,使其能够更好地适应船舶目标检测任务。测试结果表明,在输入图像尺寸为416×416时,该算法的平均精确率(mean Average Precision,mAP)达到79.1%,检测速度为64帧/s(Frames Per Second,FPS)。所提方法可满足实时检测的需要,且具有小目标检测精度高、鲁棒性强的特点。
中图分类号:
[1]ZHANG Y.Image Recognition of Ship Target Based on Cloud Computing [D].Guangzhou:South China University of Technology,2016.(in Chinese) 张羽.基于云计算的舰船目标图像识别[D].广州:华南理工大学,2016. [2]ZHAO Y.A brief talk on the methods and techniques of ship target recognition[J].Ship Science and Technology,2016,38(1A):163-165.(in Chinese) 赵友.浅谈舰船目标识别的方法和技术[J].舰船科学技术,2016,38(1A):163-165. [3]PIDGEON V W.Frequency Dependence of Radar Ducting[J].Radio Science,2016,5(3):541-549. [4]SHEN G N.Research on ship target recognition technology [D].Harbin:Harbin Engineering University,2012.(in Chinese) 沈广楠.舰船目标识别技术研究[D].哈尔滨:哈尔滨工程大学,2012. [5]ZHANG S,YAN Y Y,LI Y F.Moving target detection based on Video Fusion of thermal infrared and visible light[J].Computer Science,2015,42(8):86-89.(in Chinese) 张笙,严云洋,李郁峰.热红外与可见光视频融合的运动目标检测[J].计算机科学,2015,42(8):86-89. [6]ZHANG X,DONG G,XIONG B,et al.Refined segmentation of ship target in SAR images based on GVF snake with elliptical constraint[J].Remote Sensing Letters,2017,8(8):791-800. [7]YU J Y,HUANG D,WANG L Y,et al.A real-time on-board ship targets detection method for optical remote sensing satellite[C]∥International Conference on Signal Processing.New York:IEEE Press,2017:204-208. [8]ZHU R.Application of Computer Identification and Location Algorithm in Small far Infrared Target Recognition of Ship Under Surge Interference[J].Polish Maritime Research,2017,24(S1):171-181. [9]LECUN Y,BENGIO Y,HINTON G.Deep learning[J].Nature,2015,521(7553):436. [10]GIRSHICK R.Fast R-CNN[C]∥IEEE International Conference on Computer Vision.New York:IEEE Computer Society,2015:1440-1448. [11]REN S,HE K,GIRSHICK R,et al.Faster R-CNN:towards real-time object detection with region proposal networks[C]∥International Conference on Neural Information Processing Systems.Massachusetts:MIT Press,2015:91-99. [12]REDMON J,DIVVALA S,GIRSHICK R,et al.You Only Look Once:Unified,Real-Time Object Detection[C]∥Computer Vision and Pattern Recognition.NewYork:IEEE Press,2016:779-788. [13]LIU W,ANGUELOV D,ERHAN D,et al.SSD:Single Shot MultiBox Detector[C]∥European Conference on Computer Vision.Springer:Cham,2016:21-37. [14]REDMON J,FARHADI A.YOLO9000:Better,Faster,Stronger[C]∥IEEE Conference on Computer Vision and Pattern Recognition.NewYork:IEEE Computer Society,2017:6517-6525. [15]SIMONYAN K,ZISSERMAN A.Very Deep Convolutional Networks for Large-Scale Image Recognition[J].Computer Scien-ce,2014.https://arxiv.org/abs/1409.1556. [16]SZEGEDY C,LIU W,JIA Y,et al.Going deeper with convolutions[J/OL].https://arxiv.org/abs/1409.4842. [17]LIN M,CHEN Q,YAN S.Network In Network[J/OL].Computer Science.https://arxiv.org/abs/1312.4400. [18]KRIZHEVSKY A,SUTSKEVER I,HINTON G E.ImageNet classification with deep convolutional neural networks[C]∥International Conference on Neural Information Processing Systems.NewYork:ACM,2012:1097-1105. [19]ZHOU W B,SHI Y X.Optimization algorithm of K-means clustering center of selection based on density[J].Application Research of Computers,2012,29(5):1726-1728. [20]HE K,ZHANG X,REN S,et al.Deep residual learning for ima- ge recognition[M]∥Computer Vision and Pattern Recognition.New York:IEEE,2016:770-778. |
[1] | 周乐员, 张剑华, 袁甜甜, 陈胜勇. 多层注意力机制融合的序列到序列中国连续手语识别和翻译 Sequence-to-Sequence Chinese Continuous Sign Language Recognition and Translation with Multi- layer Attention Mechanism Fusion 计算机科学, 2022, 49(9): 155-161. https://doi.org/10.11896/jsjkx.210800026 |
[2] | 李宗民, 张玉鹏, 刘玉杰, 李华. 基于可变形图卷积的点云表征学习 Deformable Graph Convolutional Networks Based Point Cloud Representation Learning 计算机科学, 2022, 49(8): 273-278. https://doi.org/10.11896/jsjkx.210900023 |
[3] | 陈泳全, 姜瑛. 基于卷积神经网络的APP用户行为分析方法 Analysis Method of APP User Behavior Based on Convolutional Neural Network 计算机科学, 2022, 49(8): 78-85. https://doi.org/10.11896/jsjkx.210700121 |
[4] | 朱承璋, 黄嘉儿, 肖亚龙, 王晗, 邹北骥. 基于注意力机制的医学影像深度哈希检索算法 Deep Hash Retrieval Algorithm for Medical Images Based on Attention Mechanism 计算机科学, 2022, 49(8): 113-119. https://doi.org/10.11896/jsjkx.210700153 |
[5] | 刘冬梅, 徐洋, 吴泽彬, 刘倩, 宋斌, 韦志辉. 基于边框距离度量的增量目标检测方法 Incremental Object Detection Method Based on Border Distance Measurement 计算机科学, 2022, 49(8): 136-142. https://doi.org/10.11896/jsjkx.220100132 |
[6] | 王灿, 刘永坚, 解庆, 马艳春. 基于软标签和样本权重优化的Anchor Free目标检测算法 Anchor Free Object Detection Algorithm Based on Soft Label and Sample Weight Optimization 计算机科学, 2022, 49(8): 157-164. https://doi.org/10.11896/jsjkx.210600240 |
[7] | 檀莹莹, 王俊丽, 张超波. 基于图卷积神经网络的文本分类方法研究综述 Review of Text Classification Methods Based on Graph Convolutional Network 计算机科学, 2022, 49(8): 205-216. https://doi.org/10.11896/jsjkx.210800064 |
[8] | 金方焱, 王秀利. 融合RACNN和BiLSTM的金融领域事件隐式因果关系抽取 Implicit Causality Extraction of Financial Events Integrating RACNN and BiLSTM 计算机科学, 2022, 49(7): 179-186. https://doi.org/10.11896/jsjkx.210500190 |
[9] | 张颖涛, 张杰, 张睿, 张文强. 全局信息引导的真实图像风格迁移 Photorealistic Style Transfer Guided by Global Information 计算机科学, 2022, 49(7): 100-105. https://doi.org/10.11896/jsjkx.210600036 |
[10] | 戴朝霞, 李锦欣, 张向东, 徐旭, 梅林, 张亮. 基于DNGAN的磁共振图像超分辨率重建算法 Super-resolution Reconstruction of MRI Based on DNGAN 计算机科学, 2022, 49(7): 113-119. https://doi.org/10.11896/jsjkx.210600105 |
[11] | 刘月红, 牛少华, 神显豪. 基于卷积神经网络的虚拟现实视频帧内预测编码 Virtual Reality Video Intraframe Prediction Coding Based on Convolutional Neural Network 计算机科学, 2022, 49(7): 127-131. https://doi.org/10.11896/jsjkx.211100179 |
[12] | 徐鸣珂, 张帆. Head Fusion:一种提高语音情绪识别的准确性和鲁棒性的方法 Head Fusion:A Method to Improve Accuracy and Robustness of Speech Emotion Recognition 计算机科学, 2022, 49(7): 132-141. https://doi.org/10.11896/jsjkx.210100085 |
[13] | 孙福权, 崔志清, 邹彭, 张琨. 基于多尺度特征的脑肿瘤分割算法 Brain Tumor Segmentation Algorithm Based on Multi-scale Features 计算机科学, 2022, 49(6A): 12-16. https://doi.org/10.11896/jsjkx.210700217 |
[14] | 吴子斌, 闫巧. 基于动量的映射式梯度下降算法 Projected Gradient Descent Algorithm with Momentum 计算机科学, 2022, 49(6A): 178-183. https://doi.org/10.11896/jsjkx.210500039 |
[15] | 杨涵, 万游, 蔡洁萱, 方铭宇, 吴卓超, 金扬, 钱伟行. 基于步态分类辅助的虚拟IMU的行人导航方法 Pedestrian Navigation Method Based on Virtual Inertial Measurement Unit Assisted by GaitClassification 计算机科学, 2022, 49(6A): 759-763. https://doi.org/10.11896/jsjkx.211200148 |
|