计算机科学 ›› 2019, Vol. 46 ›› Issue (8): 332-336.doi: 10.11896/j.issn.1002-137X.2019.08.055

• 图形图像与模式识别 • 上一篇    下一篇

基于改进YOLO v2的船舶目标检测方法

于洋, 李世杰, 陈亮, 刘韵婷   

  1. (沈阳理工大学自动化与电气工程学院 沈阳110159)
  • 收稿日期:2018-06-05 出版日期:2019-08-15 发布日期:2019-08-15
  • 通讯作者: 于洋(1963-),男,硕士,教授,主要研究方向为智能检测与控制、故障诊断与监控技术等,E-mail:781784300@qq.com
  • 作者简介:李世杰(1994-),女,硕士,主要研究方向为智能检测与信息处理、人工智能;陈亮(1979-),男,博士,主要研究方向为机器学习、嵌入式仪表与信息处理技术等;刘韵婷(1983-),女,博士,主要研究方向为人工神经网络、无线传感器网络与数据分析等
  • 基金资助:
    国家重点研发计划(2017YFC0821001),国家自然科学基金(61373089),辽宁省自然科学基金(201602652),辽宁省教育厅基本科研项目(LG201707)

Ship Target Detection Based on Improved YOLO v2

YU Yang, LI Shi-jie, CHEN Liang, LIU Yun-ting   

  1. (School of Automation and Electrical Engineering,Shenyang Ligong University,Shenyang 110159,China)
  • Received:2018-06-05 Online:2019-08-15 Published:2019-08-15

摘要: 针对船舶图像目标检测中存在的小目标检测准确率低、系统鲁棒性差的问题,提出一种改进的YOLO v2算法对船舶图像目标进行检测。通过目标框维度聚类、网络结构改进、输入图像多尺度变换等方法对传统YOLO v2算法进行改进,使其能够更好地适应船舶目标检测任务。测试结果表明,在输入图像尺寸为416×416时,该算法的平均精确率(mean Average Precision,mAP)达到79.1%,检测速度为64帧/s(Frames Per Second,FPS)。所提方法可满足实时检测的需要,且具有小目标检测精度高、鲁棒性强的特点。

关键词: 船舶目标检测, 改进YOLOv2, 卷积神经网络, 目标检测

Abstract: Aiming at the problem of low target detection accuracy and poor system robustness in ship image target detection,an improved YOLO v2 algorithm was proposed to detect ship image targets.The traditional YOLO v2 algorithm is improved by clustering the target frame dimension,optimizing the network structure,multi-scale transformation of input image,so as to better adapt to the ship target detection task.The test results show that the mean Average Precision (mAP)of the algorithm is 79.1% when the input image size is 416×416,and the detection speed is 64 frames per se-cond (FPS),which can satisfy the real-time detection and exhibit high precision and strong robustness for small target detection

Key words: Convolutional neural network, Improved YOLO v2, Ship target detection, Target detection

中图分类号: 

  • TP183
[1]ZHANG Y.Image Recognition of Ship Target Based on Cloud Computing [D].Guangzhou:South China University of Technology,2016.(in Chinese) 张羽.基于云计算的舰船目标图像识别[D].广州:华南理工大学,2016.
[2]ZHAO Y.A brief talk on the methods and techniques of ship target recognition[J].Ship Science and Technology,2016,38(1A):163-165.(in Chinese) 赵友.浅谈舰船目标识别的方法和技术[J].舰船科学技术,2016,38(1A):163-165.
[3]PIDGEON V W.Frequency Dependence of Radar Ducting[J].Radio Science,2016,5(3):541-549.
[4]SHEN G N.Research on ship target recognition technology [D].Harbin:Harbin Engineering University,2012.(in Chinese) 沈广楠.舰船目标识别技术研究[D].哈尔滨:哈尔滨工程大学,2012.
[5]ZHANG S,YAN Y Y,LI Y F.Moving target detection based on Video Fusion of thermal infrared and visible light[J].Computer Science,2015,42(8):86-89.(in Chinese) 张笙,严云洋,李郁峰.热红外与可见光视频融合的运动目标检测[J].计算机科学,2015,42(8):86-89.
[6]ZHANG X,DONG G,XIONG B,et al.Refined segmentation of ship target in SAR images based on GVF snake with elliptical constraint[J].Remote Sensing Letters,2017,8(8):791-800.
[7]YU J Y,HUANG D,WANG L Y,et al.A real-time on-board ship targets detection method for optical remote sensing satellite[C]∥International Conference on Signal Processing.New York:IEEE Press,2017:204-208.
[8]ZHU R.Application of Computer Identification and Location Algorithm in Small far Infrared Target Recognition of Ship Under Surge Interference[J].Polish Maritime Research,2017,24(S1):171-181.
[9]LECUN Y,BENGIO Y,HINTON G.Deep learning[J].Nature,2015,521(7553):436.
[10]GIRSHICK R.Fast R-CNN[C]∥IEEE International Conference on Computer Vision.New York:IEEE Computer Society,2015:1440-1448.
[11]REN S,HE K,GIRSHICK R,et al.Faster R-CNN:towards real-time object detection with region proposal networks[C]∥International Conference on Neural Information Processing Systems.Massachusetts:MIT Press,2015:91-99.
[12]REDMON J,DIVVALA S,GIRSHICK R,et al.You Only Look Once:Unified,Real-Time Object Detection[C]∥Computer Vision and Pattern Recognition.NewYork:IEEE Press,2016:779-788.
[13]LIU W,ANGUELOV D,ERHAN D,et al.SSD:Single Shot MultiBox Detector[C]∥European Conference on Computer Vision.Springer:Cham,2016:21-37.
[14]REDMON J,FARHADI A.YOLO9000:Better,Faster,Stronger[C]∥IEEE Conference on Computer Vision and Pattern Recognition.NewYork:IEEE Computer Society,2017:6517-6525.
[15]SIMONYAN K,ZISSERMAN A.Very Deep Convolutional Networks for Large-Scale Image Recognition[J].Computer Scien-ce,2014.https://arxiv.org/abs/1409.1556.
[16]SZEGEDY C,LIU W,JIA Y,et al.Going deeper with convolutions[J/OL].https://arxiv.org/abs/1409.4842.
[17]LIN M,CHEN Q,YAN S.Network In Network[J/OL].Computer Science.https://arxiv.org/abs/1312.4400.
[18]KRIZHEVSKY A,SUTSKEVER I,HINTON G E.ImageNet classification with deep convolutional neural networks[C]∥International Conference on Neural Information Processing Systems.NewYork:ACM,2012:1097-1105.
[19]ZHOU W B,SHI Y X.Optimization algorithm of K-means clustering center of selection based on density[J].Application Research of Computers,2012,29(5):1726-1728.
[20]HE K,ZHANG X,REN S,et al.Deep residual learning for ima- ge recognition[M]∥Computer Vision and Pattern Recognition.New York:IEEE,2016:770-778.
[1] 周乐员, 张剑华, 袁甜甜, 陈胜勇.
多层注意力机制融合的序列到序列中国连续手语识别和翻译
Sequence-to-Sequence Chinese Continuous Sign Language Recognition and Translation with Multi- layer Attention Mechanism Fusion
计算机科学, 2022, 49(9): 155-161. https://doi.org/10.11896/jsjkx.210800026
[2] 李宗民, 张玉鹏, 刘玉杰, 李华.
基于可变形图卷积的点云表征学习
Deformable Graph Convolutional Networks Based Point Cloud Representation Learning
计算机科学, 2022, 49(8): 273-278. https://doi.org/10.11896/jsjkx.210900023
[3] 陈泳全, 姜瑛.
基于卷积神经网络的APP用户行为分析方法
Analysis Method of APP User Behavior Based on Convolutional Neural Network
计算机科学, 2022, 49(8): 78-85. https://doi.org/10.11896/jsjkx.210700121
[4] 朱承璋, 黄嘉儿, 肖亚龙, 王晗, 邹北骥.
基于注意力机制的医学影像深度哈希检索算法
Deep Hash Retrieval Algorithm for Medical Images Based on Attention Mechanism
计算机科学, 2022, 49(8): 113-119. https://doi.org/10.11896/jsjkx.210700153
[5] 刘冬梅, 徐洋, 吴泽彬, 刘倩, 宋斌, 韦志辉.
基于边框距离度量的增量目标检测方法
Incremental Object Detection Method Based on Border Distance Measurement
计算机科学, 2022, 49(8): 136-142. https://doi.org/10.11896/jsjkx.220100132
[6] 王灿, 刘永坚, 解庆, 马艳春.
基于软标签和样本权重优化的Anchor Free目标检测算法
Anchor Free Object Detection Algorithm Based on Soft Label and Sample Weight Optimization
计算机科学, 2022, 49(8): 157-164. https://doi.org/10.11896/jsjkx.210600240
[7] 檀莹莹, 王俊丽, 张超波.
基于图卷积神经网络的文本分类方法研究综述
Review of Text Classification Methods Based on Graph Convolutional Network
计算机科学, 2022, 49(8): 205-216. https://doi.org/10.11896/jsjkx.210800064
[8] 金方焱, 王秀利.
融合RACNN和BiLSTM的金融领域事件隐式因果关系抽取
Implicit Causality Extraction of Financial Events Integrating RACNN and BiLSTM
计算机科学, 2022, 49(7): 179-186. https://doi.org/10.11896/jsjkx.210500190
[9] 张颖涛, 张杰, 张睿, 张文强.
全局信息引导的真实图像风格迁移
Photorealistic Style Transfer Guided by Global Information
计算机科学, 2022, 49(7): 100-105. https://doi.org/10.11896/jsjkx.210600036
[10] 戴朝霞, 李锦欣, 张向东, 徐旭, 梅林, 张亮.
基于DNGAN的磁共振图像超分辨率重建算法
Super-resolution Reconstruction of MRI Based on DNGAN
计算机科学, 2022, 49(7): 113-119. https://doi.org/10.11896/jsjkx.210600105
[11] 刘月红, 牛少华, 神显豪.
基于卷积神经网络的虚拟现实视频帧内预测编码
Virtual Reality Video Intraframe Prediction Coding Based on Convolutional Neural Network
计算机科学, 2022, 49(7): 127-131. https://doi.org/10.11896/jsjkx.211100179
[12] 徐鸣珂, 张帆.
Head Fusion:一种提高语音情绪识别的准确性和鲁棒性的方法
Head Fusion:A Method to Improve Accuracy and Robustness of Speech Emotion Recognition
计算机科学, 2022, 49(7): 132-141. https://doi.org/10.11896/jsjkx.210100085
[13] 孙福权, 崔志清, 邹彭, 张琨.
基于多尺度特征的脑肿瘤分割算法
Brain Tumor Segmentation Algorithm Based on Multi-scale Features
计算机科学, 2022, 49(6A): 12-16. https://doi.org/10.11896/jsjkx.210700217
[14] 吴子斌, 闫巧.
基于动量的映射式梯度下降算法
Projected Gradient Descent Algorithm with Momentum
计算机科学, 2022, 49(6A): 178-183. https://doi.org/10.11896/jsjkx.210500039
[15] 杨涵, 万游, 蔡洁萱, 方铭宇, 吴卓超, 金扬, 钱伟行.
基于步态分类辅助的虚拟IMU的行人导航方法
Pedestrian Navigation Method Based on Virtual Inertial Measurement Unit Assisted by GaitClassification
计算机科学, 2022, 49(6A): 759-763. https://doi.org/10.11896/jsjkx.211200148
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!