计算机科学 ›› 2019, Vol. 46 ›› Issue (5): 260-265.doi: 10.11896/j.issn.1002-137X.2019.05.040

• 图形图像与模式识别 • 上一篇    下一篇

迁移学习模式下基于GoogLeNet网络的风电机组视觉检测

徐一鸣, 张娟, 刘成成, 顾菊平, 潘高超   

  1. (南通大学电气工程学院 江苏 南通226019)
  • 发布日期:2019-05-15
  • 作者简介:徐一鸣(1981-),男,博士,副教授,主要研究方向为数字图像处理及先进传感器技术;张 娟(1992-),女,硕士,主要研究方向为数字图像处理及机器视觉应用技术;刘成成(1995-),男,硕士,主要研究方向为数字图像处理及机器视觉应用技术;顾菊平(1971-),女,博士,教授,主要研究方向为微特电机及先进控制,E-mail:gu.jp@ntu.edu.cn(通信作者);潘高超(1994-),男,硕士,主要研究方向为数字图像处理及模式识别。
  • 基金资助:
    国家自然科学基金面上项目(61673226),南通市应用基础研究项目(GY12016018)资助。

Wind Turbine Visual Inspection Based on GoogLeNet Network in Transfer Learning Mode

XU Yi-ming, ZHANG Juan, LIU Cheng-cheng, GU Ju-ping, PAN Gao-chao   

  1. (School of Electrical Engineering,Nantong University,Nantong,Jiangsu 226019,China)
  • Published:2019-05-15

摘要: 针对无人机航拍环境下拍摄角度变换、特征不显著等干扰问题,提出一种改进的GoogLeNet卷积神经网络对风电机组进行识别和定位,无需人工预选取即可自动提取风电机组类别特征。通过GoogLeNet网络构造风电机组深度特征向量,在网络模型训练过程中引入迁移学习的概念,利用风电机组图像训练已预训练的GoogLeNet网络,在加快模型训练速度的同时,能避免分类网络陷入局部最优解。并在Faster RCNN框架下采用区域建议网络和多任务损失函数将候选区域搜索和边框回归融入到网络中,实现航拍图像中风电机组的自动分类和标注,缩短数据处理时间。实验结果表明,通过迁移学习的手段,利用优化的GoogLeNet网络能改善复杂航拍环境下的目标视觉检测准确率,完成风电机组自动定位任务,基于GoogLeNet的风电机组平均准确率达到了96%以上。

关键词: GoogLeNet模型, 风电机组, 卷积神经网络, 迁移学习, 深度学习, 视觉检测

Abstract: Aiming at the interference of shooting angle changes and insignificant features in the drone aerial photography environment,this paper proposed an improved GoogLeNet convolutional neural network to identify and locate the wind turbines,which can automatically extract wind turbine category features without manual pre-selection.The deep feature vectors of wind turbines are constructed through GoogLeNet network.In the network model training process,the concept of transfer learning is introduced and the pre-trained GoogLeNet network is trained by using wind turbine images.The classification network can be prevented from falling into the local optimal solution while speeding up the model training.The region proposal network and the multi-task loss function are used to integrate the candidate region search and border regression into the network in the Faster RCNN framework,so that the wind turbines in the aerial image can be automatically classified and annotated,and the time complexity can be reduced.Experimental results show that the optimized GoogLeNet network can improve the accuracy of target visual detection in the complex aerial photography environment and complete the task of wind turbine automatic positioning by means of transfer learning.The avera-ge accuracy of wind turbines based on GoogLeNet is over 96%.

Key words: Convolutional neural network, Deep learning, GoogLeNet model, Transfer learning, Visual inspection, Wind turbine

中图分类号: 

  • TP391
[1]ZHANG H,WANG K F,WANG F Y.Advances and perspec-tives on applications of deep learning in visual object detection[J].Acta Automatica Sinica,2017,43(8):1289-1305.(in Chinese)张慧,王坤峰,王飞跃.深度学习在目标视觉检测中的应用进展与展望[J].自动化学报,2017,43(8):1289-1305.
[2]WEI Y M.Research on aerial image location based on convolution neural network [J].Ship Electronic Engineering,2017,37(6):33-37.(in Chinese)魏湧明.基于卷积神经网络的航拍图像定位研究[J].舰船电子工程,2017,37(6):33-37.
[3]SUN Z Y,LU C X,SHI Z Z,et al.Research and advances on deep learning [J].Computer Science,2016,43(2):1-8.(in Chinese)孙志远,鲁成祥,史忠植,等.深度学习研究与进展[J].计算机科学,2016,43(2):1-8.
[4]GIRSHICK R,DONAHUE J,DARRELL T,et al.Rich feature hierarchies for accurate object detection and semantic segmentation[C]∥Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.USA:IEEE Press,2014:580-587.
[5]GIRSHICK R.Fast R-CNN[C]∥Proceedings of IEEE International Conference on Computer Vision.Chile:IEEE Press,2015:1440-1448.
[6]REN S Q,HE K M,GIRSHICK R,et al.Faster R-CNN:to-wards real-time object detection with region proposal networks [J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2017,39(6):1137-1149.
[7]SZEGEDY C,LIU W,JIA Y Q,et al.Going deeper with convolutions[C]∥Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.USA:IEEE Press,2015:1-9.
[8]LI Y D,HAO Z B,LEI H.Survey of convolutional neural network [J].Journal of Computer Applications,2016,36(9):2508-2515,2565.(in Chinese)李彦东,郝宗波,雷航.卷积神经网络研究综述[J].计算机应用,2016,36(9):2508-2515,2565.
[9]ZHOU J Y,ZHAO Y M.Application of convolution neural network in image classification and object detection [J].Computer Engineering and Applications,2017,53(13):34-41.(in Chinese)周俊宇,赵艳明.卷积神经网络在图像分类和目标检测应用综述[J].计算机工程与应用,2017,53(13):34-41.
[10]LI X D,YE M,LI T.Review of object detection based on convolutional neural networks [J].Application Research of Compu-ters,2017,34(10):2881-2886,2891.(in Chinese)李旭东,叶茂,李涛.基于卷积神经网络的目标检测研究综述[J].计算机应用研究,2017,34(10):2881-2886,2891.
[11]WANG Z M,CAO H J,FAN L.Method on human activity re-cognition based on convolutional neural networks [J].Computer Science,2016,43(11A):56-58,87.(in Chinese)王忠民,曹洪江,范琳.一种基于卷积神经网络深度学习的人体行为识别方法[J].计算机科学,2016,43(11A):56-58,87.
[12]FENG Y S,WANG Z L.Fine-grained image categorization with segmentation based on top-down attention map [J].Journal of Image and Graphics,2016,21(9):1147-1154.(in Chinese)冯语姗,王子磊.自上而下注意图分割的细粒度图像分类[J].中国图象图形学报,2016,21(9):1147-1154.
[13]DAI C K,LI Y.Aeroplane detection in static aerodrome based on Faster RCNN and multi-part model[J].Journal of Computer Applications,2017,37(S2):85-88.(in Chinese)戴陈卡,李毅.基于Faster RCNN以及多部件结合的机场场面静态飞机检测[J].计算机应用,2017,37(S2):85-88.
[14]PENG G,YANG S Q,HUANG X H,et al.Improved object detection method of micro-operating system based on regioncon-volutional neural network [J].Pattern Recognition and Artificial Intelligence,2018,31(2):142-149.(in Chinese)彭刚,杨诗琪,黄心汉,等.改进的基于区域卷积神经网络的微操作系统目标检测方法[J].模式识别与人工智能,2018,31(2):142-149.
[15]ZHUANG F Z,LUO P,HE Q,et al.Survey on transfer learning research[J].Journal of Software,2015,26(1):26-39.(in Chinese)庄福振,罗平,何清,等.迁移学习研究与进展[J].软件学报,2015,26(1):26-39.
[16]DAI W Y,YANG Q,XUE G R,et al.Boosting for transfer learning[C]∥Proceedings of International Conference on Machine Learning.USA:IEEE Press,2007:193-200.
[17]WU L N,HUANG Y P,ZHENG X.Noval transfer learning algorithm based on bag-of-visual words model[J].ComputerScien-ce,2014,41(12):260-263,274.(in Chinese)吴丽娜,黄雅平,郑翔.基于词带模型的迁移学习算法[J].计算机科学,2014,41(12):260-263,274.
[1] 周乐员, 张剑华, 袁甜甜, 陈胜勇.
多层注意力机制融合的序列到序列中国连续手语识别和翻译
Sequence-to-Sequence Chinese Continuous Sign Language Recognition and Translation with Multi- layer Attention Mechanism Fusion
计算机科学, 2022, 49(9): 155-161. https://doi.org/10.11896/jsjkx.210800026
[2] 徐涌鑫, 赵俊峰, 王亚沙, 谢冰, 杨恺.
时序知识图谱表示学习
Temporal Knowledge Graph Representation Learning
计算机科学, 2022, 49(9): 162-171. https://doi.org/10.11896/jsjkx.220500204
[3] 饶志双, 贾真, 张凡, 李天瑞.
基于Key-Value关联记忆网络的知识图谱问答方法
Key-Value Relational Memory Networks for Question Answering over Knowledge Graph
计算机科学, 2022, 49(9): 202-207. https://doi.org/10.11896/jsjkx.220300277
[4] 汤凌韬, 王迪, 张鲁飞, 刘盛云.
基于安全多方计算和差分隐私的联邦学习方案
Federated Learning Scheme Based on Secure Multi-party Computation and Differential Privacy
计算机科学, 2022, 49(9): 297-305. https://doi.org/10.11896/jsjkx.210800108
[5] 方义秋, 张震坤, 葛君伟.
基于自注意力机制和迁移学习的跨领域推荐算法
Cross-domain Recommendation Algorithm Based on Self-attention Mechanism and Transfer Learning
计算机科学, 2022, 49(8): 70-77. https://doi.org/10.11896/jsjkx.210600011
[6] 陈泳全, 姜瑛.
基于卷积神经网络的APP用户行为分析方法
Analysis Method of APP User Behavior Based on Convolutional Neural Network
计算机科学, 2022, 49(8): 78-85. https://doi.org/10.11896/jsjkx.210700121
[7] 朱承璋, 黄嘉儿, 肖亚龙, 王晗, 邹北骥.
基于注意力机制的医学影像深度哈希检索算法
Deep Hash Retrieval Algorithm for Medical Images Based on Attention Mechanism
计算机科学, 2022, 49(8): 113-119. https://doi.org/10.11896/jsjkx.210700153
[8] 孙奇, 吉根林, 张杰.
基于非局部注意力生成对抗网络的视频异常事件检测方法
Non-local Attention Based Generative Adversarial Network for Video Abnormal Event Detection
计算机科学, 2022, 49(8): 172-177. https://doi.org/10.11896/jsjkx.210600061
[9] 檀莹莹, 王俊丽, 张超波.
基于图卷积神经网络的文本分类方法研究综述
Review of Text Classification Methods Based on Graph Convolutional Network
计算机科学, 2022, 49(8): 205-216. https://doi.org/10.11896/jsjkx.210800064
[10] 李宗民, 张玉鹏, 刘玉杰, 李华.
基于可变形图卷积的点云表征学习
Deformable Graph Convolutional Networks Based Point Cloud Representation Learning
计算机科学, 2022, 49(8): 273-278. https://doi.org/10.11896/jsjkx.210900023
[11] 王剑, 彭雨琦, 赵宇斐, 杨健.
基于深度学习的社交网络舆情信息抽取方法综述
Survey of Social Network Public Opinion Information Extraction Based on Deep Learning
计算机科学, 2022, 49(8): 279-293. https://doi.org/10.11896/jsjkx.220300099
[12] 郝志荣, 陈龙, 黄嘉成.
面向文本分类的类别区分式通用对抗攻击方法
Class Discriminative Universal Adversarial Attack for Text Classification
计算机科学, 2022, 49(8): 323-329. https://doi.org/10.11896/jsjkx.220200077
[13] 姜梦函, 李邵梅, 郑洪浩, 张建朋.
基于改进位置编码的谣言检测模型
Rumor Detection Model Based on Improved Position Embedding
计算机科学, 2022, 49(8): 330-335. https://doi.org/10.11896/jsjkx.210600046
[14] 侯钰涛, 阿布都克力木·阿布力孜, 哈里旦木·阿布都克里木.
中文预训练模型研究进展
Advances in Chinese Pre-training Models
计算机科学, 2022, 49(7): 148-163. https://doi.org/10.11896/jsjkx.211200018
[15] 周慧, 施皓晨, 屠要峰, 黄圣君.
基于主动采样的深度鲁棒神经网络学习
Robust Deep Neural Network Learning Based on Active Sampling
计算机科学, 2022, 49(7): 164-169. https://doi.org/10.11896/jsjkx.210600044
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!