Computer Science, 2019, Vol. 46, Issue (5): 260-265. doi: 10.11896/j.issn.1002-137X.2019.05.040
徐一鸣, 张娟, 刘成成, 顾菊平, 潘高超
XU Yi-ming, ZHANG Juan, LIU Cheng-cheng, GU Ju-ping, PAN Gao-chao
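Abstract: To address interference such as changing shooting angles and indistinct features in UAV aerial photography, an improved GoogLeNet convolutional neural network is proposed to recognize and localize wind turbines; it extracts wind turbine class features automatically, without manual pre-selection. Deep feature vectors of wind turbines are constructed with the GoogLeNet network, and transfer learning is introduced during model training: an already pre-trained GoogLeNet network is trained further on wind turbine images, which speeds up model training and helps keep the classification network from becoming trapped in a local optimum. Within the Faster RCNN framework, a region proposal network and a multi-task loss function integrate candidate region search and bounding-box regression into the network, so that wind turbines in aerial images are classified and annotated automatically and data processing time is shortened. Experimental results show that, with transfer learning, the optimized GoogLeNet network improves visual object detection accuracy in complex aerial environments and completes the automatic wind turbine localization task; the average accuracy for wind turbines based on GoogLeNet exceeds 96%.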
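The multi-task loss mentioned in the abstract combines a classification term with a bounding-box regression term. The following is a minimal sketch of that idea in plain Python, assuming a binary wind turbine versus background label, a log loss for classification, and a smooth L1 penalty on box offsets for positive candidates only. The function names, the weight `lam`, and the normalization by candidate count and positive count are illustrative assumptions, not the authors' implementation.

```python
import math


def smooth_l1(x: float) -> float:
    """Smooth L1 penalty: quadratic near zero, linear for |x| >= 1."""
    ax = abs(x)
    return 0.5 * x * x if ax < 1.0 else ax - 0.5


def multitask_loss(probs, labels, pred_boxes, true_boxes, lam=1.0):
    """Hypothetical combined loss over a batch of candidate regions.

    probs[i]      -- predicted probability that candidate i is a wind turbine
    labels[i]     -- 1 for a wind turbine, 0 for background
    pred_boxes[i] -- predicted box offsets (tx, ty, tw, th)
    true_boxes[i] -- ground-truth box offsets (ignored for background)
    lam           -- assumed weight balancing the two terms
    """
    eps = 1e-12  # guards against log(0)

    # Classification term: log loss averaged over all candidates.
    cls = 0.0
    for p, y in zip(probs, labels):
        p_correct = p if y == 1 else 1.0 - p
        cls += -math.log(max(p_correct, eps))
    cls /= len(probs)

    # Regression term: smooth L1 on box offsets, positives only.
    reg = 0.0
    for y, pb, tb in zip(labels, pred_boxes, true_boxes):
        if y == 1:
            reg += sum(smooth_l1(a - b) for a, b in zip(pb, tb))
    reg /= max(sum(labels), 1)

    return cls + lam * reg
```

Because the regression term is gated by the label, background candidates contribute only to the classification term, which is what lets a single network learn region classification and box refinement together.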