计算机科学 ›› 2022, Vol. 49 ›› Issue (11A): 211100100-5.doi: 10.11896/jsjkx.211100100

• 图像处理&多媒体技术 • 上一篇    下一篇

基于改进EfficientNetV2的有害垃圾图像分类方法

原慧琳1, 刘军涛2, 黄碧2, 韩真2, 冯宠2   

  1. 1 东北大学秦皇岛分校管理学院 河北 秦皇岛 066004
    2 东北大学信息科学与工程学院 沈阳 110819
  • 出版日期:2022-11-10 发布日期:2022-11-21
  • 通讯作者: 刘军涛(1822061948@qq.com)
  • 作者简介:(1000289@nequ.edu.cn)
  • 基金资助:
    东北大学产学研战略合作项目(71971050)

Classification Method of Harmful Garbage Images Based on Improved EfficientNetV2

YUAN Hui-lin1, LIU Jun-tao2, HUANG Bi2, HAN Zhen2, FENG Chong2   

  1. 1 College of Management,Northeastern University at Qinhuangdao,Qinhuangdao,Hebei 066004,China
    2 College of Information Science and Engineering,Northeastern University,Shenyang 110819,China
  • Online:2022-11-10 Published:2022-11-21
  • About author:YUAN Hui-lin,born in 1969,Ph.D,professor.Her main research interests include modeling and optimization of complex systems,information retrieval,and artificial intelligence.
    LIU Jun-tao,born in 1996,postgra-duate.His main research interests include machine learning and artificial intelligence.
  • Supported by:
    Northeastern University “Industry University Research Strategic Cooperation Project”(71971050).

摘要: 随着工业的迅速发展,垃圾产生的数量呈爆炸式上升,使得垃圾处理成为一个世界性的难题。我国政府对环境的关切也逐渐加深,不断推出了各种垃圾分类政策以及法律法规以监督市民进行垃圾分类。垃圾处理特别是有害垃圾处理,如电子废弃物等,如果处理不当,会对环境产生恶劣影响。有害垃圾图像数据具有数据质量低、图像不清晰的特点,采集来自不同设备的图像样本又有明显差异,因此有害垃圾图像处理面临巨大挑战,同时有害废弃物分类结果关系到环境污染问题,且目前产出的垃圾数量巨大,要求具有较高的处理速度和准确度。文中提出了一种基于卷积神经网络和注意力机制的垃圾图像分类方法。该方法不需要对输入的图像进行手工提取特征,通过深度学习模型框架,弥补传统图像处理算法的不足,实现对有害垃圾准确、高效的分类,可以较好地识别多种类型的有害垃圾。经实验验证,所提方法在harmful-waste数据集上的准确率达到97.46%,相比其他算法模型,其模型训练时间更短,性能更优。利用深度学习的方法,部署自动化垃圾分类模型,对于环境保护有重要意义。

关键词: 积神经网络, 注意力机制, 深度学习, 垃圾分类, 图像分类

Abstract: With the rapid development of industry,the amount of waste generated has also exploded,making waste disposal a worldwide problem.The Chinese government’s concern for the environment has gradually deepened,and various garbage classification policies and laws and regulations have been continuously introduced to supervise citizens’ garbage classification.Garbage disposal,especially hazardous garbage such as electronic waste,if it is improperly handled,will result in bad influence.Hazardous garbage image data has the characteristics of low data quality and unclear images.The image samples collected from different devices have obvious differences.Therefore,the image processing of hazardous garbage faces huge challenges.At the same time,the classification results of hazardous waste are related to environmental pollution issues.The amount of waste produced is huge,requiring high processing speed and accuracy.This paper proposes a garbage image classification method based on convolutional neural network and hybrid attention mechanism.This method does not need to manually extract features from the input image.Through the deep learning model framework,it overcomes the shortcomings of traditional image processing algorithms,achieves accurate and efficient classification of hazardous waste,and can better identify multiple types of hazardous waste.Experiment shows that the proposed method has an accuracy rate of 97.47% on the harmful-waste data set,the model training time is shorter,and its performance is better than other algorithm models.Using deep learning methods to deploy automated garbage classification models is of great significance to environmental protection.

Key words: Convolutional neural network, Attentional mechanism, Deep learning, Garbage classification, Image classification

中图分类号: 

  • TP391
[1]MA A,MAT B,SF A,et al.Artificial intelligence applications insolid waste management:A systematic research review[J].Waste Management,2020,109:231-246.
[2]YANG M,THUNG G.Classification of trash for recyclability status[J].CS229 Project Report,2016,2016(1):3.
[3]VO A H,LE H S,VO M T,et al.A Novel Framework forTrash Classification Using Deep Transfer Learning[J].IEEE Access,2019,7:178631-178639.
[4]XIE S,GIRSHICK R,DOLLÁR P,et al.Aggregated ResidualTransformations for Deep Neural Networks[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2017:1492-1500.
[5]HOWARD A G,ZHU M,CHEN B,et al.MobileNets:Efficient Convolutional Neural Networks for Mobile Vision Applications[J].arXiv:1704.04861,2017.
[6]ZHANG X,ZHOU X,LIN M,et al.ShuffleNet:An Extremely Efficient Convolutional Neural Network for Mobile Devices[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:6848-6856.
[7]KRIZHEVSKY A,SUTSKEVER I,HINTON G E.Imagenetclassification with deep convolutional neural networks[J].Advances in Neural Information Processing Systems,2012,25:1097-1105.
[8]GAO Z,XIE J,WANG Q,et al.Global Second-Order PoolingConvolutional Networks[C]//2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition(CVPR).IEEE,2020.
[9]WOO S,PARK J,LEE J Y,et al.CBAM:Convolutional Block Attention Module[C]//Proceedings of the European Conference on Computer Vision(ECCV).2018:3-19.
[10]ZHANG Q L,YANG Y B.SA-Net:Shuffle attention for deep convolutional neural networks[C]//2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).IEEE,2021:2235-2239.
[11]HU J,SHEN L,SUN G.Squeeze-and-excitation networks[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:7132-7141.
[12]TAN M,LE Q V.EfficientNetV2:Smaller Models and FasterTraining[C]//International Conference on Machine Learning.PMLR,2021:10096-10106.
[13]HE K,ZHANG X,REN S,et al.Deep residual learningfor image recognition[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2016:770-778.
[14]HOWARD A,SANDLER M,CHEN B,et al.Searching for MobileNetV3[C]//2019 IEEE/CVF International Conference on Computer Vision(ICCV).IEEE,2020.
[15]GAVAI N R,JAKHADE Y A,TRIBHUVAN S A,et al.MobileNets for flower classification using TensorFlow[C]//2017 International Conference on Big Data, IoT and Data Science(BID).IEEE,2017:154-158.
[16]SANDLER M,HOWARD A,ZHU M,et al.MobileNetV2:Inverted Residuals and Linear Bottlenecks[C]//2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition(CVPR).IEEE,2018.
[17]SUNIL C K,JAIDHAR C D,PATIL N.Cardamom Plant Disease Detection Approach Using EfficientNetV2[J].IEEE Access,2021,10:789-804.
[18]LIU D,WANG W,WU X,et al.EfficientNetv2 Model for Breast Cancer Histopathological Image Classification[C]//2022 3rd International Conference on Electronic Communication and Artificial Intelligence(IWECAI).IEEE,2022:384-387.
[19]KOUMPAROULIS A,POTAMIANOS G.Accurate and Re-source-Efficient Lipreading with Efficientnetv2 and Transfor-mers[C]//2022 IEEE International Conference on Acoustics,Speech and Signal Processing(ICASSP).IEEE,2022:8467-8471.
[20]MA B,WANG X,ZHANG H,et al.CBAM-GAN:generativeadversarial networks based on convolutional block attention module[C]//International Conference on Artificial Intelligence and Security.Cham:Springer,2019:227-236.
[21]QU H,YANG J,SHEN M,et al.Fault diagnosis of rolling bea-ring under time-varying speed conditions based on EfficientNetv2[J]. Measurement Science and Technology,2022,33(6):065023.
[1] 周芳泉, 成卫青.
基于全局增强图神经网络的序列推荐
Sequence Recommendation Based on Global Enhanced Graph Neural Network
计算机科学, 2022, 49(9): 55-63. https://doi.org/10.11896/jsjkx.210700085
[2] 戴禹, 许林峰.
基于文本行匹配的跨图文本阅读方法
Cross-image Text Reading Method Based on Text Line Matching
计算机科学, 2022, 49(9): 139-145. https://doi.org/10.11896/jsjkx.220600032
[3] 周乐员, 张剑华, 袁甜甜, 陈胜勇.
多层注意力机制融合的序列到序列中国连续手语识别和翻译
Sequence-to-Sequence Chinese Continuous Sign Language Recognition and Translation with Multi- layer Attention Mechanism Fusion
计算机科学, 2022, 49(9): 155-161. https://doi.org/10.11896/jsjkx.210800026
[4] 徐涌鑫, 赵俊峰, 王亚沙, 谢冰, 杨恺.
时序知识图谱表示学习
Temporal Knowledge Graph Representation Learning
计算机科学, 2022, 49(9): 162-171. https://doi.org/10.11896/jsjkx.220500204
[5] 熊丽琴, 曹雷, 赖俊, 陈希亮.
基于值分解的多智能体深度强化学习综述
Overview of Multi-agent Deep Reinforcement Learning Based on Value Factorization
计算机科学, 2022, 49(9): 172-182. https://doi.org/10.11896/jsjkx.210800112
[6] 饶志双, 贾真, 张凡, 李天瑞.
基于Key-Value关联记忆网络的知识图谱问答方法
Key-Value Relational Memory Networks for Question Answering over Knowledge Graph
计算机科学, 2022, 49(9): 202-207. https://doi.org/10.11896/jsjkx.220300277
[7] 汤凌韬, 王迪, 张鲁飞, 刘盛云.
基于安全多方计算和差分隐私的联邦学习方案
Federated Learning Scheme Based on Secure Multi-party Computation and Differential Privacy
计算机科学, 2022, 49(9): 297-305. https://doi.org/10.11896/jsjkx.210800108
[8] 武红鑫, 韩萌, 陈志强, 张喜龙, 李慕航.
监督和半监督学习下的多标签分类综述
Survey of Multi-label Classification Based on Supervised and Semi-supervised Learning
计算机科学, 2022, 49(8): 12-25. https://doi.org/10.11896/jsjkx.210700111
[9] 汪鸣, 彭舰, 黄飞虎.
基于多时间尺度时空图网络的交通流量预测模型
Multi-time Scale Spatial-Temporal Graph Neural Network for Traffic Flow Prediction
计算机科学, 2022, 49(8): 40-48. https://doi.org/10.11896/jsjkx.220100188
[10] 陈泳全, 姜瑛.
基于卷积神经网络的APP用户行为分析方法
Analysis Method of APP User Behavior Based on Convolutional Neural Network
计算机科学, 2022, 49(8): 78-85. https://doi.org/10.11896/jsjkx.210700121
[11] 朱承璋, 黄嘉儿, 肖亚龙, 王晗, 邹北骥.
基于注意力机制的医学影像深度哈希检索算法
Deep Hash Retrieval Algorithm for Medical Images Based on Attention Mechanism
计算机科学, 2022, 49(8): 113-119. https://doi.org/10.11896/jsjkx.210700153
[12] 孙奇, 吉根林, 张杰.
基于非局部注意力生成对抗网络的视频异常事件检测方法
Non-local Attention Based Generative Adversarial Network for Video Abnormal Event Detection
计算机科学, 2022, 49(8): 172-177. https://doi.org/10.11896/jsjkx.210600061
[13] 檀莹莹, 王俊丽, 张超波.
基于图卷积神经网络的文本分类方法研究综述
Review of Text Classification Methods Based on Graph Convolutional Network
计算机科学, 2022, 49(8): 205-216. https://doi.org/10.11896/jsjkx.210800064
[14] 闫佳丹, 贾彩燕.
基于双图神经网络信息融合的文本分类方法
Text Classification Method Based on Information Fusion of Dual-graph Neural Network
计算机科学, 2022, 49(8): 230-236. https://doi.org/10.11896/jsjkx.210600042
[15] 李宗民, 张玉鹏, 刘玉杰, 李华.
基于可变形图卷积的点云表征学习
Deformable Graph Convolutional Networks Based Point Cloud Representation Learning
计算机科学, 2022, 49(8): 273-278. https://doi.org/10.11896/jsjkx.210900023
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!