Computer Science ›› 2021, Vol. 48 ›› Issue (11A): 391-395.doi: 10.11896/jsjkx.201200127

• Image Processing & Multimedia Technology • Previous Articles     Next Articles

Clothing Image Sets Classification Based on Manifold Structure Neural Network

CHENG Ming, MA Pei, HE Ru-han   

  1. School of Mathematics and Computer Science,Wuhan Textile University,Wuhan 430200,China
  • Online:2021-11-10 Published:2021-11-12
  • About author:CHENG Ming,born in 1996,postgraduate,is a student member of China Computer Federation.His main research interests include machine learning and computer vision.
    HE Ru-han,born in 1974,Ph.D,professor,master supervisor,is a member of China Computer Federation.His main research interests include machine learning,computer vision and multimedia retrieval.
  • Supported by:
    National Science Foundation of China(61170093).

Abstract: Clothing classification based on deep learning has developed rapidly with the release of large-scale fashion data sets.However,most of the current clothing image classification methods are performed in a scene where the same clothing has a single,frontal or close-to-front image,which leads to misclassification of clothing when the view of clothing changes.In reality,the clothing features such as large deformation and severe occlusion further aggravate the problem.Therefore,a clothing image set classification method based on manifold structure neural network is proposed which uses manifold space to better represent the internal structure characteristics of clothing.Concretely,first,the shallow features of the clothing image set are extracted through the traditional convolutional neural network,and then the Euclidean feature data are converted into manifold data by using the covariance pooling.Finally,the internal manifold structures of clothing image sets are learned through the neural network based on manifold structure to obtain more accurate classification results.The experimental results show that the Precision,Recall and F-1 score of the proposed method on the MVC dataset can reach 89.64%,89.12% and 88.69%.Compared with the existing image sets (video) classification algorithms,the proposed method obtains an improvement of 2.04%,2.65% and 2.70%.It is illustrated that the proposed method is more accurate,efficient and robust than existing methods.

Key words: Clothing image set classification, Computer vision, Deep learning, Fashion analysis, Manifold neural network

CLC Number: 

  • TP391
[1]CHENG W H,SONG S,CHEN C Y,et al.Fashion Meets Computer Vision:A Survey[J].arXiv:2003.13988,2020.
[2]LEE S,OH S,JUNG C,et al.A global-local embedding module for fashion landmark detection [C]//Proceedings of the IEEE International Conference on Computer Vision Workshops.2019.
[3]GE Y,ZHANG R,WANG X,et al.Deepfashion2:A versatile benchmark for detection,pose estimation,segmentation and re-identification of clothing images[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2019:5337-5345.
[4]WANG W,ZHANG Z,QI S,et al.Learning compositional neural information fusion for human parsing[C]//Proceedings of the IEEE International Conference on Computer Vision.2019:5703-5713.
[5]WANG W,XU Y,SHEN J,et al.Attentive fashion grammarnetwork for fashion landmark detection and clothing category classification[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:4271-4280.
[6]MALL U,MATZEN K,HARIHARAN B,et al.Geostyle:Discovering fashion trends and events[C]//Proceedings of the IEEE International Conference on Computer Vision.2019:411-420.
[7]WU B,CHENG W H,LIU P,et al.SMP challenge:An overview of social media prediction challenge 2019[C]//Proceedings of the 27th ACM International Conference on Multimedia.2019:2667-2671.
[8]DONG H,LIANG X,SHEN X,et al.Fw-gan:Flow-navigated warping gan for video virtual try-on[C]//Proceedings of the IEEE International Conference on Computer Vision.2019:1161-1170.
[9]BALASKRISHNAN G,ZHAO A,DALCA A V,et al.Synthesizing images of humans in unseen poses[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:8340-8348.
[10]SANTESTEBAN I,OTADUY M A,CASAS D.Learning-Based Animation of Clothing for Virtual Try-On[J].Computer Graphics Forum,2019,38(2):355-366.
[11]SONG X,HAS X,LI Y,et al.GP-BPR:Personalized Compatibility Modeling for Clothing Matching[C]//Proceedings of the 27th ACM International Conference on Multimedia.2019:320-328.
[12]DONG X,SONG X,FENG F,et al.Personalized Capsule Wardrobe Creation with Garment and User Modeling[C]//Proceedings of the 27th ACM International Conference on Multimedia.2019:302-310.
[13]LIU Z,LUO P,QIU S,et al.Deepfashion:Powering robustclothes recognition and retrieval with rich annotations[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2016:1096-1104.
[14]SIMONYAN K,ZISSERMAN A.Very deep convolutional networks for large-scale image recognition[J].arXiv:1409.1556,2014.
[15]HE K,GKIOXARI G,DOLLAR P,et al.Mask r-cnn[C]//Proceedings of the IEEE International Conference on Computer Vision.2017:2961-2969.
[16]LIU K H,CHEN T Y,CHEN C S.Mvc:A dataset for view-invariant clothing retrieval and attribute prediction[C]//Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval.2016:313-316.
[17]DONAHUE J,ANNE HENDRICKS L,GUADARRAMA S,et al.Long-term recurrent convolutional networks for visual recognition and description[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2015:2625-2634.
[18]HOCHREITER S,SCHMIDHUBER J.Long short-term memory[J].Neural Computation,1997,9(8):1735-1780.
[19]TRAN D,BOURDEV L,FERGUS R,et al.Learning spatiotemporal features with 3d convolutional networks[C]//Proceedings of the IEEE International Conference on Computer Vision.2015:4489-4497.
[20]SIMONYAN K,ZISSERMAN A.Two-stream convolutionalnetworks for action recognition in videos[C]//Advances in Neural Information Processing Systems.2014:568-576.
[21]HUANG Z,GOOL L V.A riemannian network for SPD matrix learning[C]//Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence.2017:2036-2042.
[22]HUANG Z,WU J,GOOL L V.Building Deep Neural Networks on Grassmann Manifolds[C]//Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence.2018:3279-3286.
[23]CHAKRABORTY R,BOUZA J,MANTON J,et al.Manifoldnet:A deep neural network for manifold-valued data with applications[C]//IEEE Transactions on Pattern Analysis and Machine Intelligence.2020.
[24]ACHARYA D,HUANG Z,PANI P D,et al.Covariance pooling for facial expression recognition[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops.2018:367-374.
[25]WANG W,XU Y,SHEN J,et al.Attentive fashion grammarnetwork for fashion landmark detection and clothing category classification[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:4271-4280.
[26]BRONSTEIN M M,BTUNA J,LECUN Y,et al.Geometricdeep learning:going beyond euclidean data[J].IEEE Signal Processing Magazine,2017,34(4):18-42.
[27]MASCI J,BOSCAINI D,BRONSTEIN M,et al.Geodesic convolutional neural networks on riemannian manifolds[C]//Proceedings of the IEEE international Conference on Computer Vision Workshops.2015:37-45.
[28]NHUYEN X S,BRUN L,LEZORAY O,et al.Skeleton-based hand gesture recognition by learning SPD matrices with neural networks[C]//2019 14th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2019).IEEE,2019:1-5.
[29]QIAO S,WANG R,SHAN S,et al.Deep heterogeneous hashing for face video retrieval[J].IEEE Transactions on Image Processing,2019,29:1299-1312.
[30]LI C,ZHANG B,CHEN C,et al.Deep manifold structure transfer for action recognition[J].IEEE Transactions on Image Processing,2019,28(9):4646-4658.
[31]WANG R,GUO H,DAVIS L S,et al.Covariance discriminative learning:A natural and efficient approach to image set classification[C]//2012 IEEE Conference on Computer Vision and Pattern Recognition.IEEE,2012:2496-2503.
[1] XU Yong-xin, ZHAO Jun-feng, WANG Ya-sha, XIE Bing, YANG Kai. Temporal Knowledge Graph Representation Learning [J]. Computer Science, 2022, 49(9): 162-171.
[2] RAO Zhi-shuang, JIA Zhen, ZHANG Fan, LI Tian-rui. Key-Value Relational Memory Networks for Question Answering over Knowledge Graph [J]. Computer Science, 2022, 49(9): 202-207.
[3] TANG Ling-tao, WANG Di, ZHANG Lu-fei, LIU Sheng-yun. Federated Learning Scheme Based on Secure Multi-party Computation and Differential Privacy [J]. Computer Science, 2022, 49(9): 297-305.
[4] SUN Qi, JI Gen-lin, ZHANG Jie. Non-local Attention Based Generative Adversarial Network for Video Abnormal Event Detection [J]. Computer Science, 2022, 49(8): 172-177.
[5] WANG Jian, PENG Yu-qi, ZHAO Yu-fei, YANG Jian. Survey of Social Network Public Opinion Information Extraction Based on Deep Learning [J]. Computer Science, 2022, 49(8): 279-293.
[6] HAO Zhi-rong, CHEN Long, HUANG Jia-cheng. Class Discriminative Universal Adversarial Attack for Text Classification [J]. Computer Science, 2022, 49(8): 323-329.
[7] JIANG Meng-han, LI Shao-mei, ZHENG Hong-hao, ZHANG Jian-peng. Rumor Detection Model Based on Improved Position Embedding [J]. Computer Science, 2022, 49(8): 330-335.
[8] HU Yan-yu, ZHAO Long, DONG Xiang-jun. Two-stage Deep Feature Selection Extraction Algorithm for Cancer Classification [J]. Computer Science, 2022, 49(7): 73-78.
[9] CHENG Cheng, JIANG Ai-lian. Real-time Semantic Segmentation Method Based on Multi-path Feature Extraction [J]. Computer Science, 2022, 49(7): 120-126.
[10] HOU Yu-tao, ABULIZI Abudukelimu, ABUDUKELIMU Halidanmu. Advances in Chinese Pre-training Models [J]. Computer Science, 2022, 49(7): 148-163.
[11] ZHOU Hui, SHI Hao-chen, TU Yao-feng, HUANG Sheng-jun. Robust Deep Neural Network Learning Based on Active Sampling [J]. Computer Science, 2022, 49(7): 164-169.
[12] SU Dan-ning, CAO Gui-tao, WANG Yan-nan, WANG Hong, REN He. Survey of Deep Learning for Radar Emitter Identification Based on Small Sample [J]. Computer Science, 2022, 49(7): 226-235.
[13] WANG Jun-feng, LIU Fan, YANG Sai, LYU Tan-yue, CHEN Zhi-yu, XU Feng. Dam Crack Detection Based on Multi-source Transfer Learning [J]. Computer Science, 2022, 49(6A): 319-324.
[14] CHU Yu-chun, GONG Hang, Wang Xue-fang, LIU Pei-shun. Study on Knowledge Distillation of Target Detection Algorithm Based on YOLOv4 [J]. Computer Science, 2022, 49(6A): 337-344.
[15] LIU Wei-ye, LU Hui-min, LI Yu-peng, MA Ning. Survey on Finger Vein Recognition Research [J]. Computer Science, 2022, 49(6A): 1-11.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!