Computer Science ›› 2019, Vol. 46 ›› Issue (12): 286-291.doi: 10.11896/jsjkx.181202257

• Graphics ,Image & Pattern Recognition • Previous Articles     Next Articles

Word Vectors Fusion Based Remote Sensing Scenes Zero-shot Classification Algorithm

WU Chen1,3, YUAN Yu-wei2, WANG Hong-wei3, LIU Yu3, LIU Si-tong4, QUAN Ji-cheng3   

  1. (Naval Aeronautical University,Yantai,Shandong 264001,China)1;
    (The 91977 of PLA,Beijing 102200,China)2;
    (Aviation University Air Force,Changchun 130022,China)3;
    (Xi’an Flight Academy of PLA,Xi’an 710000,China)4
  • Received:2018-12-04 Online:2019-12-15 Published:2019-12-17

Abstract: Zero-shot classification algorithm does not need to label the sample of unseen classes to be recognized,so it can greatly reduce the cost of practical application,which has attracted wide attention in recent years.The problem of structure difference between word vectors and image feature prototypesseriously affects the zero-shot classification performance of remote sensing scenes.Based on the complementarity among different kinds of word vectors,the remote sensing scenes zero-shot classification algorithm based on word vectors fusion,named coupled analysis dictionary lear-ning method,was proposed.Firstly,the sparse coefficients of different kinds of word vectors are obtained by the more efficient analysis dictionary learning to reduce the redundant information.Then,the sparse coefficients are concatenated and denoted as the fused word vectors,and a structure alignment operation is performed based on the image feature prototypes to reduce structural differences by embedding the fused word vectors into image feature space.Finally,the image feature prototypes of the scene classes unseen are calculated,and the nearest neighbor classifier is employed to complete the classification in the image feature space.Quantitative experiments of the fusion of multiple semantic word vectors were carried out on UCM and AID datasets.At the same time,two real remote sensing images were qualitatively tested with RSSCN7 datasets as the seen dataset.Auantitative experiments obtaines the highest overall classification accuracies of 48.40% and 60.23% on UCM and AID,which respectively exceeds the typical comparative methods by 4.80% and 6.98%.In qualitative experiments on two real remote sensing images ,the algorithm also obtaines the best zero-shot classification performance.The experimental results show that the fused word vectors are more consistent with the prototypes in image feature space,and the zero-shot classification accuracies of remote sensing scenes can be significantly improved.

Key words: Analysis dictionary learning, Remote sensing scenes classification, Structure alignment, Word vectors fusion, Zero-shot classification

CLC Number: 

  • TP181
[1]CHEN S,TIAN Y L.Pyramid of spatial relations for scene-level land use classification[J].IEEE Transactions on Geoscience & Remote Sensing,2014,53(4):1947-1957.
[2]LI A,LU Z,WANG L,et al.Zero-shot scene classification for high spatial resolution remote sensing images[J].IEEE Tran-sactions on Geoscience & Remote Sensing,2017,55(7):4157-4167.
[3]XIAN Y Q,AKATA Z,SHARMA G,et al.Latent embeddings for zero-shot classification[C]//Proceedings of Computer Vision and Pattern Recognition.LA USA:IEEE,2016:69-77.
[4]WANG D,LI Y,LIN Y,et al.Relational Knowledge Transfer for Zero-Shot Learning[C]//Proceedings of AAAI.CA USA:2016,2-7.
[5]ZHANG Z,SALIGRAMA V.Zero-shot learning via joint latent similarity embedding[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.LA USA:IEEE,2016:6034-6042.
[6]ZHANG Z,SALIGRAMA V.Zero-shot learning via semantic similarity embedding[C]//Proceedings of the IEEE InternationalConference on Computer Vision.Boston USA:IEEE,2015:4166-4174.
[7]WANG Q,CHEN K.Zero-shot visual recognition via bidirec- tional latent embedding[J].International Journal of Computer Vision,2017,124(3):356-383.
[8]LI Y,WANG D,HU H,et al.Zero-shot recognition using dual visual-semantic mapping paths[J].arXiv:1703.05002,2017.
[9]ZHAO B,WU B,WU T,et al.Zero-shot learning posed as a missing data problem[J].arXiv:1612.00560,2016.
[10]LAMPERT C H,NICKISCH H,HARMELING S.Attribute- based classification for zero-shot visual object categorization[J].IEEE Transactions on Pattern Analysis & Machine Intelligence,2014,36(3):453-465.
[11]SOCHER R,GANJOO M,MANNING C D,et al.Zero-shot learning through cross-modal transfer[C]//Proceedings of Advances in Neural Information Processing Systems.Lake Tahoe USA,ACM 2013:935-943.
[12]MIKOLOV T,CHEN K,CORRADO G,et al.Efficient estimation of word representations in vector space[J].arXiv:1301.3781,2013.
[13]SOCHER R,MANNING C D.Glove:Global vectors for word representation[C]//Empirical Methods in Natural Language Processing (EMNLP).2014:1532-1543 [14]YANG M,CHANG H Y,LUO W X.Discriminative analysis-synthesis dictionary learning for image classification[J].Neurocomputing,2017,219:404-411.
[15]WANG J J,GUO Y Q,GUO J,et al.Synthesis linear classifier-based analysis dictionary learning for pattern classification [J].Neurocomputing,2017,238:103-113.
[16]RAVISHANKAR S,BRESLER Y.L0 sparsifying transform learning with efficient optimal updates and convergence guarantees[J].IEEE Transactions on Signal Processing,2015,63(9):2389-2404.
[17]YANG Y,NEWSAM S.Bag-of-visual-words and spatial extensions for land-use classification[C]//Proceedings of the 18th SIGSPATIAL International Conference on Advances In Geographic Information Systems.ACM,2010:270-279.
[18]XIA G S,HU J,HU F,et al.AID:A benchmark dataset for performance evaluation of aerial scene classification[J].IEEE Transactions on Geoscience and Remote Sensing,2017,55(7):3965-3981.
[19]ZOU Q,NI L,ZHANG T,et al.Deep learning based feature selection for remote sensing scene classification[J].IEEEGeo-science & Remote Sensing Letters,2015,12(11):2321-2325.
[20]SIMONYAN K,ZISSERMAN A.Very deep convolutional networks for large-scale image recognition[J].arXiv:1409.1556,2014.
[1] WANG Jun-hao, YAN De-qin, LIU De-shan, XING Yu-jia. Algorithm with Discriminative Analysis Dictionary Learning by Fusing Extreme Learning Machine [J]. Computer Science, 2020, 47(5): 137-143.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!