计算机科学 ›› 2023, Vol. 50 ›› Issue (6): 183-193.doi: 10.11896/jsjkx.220400038
秦亮1, 谢良1, 陈盛双1, 徐海蛟2
QIN Liang1, XIE Liang1, CHEN Shengshuang1, XU Haijiao2
摘要: 近年来,哈希算法由于其存储成本小、检索速度快的特点,在大规模多媒体数据的高效跨模态检索中受到了广泛关注。现有的跨模态哈希算法大多是有监督和无监督方法,其中有监督方法通常能够获得更好的性能,但在实际应用中要求所有数据都被标记并不具有可行性。此外,这些方法大多数是离线方法,面对流数据的输入需要付出高额训练成本且十分低效。针对上述问题,提出了一种新的半监督跨模态哈希方法——在线半监督锚图跨模态哈希(Online Semi-supervised Anchor Graph Cross-modal Hashing,OSAGCH),构建了半监督锚图跨模态哈希模型,在只有部分数据有标签的情况下,利用正则化锚图预测数据标签,并通过子空间关系学习哈希函数,一步生成统一的哈希码,同时针对流数据输入的情况对该模型进行了在线化学习,使其能够处理流数据。在公共多模态数据集上进行了实验,结果表明所提方法的性能优于其他现有方法。
中图分类号:
[1]WANG J,ZHANG T,SEBE N,et al.A survey on learning to hash [J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2017,40(4):769-790. [2]GORISSE D,CORD M,PRECIOSO F.Locality-sensitive ha-shing for chi2 distance [J].IEEE Transactions on Pattern Ana-lysis and Machine Intelligence,2011,34(2):402-409. [3]MATSUSHITA Y,WADA T.Principal component hashing:an accelerated approximate nearest neighbor search[C]//Advances in Image and Video Technology.Tokyo:Springer,2009:374-385. [4]GONG Y C,LAZEBNIK S,GORDO A,et al.Iterative quantization:a procrustean approach to learning binary codes for large-scale image retrieval [J].IEEE Transactions on Pattern Analysis & Machine Intelligence,2013,35(12):2916-2929. [5]LIU L,YU M Y,SHAO L.Unsupervised local feature hashing for image similarity search [J].IEEE Trans.Cybern,2015,46(11):2548-2558. [6]HEO J P,LEE Y,HE J F,et al,Spherical hashing:binary code embedding with hyperspheres [J].IEEE Transactions on Pattern Analysis Machine Intelligence,2015,37(11):2304-2316. [7]STRECHA C,BRONSTEIN A,BRONSTEIN M,et al.LDA-Hash:improved matching with smaller descriptors [J].IEEE Trans.Pattern Anal.Mach.Intell,2011,34(1):66-78. [8]LIU W,WANG J,JI R R,et al.Supervised Hashing with Kernels[C]//IEEE Conference on Computer Vision and Pattern Recognition.Providence:CVPR,2012:2074-2081. [9]NGUYEN V A,DO M N.Deep learning based supervised ha-shing for efficient image retrieval[C]//IEEE International Conference on Multimedia and Expo.Seattle:ICME,2016:1-6. [10]WANG J,KUMAR S,CHANG S F.Semi-supervised hashingfor large-scale search [J].IEEE Trans.Pattern Anal.Mach.Intell,2012,34(12):2393-2406. [11]WU C X,ZHU J K,CAI D,et al.Semi-supervised nonlinearhashing using bootstrap sequential projection learning [J].IEEE Trans.Knowl.Data Eng,2013,25(6):1380-1393. [12]GAO L L,SONG J K,ZOU F H,et al.Scalable multimedia retrieval by deep learning hashing with relative similarity learning[C]//Proceedings of the 23rd ACM International Conference on Multimedia.New York:ACM,2015:903-906. [13]ZHOU X C,TIAN X,WANG X Z,et al.Bagging-boosting-based semi-supervised multi-hashing with query-adaptive re-ranking [J].Neurocomputing,2018,275(10):916-923. [14]ZHANG S F,LI J M,ZHANG B.Pairwise teacher-student network for semi-supervised hashing[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops.New York:IEEE,2019:730-737. [15]ANCULEF R,MENA F,MACALUSO A,et al.Self-Supervised Bernoulli Autoencoders for Semi-Supervised Hashing[C]//Iberoamerican Congress on Pattern Recognition.Porto:Springer,2021:258-268. [16]WANG G A,HU Q H,CHENG J,et al.Semi-supervised gene-rative adversarial hashing for image retrieval[C]//Proceedings of the 15th European Conference on Computer Vision.Munich:ECCV,2018:469-485. [17]YU J,WU X J,KITTLER J.Semi-supervised hashing for semi-paired cross-view retrieval[C]//Proceedings of the 24th International Conference on Pattern Recognition(ICPR).Beijing:IEEE,2018:958-963. [18]SONG T C,CAI J F,ZHANG T Q,et al.Semi-supervised mani-fold-embedded hashing with joint feature representation and classifier learning [J].Pattern Recognition,2017,68(2):99-110. [19]XU X,SHEN F M,YANG Y,et al.Learning discriminative binary codes for large-scale cross-modal retrieval [J].IEEE Transactions on Image Processing,2017,26(5):2494-2507. [20]ETO K,KOUTAKI G,SHIRAI K.Hadamard Coded Discrete Cross Modal Hashing[C]//2018 25th IEEE International Conference on Image Processing(ICIP).Switzerland:IEEE,2018:2007-2011. [21]YU G X,LIU X W,WANG J,et al.Flexible Cross-Modal Ha-shing [J].IEEE Transactions on Neural Networks and Learning Systems,2020,1(1):1-11. [22]LIN Z J,DING G G,HAN J G,et al.Cross-view retrieval via probability-based semantics-preserving hashing [J].IEEE Tran-sactions on Cybernetics,2016,47(12):4342-4355. [23]JIANG Q Y,LI W J.Deep cross-modal hashing[C]//Procee-dings of the IEEE Conference on Computer Vision and Pattern Recognition.Hawaii:CVPR,2017:3232-3240. [24]HUANG L K,YANG Q,ZHENG W S.Online hashing [J].IEEE Transactions on Neural Networks and Learning Systems,2017,29(6):2309-2322. [25]LIN M B,JI R R,LIU H,et al.Supervised online hashing via hadamard code-book learning[C]//Proceedings of the 26th ACM International Conference on Multimedia.Seoul Republic:ACM,2018:1635-1643. [26]LIN M B,JI R R,LIU H,et al.Hadamard matrix guided online hashing [J].International Journal of Computer Vision,2020,128(8):2279-2306. [27]ZHENG C,ZHU L,ZHANG S,et al.Efficient parameter-freeadaptive multi-modal hashing [J].IEEE Signal Processing Letters,2020,27(9):1270-1274. [28]LIN T Y,MAIRE M,BELONGIE S,et al.Microsoft coco:Common objects in context[C]//European Conference on Computer vision.Zurich:Springer,Cham,2014:740-755. [29]HUISKES M J,LEW M S.The mir flickr retrieval evaluation[C]//Proceedings of the 1st ACM International Conference on Multimedia Information Retrieval.Canada:ACM,2008:39-43. [30]CHUA T S,TANG J,HONG R,et al.Nus-wide:a real-world web image database from national university of singapore[C]//Proceedings of the ACM International Conference on Image and Video Retrieval.Greece:CIVR,2009:1-9. [31]SONG J,YANG Y,HUANG Z,et al.Effective multiple feature hashing for large-scale near-duplicate video retrieval [J].IEEE Transactions on Multimedia,2013,15(8):1997-2008. [32]LU X,ZHU L,LI J,et al.Efficient supervised discrete multi-view hashing for large-scale multimedia search [J].IEEE Tran-sactions on Multimedia,2019,22(8):2048-2060. [33]SCHEFFER T,WROBEL S.Active learning of partially hidden markov models[C]//Proceedings of the ECML/PKDD Workshop on Instance Selection.Freiburg:ECML,2001:1-15. [34]WANG D,WANG Q,GAO X.Robust and flexible discrete ha-shing for cross-modal similarity search [J].IEEE Transactions on Circuits and Systems for Video Technology,2017,28(10):2703-2715. [35]WANG D,WANG Q,HE L,et al.Joint and individual matrix factorization hashing for large-scale cross-modal retrieval [J].Pattern Recognition,2020,107(4):7-19. [36]LIU L,YANG Y,HU M,et al.Index and retrieve multimedia data:Cross-modal hashing by learning subspacerelation[C]//International Conference on Database Systems for Advanced Applications.Australia:Springer,2018:606-621. |
|