计算机科学 ›› 2017, Vol. 44 ›› Issue (Z6): 84-87.doi: 10.11896/j.issn.1002-137X.2017.6A.017

• 智能计算 • 上一篇    下一篇

基于双向学习排序的跨媒体语义相似性度量方法

刘爽,白亮,于天元,贾玉华   

  1. 国防科学技术大学信息系统工程重点实验室 长沙410073,国防科学技术大学信息系统工程重点实验室 长沙410073,国防科学技术大学信息系统工程重点实验室 长沙410073,国防科学技术大学信息系统工程重点实验室 长沙410073
  • 出版日期:2017-12-01 发布日期:2018-12-01

Cross-media Semantic Similarity Measurement Using Bi-directional Learning Ranking

LIU Shuang, BAI Liang, YU Tian-yuan and JIA Yu-hua   

  • Online:2017-12-01 Published:2018-12-01

摘要: 随着互联网技术的迅猛发展,网络信息的呈现形式不断从简单的文本扩展到图像、声音、视频等多媒体表达形式。在多媒体信息检索领域中,传统方法往往在同一个特征空间中表示所有的媒体模式,并采取一对一的配对数据,或者利用单向排序实例作为训练样本进行检索。在此背景下,考虑了学习双向排序实例,进而实现了跨媒体检索的方法。在Wikipedia数据集上进行测试,实验结果表明,基于双向排序的跨媒体语义相似性度量方法具有更好的性能。

关键词: 跨媒体表示,双向学习排序,隐空间,相似性度量

Abstract: With the rapid development of Internet technology,the presented forms of network information have exten-ded from simple text to images,voice,video and other multimedia expression.In the field of multimedia information retrieval,the traditional methods often represent all of the media mode in the same feature space model.Existing methods take either one-to-one paired data or uni-directional ranking examples.In this paper,we considered learning bi-directionalranking examples in the cross-media retrieval.By analyzing the experimental results basing on the Wikipedia dataset,it is demonstrated better performance of the proposed method.

Key words: Cross-media representation,Bi-directional learning ranking,Latent space,Similarity measurement

[1] HOTELLING H.Relations between two sets of variates[J].Biometrika,1936,8(3/4):321-377.
[2] RASIWASIA N,PEREIRA J C,C OVIELLO E,et al.A new approach to cross-modal multimedia retrieval[C]∥Proceedings of the International Conference on Multimedia .2010:251-260.
[3] JIA Y,SALZMANN M,DARRELL T.Factorized latent spaces with structured sparsity[C]∥Advances in Neural Information Processing Systems.2010,23:982-990.
[4] JIA Y,SALZMANN M,DARRELL T.Learning cross-modality similarity for multinomial data[C]∥IEEE International Confe-rence on Computer Vision.2011:2407-2414.
[5] GRANGIER D,BENGIO S.A discriminative kernel-based approach to rank images from text queries[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2008,30(8):1371-1384.
[6] WU F,LU X,ZHANG Z,et al.Cross-media semantic representation via bi-directional learning to rank[C]∥MM’13.2013:877-886.
[7] JIANG X,WU F,LI X,et al.ACM International Conference on Multimedia [J].Computer & Graphics,1994,8(4):611-612.
[8] DEERWESTER S,DUMAIS S,F URNAS G,et al.Indexing by latent semantic analysis[J].Journal of the American Society for Information Science,1990,41(6):391-407.
[9] BAI B,WESTON J,GRANGIER D,et al.Learning to rank with (a lot of) word features[J].Information Retrieval,2010,13(3):291-314.
[10] TSOCHANTARIDIS I,JOACHIMS T,HOFMANN T,et al.Large margin methods for structured and interdependent output variables[J].Journal of Machine Learning Research,2006,6(2):1453-1484.
[11] JOACHIMS T.A support vector method for multivariate performance measures[C]∥Proceedings of the 22nd International Conference on Machine Learning.2005:377-384.
[12] SHALEV-SHWARTZ S,SINGER Y,SREBRO N.Pegasos:Primal estimated sub-gradient solver for svm[C]∥Proceedings of the 24th International Conference on Machine Learning.2007:807-814.
[13] JOACHIMS T,FINLEY T,YU C.Cutting-plane training ofstructural svms[J].Machine Learning,2009,7(1):27-59.
[14] BAEZA-YATES R,RIBEIRO-NETO B.Modern InformationRetrieval[M].Addison-Wesley,1999.
[15] Ada.http://www.svcl.ucsd.edu/projects.

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!