一种视频数据代表选择框架方法

doi:10.11896/j.issn.1002-137X.2016.11.004

摘要/Abstract

摘要： 为有效处理视频数据问题,提出一种识别海量数据集中代表子集的方法,即代表选择方法,经选择后的小容量的数据代表完全可以代表原始大数据集的结构特征。对于给定的大数据集,首先生成相应₁-norm非负稀疏图,然后利用一种谱聚类算法基于所生成的稀疏图将大数据反复划分直至形成聚类簇。代表选择过程中,将每个聚类看作Grassmann流形中的一个点,然后基于测地距衡量这些点间的距离,接着利用min-max算法分析距离以提取出较优的聚类子集。最后,通过分析被选集类的一个稀疏子图,利用主成分集中性方法探测出数据代表,称此过程为基于非负稀疏图与Grassmann流形测地距的代表选择框架。为验证所提出的框架,将之应用于视频分析中,从一长段的视频流中识别出少数的几个关键帧,实验效果通过人工判断与标准评价方法进行评价,并与现有的几种方法的效果进行比对,结果证明所提出的代表选择框架方法具有更好的效果与可行性。

关键词: 稀疏图,Grassmann流形,测地距,关键帧,代表

Abstract: In order to solve the process problem of massive videos data,a representative selection method of identifying the optimal subset of data points as a representative of original massive dataset was proposed.The selected data points of subset can represent inner structure of original massive dataset.And the novel representative selection method is based on ₁-norm non-negative sparse graph for the original massive dataset.The massive data points are partitioned into some clusters by using a spectral clustering algorithm based on the non-negative sparse graph generated in previous steps.Each cluster is viewed as a point in the Grassmann manifold,and the geodesic distances among these points are measured.By using a min-max algorithm,geodesic distances are analyzed to build an optimal subset of clusters.Finally,the principal component centrality method is used to detect a representative after analyzing the sparse graph of selected clusters.The proposed framework is validated on the problem of video summarization,where a few key frames should be selected in long video clips which contain massive frames.The comparison of the results obtained between the proposed algorithm and some state-of-the-art methods was producted.Result indicates the effectiveness and feasibility of the proposed framework.

Key words: Sparse graph,Grassmann manifold,Geodesic distance,Key frame,Representative

蒋勇,张海涛. 一种视频数据代表选择框架方法[J]. 计算机科学, 2016, 43(11): 19-23. https://doi.org/10.11896/j.issn.1002-137X.2016.11.004

JIANG Yong and ZHANG Hai-tao. Representative Selection Framework Approach for Videos[J]. Computer Science, 2016, 43(11): 19-23. https://doi.org/10.11896/j.issn.1002-137X.2016.11.004

参考文献

[1] Dang C,Radha H.Heterogeneity Image Patch Index and Its Ap-plication to Consumer Video Summarization[J].IEEE Transactions on Image Processing,March 2014,23(6):2704-2718
[2] Elhamifar,Ehsan,Sapiro G,et al.Finding Exemplars from Pair-wise Dissimilarities via Simultaneous Sparse Recovery[J].Advances in Neural Information Processing Systems,2012,1(8):19-27
[3] Kumar,Mrityunjay,Loui A C.Key frame extraction from consumer videos using sparse representation[C]∥Proc.18^th IEEE ICIP.2011:2437-2440
[4] Luo Jie-bo,Papin C,Costello K.Towards extracting semantically meaningful key frames from personal video clips:from humans to computers[J].IEEE Transactions on Circuits and Systems for Video Technology,2009,19(2):289-301
[5] Wang Zhe-shen,Kumar M,Luo Jie-bo,et al.Extracting keyframes from consumer videos using bi-layer group sparsity[C]∥ACM International Conference on Multimedia.2011:1505-1508
[6] Elhamifar,Ehsan,Sapiro G,Vidal R.See all by looking at a few:Sparse modeling for finding representative objects[C]∥Computer Vision and Pattern Recognition IEEE Conference on(CVPR).2012:1600-1607
[7] Cheng B,Yang J,Yan S,et al.Learning with ₁-graph for image analysis [J].IEEE Transactions on Image Processing,2010,19(4):858-866
[8] Dang C T,Kumar M,Radha H.Key frame extraction from consumer videos using epitome[C]∥Image Processing,Internatio-nal Conference on (ICIP).IEEE,2012:93-96
[9] He Ran,Zheng Wei-shi,Hu Bao-gang,et al.Nonegative sparsecoding for discriminative semi-supervised learning[C]∥Computer Vision and Pattern Recognition (CVPR).CVPR,IEEE,Providence,USA,2011:792-801
[10] Wong W K.Discover latent discriminant information for dimensionality reduction:non-negative sparseness preserving embedding [J].Pattern Recognition,2012,45(4):1511-1523
[11] Lu Gui-fu,Jin Zhong,Zou Jian.Face recognition using discriminant sparsity neighborhood preserving embedding [J].Know-ledge based Systems,2012,31(2):119-127
[12] Shi Jian-bo,Malik J.Normalized Cuts and Image Segmentation[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2000,2(8):888-905
[13] Raducanu B,Dornaika F.A supervised non-linear dimensionality reduction approach for manifold learning [J].Pattern Recognition,2012,45(6):2432-2444
[14] Ilyas M U,Radha H.Identifying Influential Nodes in Online Social Networks Using Principal Component Centrality[C]∥2011 IEEE International Conference on Kyoto.IEEE,Japan,2011
[15] Almeida,Jurandy,Leite N J,Ricardo da S Torres.Online video summarization on compressed domain [J].Journal of Visual Communication and Image Representation,2013,24(6):729-738
[16] Almeida,Jurandy,Leite N J,et al.Vision:Video summarization for online applications[J].Pattern Recognition Letters,2012,33(4):397-409
[17] The Open Video Project.http://www.open-video.org
[18] Kim Y J,Oh Y T,Yoon S H,et al.Efficient Hausdorff Distance computation for freeform geometric models in close proximity[J].Computer-Aided Design,2013,45(2):270-276
[19] Schuetze O,Equivel X,Lara A,et al.Some comments on GD and IGD and relations to the Hausdorff distance[C]∥Proceedings of the 12^th Annual Conference Comp on Genetic and Evolutionary Computation(GECCO’s 10:).New York,NY,USA,ACM,2010:1971-1974

Metrics

Viewed

Full text

Abstract

Cited

Shared

Discussed