计算机科学 ›› 2019, Vol. 46 ›› Issue (2): 271-278.doi: 10.11896/j.issn.1002-137X.2019.02.042

• 图形图像与模式识别 • 上一篇    下一篇

基于HEVC压缩域的镜头边界检测方法

朱威1, 商明将1, 荣意1, 冯杰2   

  1. 浙江工业大学信息工程学院 杭州3100231
    浙江理工大学信息学院 杭州3100182
  • 收稿日期:2018-06-06 出版日期:2019-02-25 发布日期:2019-02-25
  • 通讯作者: 朱 威(1982-),男,博士,副教授,硕士生导师,主要研究方向为视频编解码和智能视觉处理,E-mail:weizhu@zjut.edu.cn
  • 作者简介:商明将(1994-),男,硕士生,主要研究方向为智能视频分析;荣 意(1992-),女,硕士生,主要研究方向为智能视频分析;冯 杰(1981-),男,博士,讲师,硕士生导师,主要研究方向为视频信息处理和机器学习。
  • 基金资助:
    本文受浙江省自然科学基金(LY17F010013),国家自然科学基金(61501402,61401398)资助。

Shot Boundary Detection Method Based on HEVC Compressed Domain

ZHU Wei1, SHANG Ming-jiang1, RONG Yi1, FENG Jie2   

  1. College of Information Engineering,Zhejiang University of Technology,Hangzhou 310023,China1
    School of Information Science and Technology,Zhejiang Sci-Tech University,Hangzhou 310018,China2
  • Received:2018-06-06 Online:2019-02-25 Published:2019-02-25

摘要: 镜头边界检测是智能视频检索的一个重要环节。现有的检测方法主要是在像素域进行处理,切变检测精度不高,计算复杂度过大。针对这些问题,文中利用解析HEVC码流得到的编码信息,提出了一种基于HEVC压缩域的镜头边界检测方法。首先统计每帧编码信息中各类预测模式的PU个数,并根据CU深度对运动矢量进行幅值滤波;然后采用PU预测模式、运动矢量和帧比特数对切变候选帧进行两级筛选,再对其进行自适应阈值的镜头切变检测;接着根据切变帧对视频序列进行分段,并在时域上对帧比特数进行平滑滤波;最后使用PU预测模式和经滤波平滑后的帧比特数对分段视频进行镜头渐变检测。实验结果表明,该方法具有良好的镜头边界检测效果,并具有较低的计算复杂度。

关键词: HEVC, 镜头边界检测, 压缩域, 预测模式

Abstract: Shot boundary detection is an important part of intelligent video retrieval.The existing detection methods are mainly processed in pixel domain,with low accuracy of cut and high computational complexity.To solve these problems,this paper used the encoding information obtained by parsing HEVC stream and proposesd a shot boundary detection method based on HEVC compressed domain.First,the number of PUs with different prediction modes is counted for each frame,and the motion vectors are filtered according to CU depth.Then,the two-stage candidate frame of cut is selected by using the PU prediction modes,the motion vectors and the number of frame bits.And then,the cut shot detection of adaptive threshold is performed.After that,the video is segmented according to the cut frame.In addition,the smooth filtering is carried out for the frame bits in the time domain.Finally,the PU prediction modes and the number of smoothed frame bits are used to detect the gradual shot detection.The experimental results show that the proposed method has a good effect on shot boundary detection with lower computational complexity.

Key words: Compressed domain, HEVC, Prediction mode, Shot boundary detection

中图分类号: 

  • TP391
[1]DAI K X,LI Q,LI G H.Prospects and current studies on video mining[J].Computer Science,2010,37(10):11-15.(in Chinese)
代科学,李强,李国辉.视频挖掘研究进展[J].计算机科学,2010,37(10):11-15.
[2]SHEN R K,LIN Y N,JUANG T T,et al.Automatic detection of video shot boundary in social media using a hybrid approach of HLFPN and keypoint matching[J].IEEE Transactions on Computational Social Systems,2018,5(1):210-219.
[3]ALI M,ADNAN A.Short boundary detection using spatial-temporal features[J].Advances in Intelligent Systems and Computing,2016,448:971-981.
[4]SANTOS A C S,PEDRINI H.Shot boundary detection for video temporal segmentation based on the weber local descriptor[C]∥IEEE International Conference on Systems,Man,and Cyberne-tics.IEEE,2017:1310-1315.
[5]LI J Q,PAN Q K,LIANG Y C.Shot change detection on news videos using color histogram and edge based approaches[C]∥ IEEE International Conference on Advances in Computer Applications.IEEE,2016:50-54.
[6]PENG T L,ZHANG W J,WANG Y B,et al.Video shot boun- dary detection algorithm based on multi-features[J].Chinese Journal of Scientific Instrument,2015,36(9):2013-2020.(in Chinese)
彭太乐,张文俊,汪友宝,等.基于多特征的视频镜头检测方法[J].仪器仪表学报,2015,36(9):2013-2020.
[7]QU Z,GAO T F,ZHANG Q Q.Study on an improved algorithm of video keyframe extraction[J].Computer Science,2012,39(8):300-303.(in Chinese)
瞿中,高腾飞,张庆庆.一种改进的视频关键帧提取算法研究[J].计算机科学,2012,39(8):300-303.
[8]ZHONG X,YANG G,LU Y S.Method of key frames extraction based on double-threshold values sliding window sub-shot segmentation and fully connected graph[J].Computer Science,2016,43(6):289-293.(in Chinese)
钟忺,杨光,卢炎生.基于双阈值滑动窗口子镜头分割和完全连通图的关键帧提取方法[J].计算机科学,2016,43(6):289-293.
[9]LU Z M,SHI Y.Fast video shot boundary detection based on SVD and pattern matching[J].IEEE Transactions on Image Processing,2013,22(12):5136-5145.
[10]LAKSHMI P G G,DOMNIC S.Walsh-hadamard transform kernel-based feature vector for shot boundary detection[J].IEEE Transactions on Image Processing,2014,23(12):5187-5197.
[11]TIPPAYA S,SITJONGSATAPORN S,TAN T,et al.Multi- Modal Visual Features-Based Video Shot Boundary Detection[J].IEEE Access,2017,5:12563-12575.
[12]MONDAL J,KUNDU M K,DAS S,et al.Video shot boundary detection using multiscale geometric analysis of nsct and least squares support vector machine[J].Multimedia Tools and Applications,2018,77(7):8139-8161.
[13]JIAN M,YIN Y,DONG J.Relative Flow Estimates for Shot Boundary Detection[J].Pattern Recognition and Image Analysis,2018,28(1):53-58.
[14]YANG Z,LI C.Gradual shot detection employing automatic white balance method[C]∥ ACM International Conference on Multimedia Systems and Signal Processing.ACM,2018:71-74.
[15]SULLIVAN G J,OHM J,HAN W J,et al.Overview of the high efficiency video coding (HEVC) standard[J].IEEE Transactions on Circuits & Systems for Video Technology,2012,22(12):1649-1668.
[16]GONG S R,FAN Y J,ZHOU X.A novel scene change detection algorithm on H.264/AVC[J].Journal of Chinese Computer Systems,2007,28(4):688-691.(in Chinese)
龚声蓉,范益进,周翔.一种基于H.264/AVC码流的镜头边界检测方法[J].小型微型计算机系统,2007,28(4):688-691.
[17]XIA D Y,XIE H L.Shot boundary detection based on H.264/ AVC compressed domain[J].Journal of Image and Graphics,2009,14(12):2595-2598.(in Chinese)
夏定元,谢惠琳.一种在H.264/AVC压缩域中检测镜头边界的方法[J].中国图象图形学报,2009,14(12):2595-2598.
[18]LIU Y,WANG W,GAO W,et al.A novel compressed domain shot segmentation algorithm on H.264/AVC[C]∥ IEEE International Conference on Image Processing.IEEE,2004:2235-2238.
[19]ZHANG W,WANG Y,JIANG X.A shot segmentation algo- rithm for H.264 compressed videos[C]∥ IEEE International Congress on Image and Signal Processing.IEEE,2013:81-85.
[20]ZHANG W,WANG Y,JIANG X.A compressed-domain method of shot segmentation for X264 videos[C]∥ IEEE International Conference on Natural Computation.IEEE,2014:868-872.
[21]YOU Y X,ZHANG E D,GOU Z J.Shot boundary detection using Biased-SVM in H.264 compressed domain[J].Computer Engineering and Applications,2013,49(24):138-143.(in Chinese)
游运喜,张恩迪,苟志坚.H.264压缩域中利用Biased-SVM检测镜头边界[J].计算机工程与应用,2013,49(24):138-143.
[22]ZHANG Q M.Video shot boundary detection based on MB co- ding mode and SIFT features on H.264/AVC[C]∥ IEEE International Conference on Progress in Informatics and Computing.IEEE,2014:299-302.
[1] 徐艺菲, 熊淑华, 孙伟恒, 何小海, 陈洪刚.
基于非局部低秩和自适应量化约束先验的HEVC后处理算法
HEVC Post-processing Algorithm Based on Non-local Low-rank and Adaptive Quantization Constraint Prior
计算机科学, 2021, 48(5): 155-162. https://doi.org/10.11896/jsjkx.200800079
[2] 刘东, 王叶斐, 林建平, 马海川, 杨闰宇.
端到端优化的图像压缩技术进展
Advances in End-to-End Optimized Image Compression Technologies
计算机科学, 2021, 48(3): 1-8. https://doi.org/10.11896/jsjkx.201100134
[3] 蔡于涵,熊淑华,孙伟恒,Karn Pradeep,何小海.
基于运动矢量细化的帧率上变换与HEVC结合的视频压缩算法
Video Compression Algorithm Combining Frame Rate Up-conversion with HEVC Standard Based on Motion Vector Refinement
计算机科学, 2020, 47(2): 76-82. https://doi.org/10.11896/jsjkx.190500092
[4] 徐婧瑶, 王祖林, 徐迈.
基于深度学习的视频转码快速算法
Deep Learning Based Fast VideoTranscoding Algorithm
计算机科学, 2019, 46(3): 113-118. https://doi.org/10.11896/j.issn.1002-137X.2019.03.016
[5] 郭红伟, 骆洪军, 刘帅, 牛林, 杨波.
一种改进的R-λ模型码率控制算法
Improved R-λ Model Based Rate Control Algorithm
计算机科学, 2019, 46(3): 142-147. https://doi.org/10.11896/j.issn.1002-137X.2019.03.021
[6] 朱威, 易瑶, 王图强, 郑雅羽.
一种深度图像帧内编码单元快速划分算法
Fast Coding Unit Partition Algorithm for Depth Maps
计算机科学, 2019, 46(10): 286-294. https://doi.org/10.11896/jsjkx.180701337
[7] 张兆丰,吴泽民,杜麟,胡磊.
基于压缩域编码长度的视频显著性检测
Video Saliency Detection Based on Compressed Domain Coding Length
计算机科学, 2017, 44(10): 312-317. https://doi.org/10.11896/j.issn.1002-137X.2017.10.056
[8] 赵磊,黄华.
AVS监控档视频的压缩域摘要研究
Compressed Domain Synopsis Research in AVS Surveillance Profile
计算机科学, 2016, 43(7): 46-50. https://doi.org/10.11896/j.issn.1002-137X.2016.07.007
[9] 岑跃峰,王万良,姚信威,王超超,潘铁强.
基于决策树的HEVC编码单元划分算法
Decision Tree Based Coding Unit Splitting Algorithm for HEVC
计算机科学, 2016, 43(4): 308-312. https://doi.org/10.11896/j.issn.1002-137X.2016.04.063
[10] 丁琦,平西建.
基于脉冲位置参数统计特征的压缩域语音隐写分析
Steganalysis of Compressed Speech Based on Statistics of Pulse Position Parameters
计算机科学, 2011, 38(1): 217-220.
[11] 侯绿林,白亮,老松杨.
一种压缩域中的体育视频慢镜头探测方法
Method for Slow-motion Replay Detection on Compressed Domain in Sports Video
计算机科学, 2009, 36(9): 283-286.
[12] 韩冰 高新波 姬红兵.
一种分层的和多分辨的镜头边界检测方法

计算机科学, 2006, 33(6): 225-231.
[13] .
一种基于有限自动机的渐变镜头检测算法

计算机科学, 2006, 33(1): 252-254.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!