Computer Science, 2011, Vol. 38, Issue 9: 271-275.

• Graphics and Images •

Approach to Video Text Localization Based on Gradient Discrete Cosine Transform

YAN Jun-hua, LI Dan, ZHOU Ya-tong

  1. (School of Computer Science, Southwest Petroleum University, Chengdu 610500); (School of Information Engineering, Hebei University of Technology, Tianjin 300130)
  • Online: 2018-11-16 Published: 2018-11-16
  • Supported by:
    This work was supported by the China Postdoctoral Science Foundation (20090450750).

Abstract: Video text information plays an important role in semantics-based video analysis, indexing, and retrieval. Because text and background in video differ in how sharply their gray levels vary, an approach to video text localization based on the gradient discrete cosine transform (DCT) is proposed. First, the video frame is divided into N×N macro-blocks and the DCT coefficients of each block are computed. Next, the magnitude of the gradient operator is calculated and taken as the block intensity, which is then smoothed by filtering and processed morphologically. Finally, the image is projected in the horizontal and vertical directions, the number of caption lines is counted, and the text regions are marked with text boxes. Simulation results show that the method is feasible for localizing both static text and scrolling captions, and that the algorithm runs fast and efficiently. In particular, it localizes characters with few strokes accurately and without omissions.

Key words: Text localization, Video frame, Gradient
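
The pipeline summarized in the abstract can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the abstract does not say which DCT coefficients form the gradient magnitude, so the sketch assumes the first row and first column of AC coefficients stand in for horizontal and vertical variation. The function names, the block size `n=8`, and the `threshold` value are all hypothetical, and the smoothing and morphological steps are omitted for brevity.

```python
import math


def dct_matrix(n):
    """Orthonormal DCT-II basis as an n x n nested list."""
    rows = []
    for k in range(n):
        scale = math.sqrt((1.0 if k == 0 else 2.0) / n)
        rows.append([scale * math.cos(math.pi * (2 * i + 1) * k / (2 * n))
                     for i in range(n)])
    return rows


def block_intensity(block, basis):
    """Gradient-like magnitude of one n x n block from its DCT coefficients.

    Assumption: C[0][v] (v >= 1) measures horizontal variation and
    C[u][0] (u >= 1) measures vertical variation.
    """
    n = len(basis)
    # Project onto the DC basis vector along one axis first, so only the
    # first row and first column of the 2-D DCT are ever computed.
    col_dc = [sum(basis[0][i] * block[i][j] for i in range(n)) for j in range(n)]
    row_dc = [sum(basis[0][j] * block[i][j] for j in range(n)) for i in range(n)]
    g_h = sum(abs(sum(basis[v][j] * col_dc[j] for j in range(n))) for v in range(1, n))
    g_v = sum(abs(sum(basis[u][i] * row_dc[i] for i in range(n))) for u in range(1, n))
    return math.hypot(g_h, g_v)


def intensity_map(frame, n=8):
    """Block intensity for every complete n x n block of a grayscale frame."""
    basis = dct_matrix(n)
    h, w = len(frame), len(frame[0])
    return [[block_intensity([row[x:x + n] for row in frame[y:y + n]], basis)
             for x in range(0, w - n + 1, n)]
            for y in range(0, h - n + 1, n)]


def runs(flags):
    """Return (start, end) pairs of consecutive True runs; end is exclusive."""
    out, start = [], None
    for i, flag in enumerate(list(flags) + [False]):
        if flag and start is None:
            start = i
        elif not flag and start is not None:
            out.append((start, i))
            start = None
    return out


def locate_text(frame, n=8, threshold=50.0):
    """Return text boxes as (x0, y0, x1, y1) pixel tuples."""
    mask = [[v > threshold for v in row] for row in intensity_map(frame, n)]
    boxes = []
    # Horizontal projection: consecutive block rows containing at least one
    # high-intensity block form one caption band.
    for r0, r1 in runs([any(row) for row in mask]):
        # Vertical projection inside the band gives each box's horizontal extent.
        cols = [any(mask[r][c] for r in range(r0, r1)) for c in range(len(mask[0]))]
        for c0, c1 in runs(cols):
            boxes.append((c0 * n, r0 * n, c1 * n, r1 * n))
    return boxes


if __name__ == "__main__":
    # Synthetic 32 x 64 frame: a striped "caption" on a flat black background.
    frame = [[255 if (8 <= y < 16 and 16 <= x < 48 and x % 2 == 1) else 0
              for x in range(64)] for y in range(32)]
    print(locate_text(frame))
```

The number of caption lines corresponds to the number of row bands found by the horizontal projection. In practice the threshold would need tuning per video source, and a smoothing or morphological closing pass on the block mask would help merge gaps between characters.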
