计算机科学 ›› 2020, Vol. 47 ›› Issue (6): 126-132.doi: 10.11896/jsjkx.19050002

• 计算机图形学&多媒体 • 上一篇    下一篇

基于特征图融合的小尺寸人脸检测

杨少鹏1, 刘宏哲1, 王雪峤2   

  1. 1 北京联合大学北京市信息服务工程重点实验室 北京100101
    2 北京联合大学计算机技术研究所 北京100101
  • 收稿日期:2019-05-06 出版日期:2020-06-15 发布日期:2020-06-10
  • 通讯作者: 刘宏哲(liuhongzhe@buu.edu.cn)
  • 作者简介:shaopeng568@163.com
  • 基金资助:
    国家自然科学基金(61871039,61802019,61906017);北京联合大学领军计划项目(BPHR2019AZ01);北京市属高校高水平教师队伍建设支持计划项目(IDHT20170511);国家科技支撑项目(2015BAH55F03);北京市教委科技计划一般项目(KM201811417002,KM201911417001,KM201711417005);智能驾驶大数据协作创新中心(CYXC1902);北京自然科学基金(4184088)

Small Size Face Detection Based on Feature Map Fusion

YANG Shao-peng1, LIU Hong-zhe1, WANG Xue-qiao2   

  1. 1 Beijing Key Laboratory of Information Service Engineering,Beijing Union University,Beijing 100101,China
    2 Institute of Computer Technology,Beijing Union University,Beijing 100101,China
  • Received:2019-05-06 Online:2020-06-15 Published:2020-06-10
  • About author:YANG Shao-peng,born in 1990,postgraduate.His main research interests include pattern recognition and so on.
    LIU Hong-zhe,born in 1971,Ph.D,professor,Ph.D supervisor,is a member of China Computer Federation.Her main research interests include pattern recognition and so on.
  • Supported by:
    This work was supported by the National Natural Science Foundation of China (61871039,61802019,61906017),Beijing United University Lea-dership Program(BPHR2019AZ01),Supporting Plan for Cultivating High Level Teachers in Colleges and Universities in Beijing (IDHT20170511),National Science and Technology Support Project (2015BAH55F03),Foundation of Beijing Municipal Education Commission (KM201811417002,KM201911417001, KM201711417005), Big Data Collaborative Innovation Center for Intelligent Driving(CYXC1902),and Beijing Natural Science Foundation(4184088)

摘要: 人脸检测是指从输入图片或视频中找到人脸的精确位置并确定其大小。为了应对尺度多样性特别是小尺寸人脸给人脸检测任务带来的困难,文中提出一种新的基于特征图融合的小尺寸人脸检测方法。首先,合理地选择待检测特征图,使用不同的特征图检测不同大小的人脸。然后,通过将较深的特征图和较浅的特征图进行融合,合理地引入上下文信息,从而提高小尺寸人脸的检测精度。在NVIDIA GTX TATAN X上,使用WIDERFACE数据集对所提方法进行训练和测试,其在WIDERFACE 3个测试子集上的测试结果分别为88.9%(hard),93.5%(medium),94.3%(easy)AP,检测速度为39 fps,其检测精度和检测速度均优于其他优秀的检测方法。

关键词: 上下文信息, 特征图融合, 小尺寸, 一阶段网络

Abstract: Face detection is finding and locating all faces from the input pictures or videos.In order to solve the difficulties caused by the diversity of face size,especially small-sized faces,a new single shot small-scale face detection method is presented based on feature map fusion.The method first selects the feature map to be detected reasonably,and uses different feature maps to detect faces of different sizes.Then,by combining the deep feature map and the shallow feature map,the context information is introduced reasonably,thereby improving the detection precision of the small-sized face.The proposed model is trained and tested on the NVIDIA GTX TATAN X using the WIDERFACE dataset.The results on the three test subsets of WIDERFACE are 88.9% (hard),93.5% (medium),94.3% (easy) AP,at 39 fps.It is superior to other excellent detection methods in both detection accuracy and detection speed.

Key words: Contextual information, Feature map fusion, Single shot, Small size

中图分类号: 

  • TP391
[1]JIA J.Research on Face Detection Technology Based on Deep Learning Network[J].China New Telecommunications,2017,19(24):54-55.
[2]VIOLA P,JONES M.Rapid object detection using a boosted cascade of simple features[C]//IEEE Computer Society Confe-rence on Computer Vision & Pattern Recognition.Kauai,HI,USA,2001:511.
[3]MATHIAS M,BENENSON R,PEDERSOLI M,et al.Face detection without bells and whistles[C]//European Conference on Computer Vision.Cham:Springer,2014:720-735.
[4]LI H,LIN Z,SHEN X,et al.A convolutional neural network cascade for face detection[C]//Proceedings of the IEEE Confe-rence on Computer Vision and Pattern Recognition.2015:5325-5334.
[5]WU S Z,KAN M,SHAN S G,et al.Funnel-structured cascade for multi-view face detection with alignment-awareness[J].Neurocomputing,2016,221(C):138-145.
[6]YANG S,LUO P,CHEN C L,et al.Faceness-Net:Face detection through deep facial part responses[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2017,PP(99):1.
[7]LIU W,ANGUELOV D,ERHAN D,et al.SSD:Single shot multibox detector[C]//European Conference on Computer Vision.Amsterdam,Netherlands,2016:21-37.
[8]ZHANG S,ZHU X,LEI Z,et al.S3FD:Single Shot Scale-invariant Face Detector[C]//Proceedings of the IEEE International Conference on Computer Vision.2017:192-201.
[9]REN S,HE K,GIRSHICK R,et al.Faster R-CNN:Towards Real-time Object Detection with Region Proposal Networks[C]//Advances in Neural Information Processing Systems.2015:91-99.
[10]REDMON J,DIVVALA S,GIRSHICK R,et al.You only Look Once:Unified,Real-time Object Detection[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2016:779-788.
[11]ZHU C,ZHENG Y,LUU K,et al.CMS-RCNN:Con textual Multi-Scale Region-Based CNN for Unconstrained Face Detection [M]//Deep Learning for Biometrics.Cham:Springer,2017:57-79.
[12]HU P,RAMANAN D.Finding Tiny Faces[C]// Computer Vision & Pattern Recognition.2017.
[13]HE K,ZHANG X,REN S,et al.Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition[J].IEEE Transactions on Pattern Analysis & Machine Intelligence,2014,37(9):1904-1916.
[14]SIMONYAN K,ZISSERMAN A.Very Deep Convolutional Networks for Large-scale Image Recognition[J].arXiv:1409.1556.
[15]LUO W,LI Y,URTASUN R,et al.Understanding the Effective Receptive Field in Deep Convolutional Neural Networks[C]//Advances in Neural Information Processing Systems.2016:4898-4906.
[16]YANG S,LUO P,LOY C,et al.WIDERFACE:A face detection benchmark[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.Las Vegas,USA,2016:5525-5533.
[17]CAO G,XIE X,YANG W,et al.Feature-Fused SSD:Fast Detection for Small Objects[C]//Ninth International Conference on Graphic and Image Processing (ICGIP 2017).International Society for Optics and Photonics,2018.
[18]FU C Y,LIU W,RANGA A,et al.Dssd:Deconvolutional single shot detector[J].arXiv:1701.06659,2017.
[19]SHRIVASTAVA A,GUPTA A,GIRSHICK R.Training region-based object detectors with online hard example mining[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2016:761-769.
[20]LIN T Y,GOYAL P,GIRSHICK R,et al.Focal loss for dense object detection[C]//Proceedings of the IEEE International Conference on Computer Vision.2017:2980-2988.
[21]ZHU C,TAO R,LUU K,et al.Seeing Small Faces from Robust Anchor’s Perspective[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:5127-5136.
[1] 黄少滨, 孙雪薇, 李熔盛.
基于跨句上下文信息的神经网络关系分类方法
Relation Classification Method Based on Cross-sentence Contextual Information for Neural Network
计算机科学, 2022, 49(6A): 119-124. https://doi.org/10.11896/jsjkx.210600150
[2] 郝志峰, 廖祥财, 温雯, 蔡瑞初.
基于多上下文信息的协同过滤推荐算法
Collaborative Filtering Recommendation Algorithm Based on Multi-context Information
计算机科学, 2021, 48(3): 168-173. https://doi.org/10.11896/jsjkx.200700101
[3] 晏旭, 马帅, 曾凤娇, 郭正华, 伍俊龙, 杨平, 许冰.
基于编码-解码器架构的光场深度估计方法
Light Field Depth Estimation Method Based on Encoder-decoder Architecture
计算机科学, 2021, 48(10): 212-219. https://doi.org/10.11896/jsjkx.200900005
[4] 马海江.
基于卷积神经网络与约束概率矩阵分解的推荐算法
Recommendation Algorithm Based on Convolutional Neural Network and Constrained Probability Matrix Factorization
计算机科学, 2020, 47(6A): 540-545. https://doi.org/10.11896/JsJkx.191000172
[5] 周鹏程,龚声蓉,钟珊,包宗铭,戴兴华.
基于深度特征融合的图像语义分割
Image Semantic Segmentation Based on Deep Feature Fusion
计算机科学, 2020, 47(2): 126-134. https://doi.org/10.11896/jsjkx.190100119
[6] 徐扬,王建成,刘启元,李寿山.
基于上下文信息的口语意图检测方法
Intention Detection in Spoken Language Based on Context Information
计算机科学, 2020, 47(1): 205-211. https://doi.org/10.11896/jsjkx.181202269
[7] 赵鹏, 吴礼发, 洪征.
基于经纪人的多云访问控制模型研究
Research on Broker Based Multicloud Access Control Model
计算机科学, 2019, 46(11): 123-129. https://doi.org/10.11896/jsjkx.190300112
[8] 文俊浩,孙光辉,李顺.
基于用户聚类和移动上下文的矩阵分解推荐算法研究
Study on Matrix Factorization Recommendation Algorithm Based on User Clustering and Mobile Context
计算机科学, 2018, 45(4): 215-219. https://doi.org/10.11896/j.issn.1002-137X.2018.04.036
[9] 谌国风,孔俊俊,郭耀,陈向群.
一种智能手机上下文信息获取的代价模型及其应用
Context Retrieval Cost Model on Smartphones and its Application
计算机科学, 2014, 41(11): 132-136. https://doi.org/10.11896/j.issn.1002-137X.2014.11.026
[10] 田宣,李冬梅.
上下文信息检索研究综述
Survey on Contextual Information Retrieval
计算机科学, 2011, 38(9): 18-24.
[11] .
目标独立的Prolog程序路径依赖分析语义

计算机科学, 2008, 35(2): 246-252.
[12] 姚寒冰 胡和平 卢正鼎 李瑞轩.
基于角色和上下文的动态网格访问控制研究

计算机科学, 2006, 33(1): 41-44.
[13] 张仰森 曹元大.
基于语料库的自然语言建模方法研究

计算机科学, 2004, 31(5): 176-179.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!