基于改进YOLOv4-tiny的人脸关键点快速检测

doi:10.11896/jsjkx.211100290

Computer Science ›› 2022, Vol. 49 ›› Issue (11A): 211100290-5.doi: 10.11896/jsjkx.211100290

• Image Processing & Multimedia Technology • Previous Articles Next Articles

Facial Landmark Fast Detection Based on Improved YOLOv4-tiny

FU Bo-wen¹, LI Chuang-chuang¹, LIANG Ai-hua²

1 School of Robotics,Beijing Union University,Beijing 100101,China
2 Frontier Intelligent Technology Research Institute,Beijing Union University,Beijing 100101,China

Online:2022-11-10 Published:2022-11-21
About author:FU Bo-wen,born in 2000,undergra-duate.His main research interests include computer vison and so on.
LIANG Ai-hua,born in 1979,Ph.D,associate professor.Her main research interests include biometric recognition and image processing.
Supported by:
National Natural Science Foundation of China(61502036),Scientific Research Project of Beijing Union University(ZK50202002) and General Project of Beijing Association of Higher Education(YB202175).

Abstract

Abstract: Facial landmark detection is an important part of face recognition,which has been a hot issue in the field of computer vision.In order to meet the needs of efficient and lightweight face recognition,this paper proposes a facial landmark detection algorithm based on improved YOLOv4-tiny.608*608*3 color image is used for model input.The CSPDarknet53-tiny network is adopted to extract the main features of the input image.Then the extracted features are up-sampled and fused.Attention mechanism is added before feature fusion to improve the detection accuracy.The loss function of YOLOv4-tiny target detection is reconstructed,and the loss function of facial landmark is added to realize the location of facial landmark while detecting.The model output includes face marker frame and five key points.Compared with other facial landmark detection algorithms,the proposed algorithm has higher recognition efficiency and lower configuration requirements while ensuring recognition accuracy.Therefore,it can be better deployed on edge devices or mobile devices.

Key words: Facial landmark detection, YOLOv4-tiny, Attention mechanism, Real-time detection, Deep learning

CLC Number:

TP391

FU Bo-wen, LI Chuang-chuang, LIANG Ai-hua. Facial Landmark Fast Detection Based on Improved YOLOv4-tiny[J].Computer Science, 2022, 49(11A): 211100290-5.

References

[1]COOTES T F,TAYLOR C J,COOPER D H,et al.Active shape models-their training and application[J].Computer Vision and Image Understanding,1995,61(1):38-59.
[2]COOTES T F,EDWARDS G J,TAYLOR C J.Active appea-rance models[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2001,23(6):681-685.
[3]DENG J,GUO J,ZHOU Y,et al.Retinaface:Single-stage dense face localisation in the wild[J].arXiv:1905.00641,2019.
[4]HE K,ZHANG X,REN S,et al.Deep residual learning forimage recognition[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2016:770-778.
[5]LIN T Y,DOLLÁR P,GIRSHICK R,et al.Feature pyramidnetworks for object detection[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2017:2117-2125.
[6]SUI Y T,YAN Z Y,DAI L L,et al.Research on face multi-attribute detection algorithm based on RetinaFace[J].Railway Computer Applications,2021,30(3):1-4.
[7]DENG J,GUO J,XUE N,et al.Arcface:Additive angular margin loss for deep face recognition[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2019:4690-4699.
[8]LI H,LIN Z,SHEN X,et al.A convolutional neural network cascade for face detection[C]//Proceedings of the IEEE Confe-rence on Computer Vision and Pattern Recognition.2015:5325-5334.
[9]ZHANG K,ZHANG Z,LI Z,et al.Joint face detection andalignment using multitask cascaded convolutional networks[J].IEEE Signal Processing Letters,2016,23(10):1499-1503.
[10]ZHANG S,ZHU X,LEI Z,et al.Faceboxes:A CPU real-time face detector with high accuracy[C]//2017 IEEE International Joint Conference on Biometrics(IJCB).IEEE,2017:1-9.
[11]WANG C Y,BOCHKOVSKIY A,LIAO H Y M.Scaled-yolov4:Scaling cross stage partial network[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2021:13029-13038.
[12]BOCHKOVSKIY A,WANG C Y,LIAO H Y M.Yolov4:Optimal speed and accuracy of object detection[J].arXiv:2004.10934,2020.
[13]WANG Q L,WU B G,ZHU P F,et al.ECT-Net:EfficientChannerl Attention for Deep Convolutional Neural Networks[J].arXiv:1910.03151,2019.
[14]REN S,HE K,GIRSHICK R,et al.Faster r-cnn:Towards real-time object detection with region proposal networks[J].Advances in Neural Information Processing Systems,2015,28:91-99.
[15]YANG S,LUO P,LOY C C,et al.WIDER FACE:A Face Detection Benchmark[C]//IEEE Conference on Computer Vision &Pattern Recognition.IEEE,2016.

Related Articles 15

[1]	RAO Zhi-shuang, JIA Zhen, ZHANG Fan, LI Tian-rui. Key-Value Relational Memory Networks for Question Answering over Knowledge Graph [J]. Computer Science, 2022, 49(9): 202-207.
[2]	TANG Ling-tao, WANG Di, ZHANG Lu-fei, LIU Sheng-yun. Federated Learning Scheme Based on Secure Multi-party Computation and Differential Privacy [J]. Computer Science, 2022, 49(9): 297-305.
[3]	ZHOU Fang-quan, CHENG Wei-qing. Sequence Recommendation Based on Global Enhanced Graph Neural Network [J]. Computer Science, 2022, 49(9): 55-63.
[4]	DAI Yu, XU Lin-feng. Cross-image Text Reading Method Based on Text Line Matching [J]. Computer Science, 2022, 49(9): 139-145.
[5]	ZHOU Le-yuan, ZHANG Jian-hua, YUAN Tian-tian, CHEN Sheng-yong. Sequence-to-Sequence Chinese Continuous Sign Language Recognition and Translation with Multi- layer Attention Mechanism Fusion [J]. Computer Science, 2022, 49(9): 155-161.
[6]	XU Yong-xin, ZHAO Jun-feng, WANG Ya-sha, XIE Bing, YANG Kai. Temporal Knowledge Graph Representation Learning [J]. Computer Science, 2022, 49(9): 162-171.
[7]	XIONG Li-qin, CAO Lei, LAI Jun, CHEN Xi-liang. Overview of Multi-agent Deep Reinforcement Learning Based on Value Factorization [J]. Computer Science, 2022, 49(9): 172-182.
[8]	WANG Jian, PENG Yu-qi, ZHAO Yu-fei, YANG Jian. Survey of Social Network Public Opinion Information Extraction Based on Deep Learning [J]. Computer Science, 2022, 49(8): 279-293.
[9]	HAO Zhi-rong, CHEN Long, HUANG Jia-cheng. Class Discriminative Universal Adversarial Attack for Text Classification [J]. Computer Science, 2022, 49(8): 323-329.
[10]	JIANG Meng-han, LI Shao-mei, ZHENG Hong-hao, ZHANG Jian-peng. Rumor Detection Model Based on Improved Position Embedding [J]. Computer Science, 2022, 49(8): 330-335.
[11]	WANG Ming, PENG Jian, HUANG Fei-hu. Multi-time Scale Spatial-Temporal Graph Neural Network for Traffic Flow Prediction [J]. Computer Science, 2022, 49(8): 40-48.
[12]	ZHU Cheng-zhang, HUANG Jia-er, XIAO Ya-long, WANG Han, ZOU Bei-ji. Deep Hash Retrieval Algorithm for Medical Images Based on Attention Mechanism [J]. Computer Science, 2022, 49(8): 113-119.
[13]	SUN Qi, JI Gen-lin, ZHANG Jie. Non-local Attention Based Generative Adversarial Network for Video Abnormal Event Detection [J]. Computer Science, 2022, 49(8): 172-177.
[14]	YAN Jia-dan, JIA Cai-yan. Text Classification Method Based on Information Fusion of Dual-graph Neural Network [J]. Computer Science, 2022, 49(8): 230-236.
[15]	HU Yan-yu, ZHAO Long, DONG Xiang-jun. Two-stage Deep Feature Selection Extraction Algorithm for Cancer Classification [J]. Computer Science, 2022, 49(7): 73-78.

Metrics

Viewed

Full text

Abstract

Cited

Shared

Discussed

Comments

Recommended 0

No Suggested Reading articles found!

Facial Landmark Fast Detection Based on Improved YOLOv4-tiny

PDF (PC)

Abstract

Cite this article

share this article

References

Related Articles 15

Metrics

Comments

Recommended 0