计算机科学 ›› 2022, Vol. 49 ›› Issue (10): 214-223.doi: 10.11896/jsjkx.210900080

• 计算机图形学&多媒体 • 上一篇    下一篇

几何特征和属性标签驱动的人脸图像合成

代福芸, 迟静, 任明国, 张琪东   

  1. 山东财经大学计算机科学与技术学院 济南 250014
    山东省数字媒体技术重点实验室 济南 250014
  • 收稿日期:2021-09-10 修回日期:2022-03-10 出版日期:2022-10-15 发布日期:2022-10-13
  • 通讯作者: 迟静(peace_world_cj@126.com)
  • 作者简介:(192115004@mail.sdufe.edu.cn)
  • 基金资助:
    山东省重点研发计划(2019GSF109112);山东省自然科学基金(ZR2019MF016);山东省高等学校青创科技支持计划(2020KJN007);济南市“新高校20条”科研带头人工作室(2021GXRC092)

Face Image Synthesis Driven by Geometric Feature and Attribute Label

DAI Fu-yun, CHI Jing, REN Ming-guo, ZHANG Qi-dong   

  1. School of Computer Science and Technology,Shandong University of Finance and Economics,Jinan 250014,China
    Shandong Provincial Key Laboratory of Digital Media Technology,Jinan 250014,China
  • Received:2021-09-10 Revised:2022-03-10 Online:2022-10-15 Published:2022-10-13
  • About author:DAI Fu-yun,born in 1996,postgra-duate.Her main research interests include face image synthesis and so on.
    CHI Jing,born in 1980,Ph.D,professor,Ph.D supervisor.Her main research interests include computer animation,digital image processing and information visualization.
  • Supported by:
    Key Research and Development Program of Shandong Province(2019GSF109112),Natural Science Foundation of Shandong Province(ZR2019MF016),Science and Technology Plan for Young Talents in Colleges and Universities of Shandong Province(2020KJN007) and Scientific Research Studio in Colleges and Universities of Jinan City(2021GXRC092).

摘要: 针对当前人脸图像合成中存在的模样和表情合成的多样性不足、表情的真实感不高、合成效率低的问题,提出了一种引入面部几何特征和属性标签约束的新型人脸合成网络模型。该模型在给定一张源人脸图片、一张目标人脸图片和属性(如发色、性别、年龄等)标签的情况下,能够生成一张具有源人脸表情、目标人脸身份特征以及指定属性的高真实感的人脸图像。模型包括两部分:人脸特征点生成器(Facial Landmark Generator,FLMG)和几何-属性感知生成器(Geometry and Attribute Aware Generator,GAAG)。FLMG利用人脸几何特征点编码表情信息,并将源人脸的表情信息迁移到目标人脸的特征点上;GAAG结合FLMG中生成的特征点、给定的属性标签和目标人脸图片,生成一张具有指定模样和表情的人脸图片。文中还引入了一个新的软截断三元感知损失函数用于约束GAAG,该函数可使生成的人脸更好地保持目标人脸的身份特征且更加真实自然,同时可使GAAG模型以更快的速度收敛。实验结果表明,所提方法可合成模样和表情多样化的人脸图像,且人脸的外貌真实、表情自然。另外,所提网络模型只需训练一次即可实现任意不同表情之间的迁移,效率较高。

关键词: 人脸图像合成, 表情迁移, 人脸编辑, 生成式对抗网络

Abstract: Aiming at the problems in current face image synthesis,such as the lack of diversity of synthetic appearances and expressions,the low reality of the facial expressions and the low synthesis efficiency,this paper proposes a novel face synthesis network model driven by facial geometric feature and attribute label.Given a source face image,a target face image and the attribute(e.g.,hair color,gender,age) label,the new face synthesis model can generate a highly realistic face image which owns the expression of the source face,the identity of the target face and the specified attribute.The new model consists of two parts:facial landmark generator(FLMG) and geometry and attribute aware generator(GAAG).FLMG uses the facial geometric feature points to encode the expression information,and transfers the expression from the source to the target face in the form of feature points.Combining the transferred feature points,the specified attribute label and the target face image,GAAG generates a face image with specified appearance and expression.A novel soft margin triplet perception loss is introduced to GAAG,which can make the synthesized face more natural and keep the identity of the target face well,and makes the GAAG converge faster.Experimental results show that the face images generated by our approach have more diverse appearances and more realistic expressions.In addition,our model only needs to be trained once to realize the transfer between any arbitrary different expressions,so its efficiency is high.

Key words: Face image synthesis, Expression transfer, Face editing, Generative adversarial network

中图分类号: 

  • TP317
[1]SUSSKIND J M,HINTON G E,MOVELLAN J R,et al.Gene-rating facial expressions with deep belief nets[M]//Affective Computing,Emotion Modelling,Synthesis and Recognition.2008:421-440.
[2]YAN Y F,LU K,XUE J,et al.Face animation method based on deep learning and expression AU parameters [J].Journal of Computer Aided Design & Graphics,2019,31(11):1973-1980.
[3]GOODFELLOW I J,POUGET-ABADIE J,MIRZA M,et al.Generative adversarial nets[C]//Neural Information Processing Systems.2014:2672-2680.
[4]ARJOVSKY M,CHINTALA S,BOTTOU L.Wasserstein GAN[J].arXiv:1701.07875,2017.
[5]ISOLA P,ZHU J Y,ZHOU T H,et al.Image-to-image translation with conditional adversarial networks[C]//Computer Vison and Pattern Recognition.2017:5967-5976.
[6]YI Z L,ZHANG H,TAN P,et al.DualGAN:unsupervised dual learning for image-to-image translation[C]//IEEE International Conference on Computer Vision.2017:2868-2876.
[7]HUANG H B.Face image synthesis and analysis based on gene-rative model[D].Beijing:University of Chinese Academy of Sciences,2019.
[8]CHEN F J,ZHU F,WU Q X,et al.A Survey of Generative Adversarial Networks and Its Application in Image Generation[J].Chinese Journal of Computers,2021,44(2):347-369.
[9]PERARNAU G,VAN J,RADUCANU B,et al.Invertible conditional GANs for image editing[C]//NIPS Workshop on Adversarial Training.2016.
[10]DING H,SRICHARAN K,CHELLAPPA R.ExprGAN:Facial expression editing with controllable expression intensity[C]//AAAI Conference on Artificial Intelligence.2018:6665-7655.
[11]KARRAS T,LAINE S,AILA T.A Style-Based Generator Architecture for Generative Adversarial Networks[C]//Confe-rence on Computer Vision and Pattern Recognition.2019:4401-4410.
[12]TEWARI A,ELGHARIB M,BHARAJ G.StyleRig:RiggingStyleGAN for 3D Control over Portrait Images[C]//Confe-rence on Computer Vision and Pattern Recognition.2020:6141-6150.
[13]SONG L X,LU Z H,HE R,et al.Geometry Guided Adversarial Facial Expression Synthesis[C]//ACM International Confe-rence on Multimedia.2018:627-635.
[14]QIAO F C,YAO N M,JIAO Z R,et al.Emotional facial expression transfer from a single image via generative adversarial nets[J].Computer Animation and Virtual Worlds,2018,29(3/4):1/10-10/10.
[15]OLSZEWSKI Z,LI Z M,YANG C,et al.Realistic Dynamic Facial Textures from a Single Image Using GANs[C]//IEEE International Conference on Computer Vision.2017:5429-5438.
[16]ZHU J Y,PARK T,ISOLA P,et al.Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks[C]//IEEE International Conference on Computer Vision.2017:2232-2969.
[17]CHOI Y J,CHOI M J,KIM M Y,et al.StarGAN:Unified Gene-rative Adversarial Networks for Multi-Domain Image-to-Image Translation[C]//IEEE Conference on Computer Vision and Pattern Recognition.2018:8789-8797.
[18]ZHANG J N,ZENG X F,WANG M M,et al.FReeNet:Multi-Identity Face Reenactment[C]//Computer Vison and Pattern Recognition.2020:5326-5335.
[19]WANG X P,WANG Y H,LI W X.U-Net Conditional GANs for Photo-Realistic and Identity-Preserving Facial Expression Synthesis[J].ACM Transactions on Multimedia Computing,Communications,and Applications,2019,15(3s):1-23.
[20]SCHROFF F,KALENICHENKO D,PHILBIN J.FaceNet:AUnified Embedding for Face Recognition and Clustering[C]//Computer Vison and Pattern Recognition.2015:815-823.
[21]JUSTIN J,ALEXANDRE A,LI F F.Perceptual Losses forReal-Time Style Transfer and Super-Resolution[C]//European Conference on Computer Vision.2016:694-711.
[22]GUO X J,LI S Y,ZHANG J W,et al.PFLD:A Practical Facial Landmark Detector[J].arXiv:1902.10859,2019.
[23]ODENA A,OLAH C,SHLENS J.Conditional Image Synthesis with Auxiliary Classifier GANs[C]//The International Confe-rence on Machine Learning.2017:2642-2651.
[24]SIMONYAN K,ZISSERMAN A.Very Deep Convolutional Networks for Large-Scale Image Recognition[C]//International Conference on Learning Representations.2015.
[25]LANGNER O,DOTSCH R,BIJLSTRA G,et al.Presentation and validation of the Radboud Faces Database [J].Cognition and Emotion,2010,24(8):1377-1388.
[26]Karolinska Institute,Stockholm,Sweden.The Karolinska Di-rected Emotional Faces[EB/OL].https://www.kdef.se/index.html.
[27]DIEDERIK P,KINGMA D,JIMMY B.Adam:A Method for Stochastic Optimization[C]//International Conference on Learning Representations.2015.
[28]AMAZON.Amazon Mechanical Turk(AMT)[EB/OL].https://www.mturk.com.
[1] 徐国宁, 陈奕芃, 陈一鸣, 陈晋音, 温浩.
基于约束优化生成式对抗网络的数据去偏方法
Data Debiasing Method Based on Constrained Optimized Generative Adversarial Networks
计算机科学, 2022, 49(6A): 184-190. https://doi.org/10.11896/jsjkx.210400234
[2] 唐雨潇, 王斌君.
基于深度生成模型的人脸编辑研究进展
Research Progress of Face Editing Based on Deep Generative Model
计算机科学, 2022, 49(2): 51-61. https://doi.org/10.11896/jsjkx.210400108
[3] 邢红杰, 郝忠.
基于全局和局部判别对抗自编码器的异常检测方法
Novelty Detection Method Based on Global and Local Discriminative Adversarial Autoencoder
计算机科学, 2021, 48(6): 202-209. https://doi.org/10.11896/jsjkx.200400083
[4] 刘海波,武天博,沈晶,史长亭.
基于GAN-LSTM的APT攻击检测
Advanced Persistent Threat Detection Based on Generative Adversarial Networks and Long Short-term Memory
计算机科学, 2020, 47(1): 281-286. https://doi.org/10.11896/jsjkx.181102103
[5] 刘剑, 金泽群.
基于深度学习的人脸表情迁移方法
Facial Expression Transfer Method Based on Deep Learning
计算机科学, 2019, 46(6A): 250-253.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!