计算机科学 ›› 2022, Vol. 49 ›› Issue (11): 141-147.doi: 10.11896/jsjkx.220600012

• 计算机图形学&多媒体 • 上一篇    下一篇

基于边缘引导的自校正皮肤检测

郑顺源, 胡良校, 吕晓倩, 孙鑫, 张盛平   

  1. 哈尔滨工业大学计算机科学与技术学院 山东 威海 264209
  • 收稿日期:2022-06-02 修回日期:2022-07-22 出版日期:2022-11-15 发布日期:2022-11-03
  • 通讯作者: 吕晓倩(xiaoqian.hit@gmail.com)
  • 作者简介:(sawyer0503@gmail.com)
  • 基金资助:
    国家自然科学基金(61872112);山东省泰山学者人才计划(tsqn201812106)

Edge Guided Self-correction Skin Detection

ZHENG Shun-yuan, HU Liang-xiao, LYU Xiao-qian, SUN Xin, ZHANG Sheng-ping   

  1. College of Computer Science and Technology,Harbin Institute of Technology,Weihai,Shandong 264209,China
  • Received:2022-06-02 Revised:2022-07-22 Online:2022-11-15 Published:2022-11-03
  • About author:ZHENG Shun-yuan,born in 1997,postgraduate.His main research interests include computer vision and deep lear-ning.
    LYU Xiao-qian,born in 1995,postgra-duate.Her main research interests include object detection,image enhancement and video analysis.
  • Supported by:
    National Natural Science Foundation of China(61872112) and Taishan Scholars Program of Shandong Province(tsqn201812106).

摘要: 皮肤检测作为计算机视觉领域中的研究热点多年来被广泛研究,且仍然是一项具有挑战性的任务。尽管目前的方法在许多常规场景下取得了成功,但仍然存在预测不完整和泛化能力差等问题。针对该问题,提出了一种基于边缘引导的神经网络,并且由大量经过自校正的皮肤检测数据驱动网络训练,实现鲁棒的皮肤检测。首先,提出一种基于多任务学习的网络,对皮肤检测和边缘检测两个任务进行联合优化。进一步,提出边缘注意力模块,将预测所得的边缘检测结果通过该模块重新融合到皮肤检测支路中。最后,提出一种自校正算法,通过借助人体解析任务中的大量低质量数据以增强皮肤检测模型的泛化能力。通过自校正算法对带噪声标签的优化,逐步消除使用带噪声标签进行监督训练的副作用。实验结果表明,所提皮肤检测方法优于现有的其他方法。

关键词: 皮肤检测, 边缘检测, 多任务学习, 自校正算法

Abstract: Skin detection has been a widely studied computer vision topic for many years,whereas remains a challenging task.Previous methods celebrate their success in various ordinary scenarios but still suffer from fragmentary prediction and poor generalization.To address this issue,this paper proposes an edge guided network driven by a massive self-corrected skin detection dataset for robust skin detection.To be specific,a multi-task learning based network which conducts skin detection and edge detection jointly is proposed.The predicted edge map is further converged to the skin detection stream via an edge attention module.Meanwhile,to engage a large-scale of low-quality data from the human parsing task to strengthen the generalization of the network,a self-correction algorithm is adapted to prune the side effect of supervised by noisy labels with continuously polishing up those defects during the training process.Experimental results indicate that the proposed method outperforms the state-of-the-art in skin detection.

Key words: Skin detection, Edge detection, Multi-task learning, Self-correction algorithm

中图分类号: 

  • TP391
[1]VELUSAM S,PARIHAR R,KINI R,et al.FabSoften:Facebeautification via dynamic skin smoothing,guided feathering,and texture restoration[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops.2020:530-531.
[2]CHEN W,WANG K,JIANG H,et al.Skin color modeling for face detection and segmentation:a review and a new approach[J].Multimedia Tools and Applications,2016,75(2):839-862.
[3]RAUTARAY S S,AGRAWAL A.Vision based hand gesturerecognition for human computer interaction:a survey[J].Artificial Intelligence Review,2015,43(1):1-54.
[4]QIN X,GUO H,HE C,et al.Lightweight human pose estimation:CVC-net[J].Multimedia Tools and Applications,2022,81(13):17615-17637.
[5]GOMEZ G,MORALES E.Automatic feature construction and a simple rule induction algorithm for skin detection[C]//Procee-dings of the ICML workshop on Machine Learning in Computer Vision.2002,31-38.
[6]CHEDDAD A,CONDELL J,CURRAN K,et al.A new colour space for skin tone detection [C]//Proceedings of the IEEE International Conference on Image Processing(ICIP).IEEE,2009:497-500.
[7]YOGARAJAH P,CONDELL J,CURRAN K,et al.A dynamic threshold approach for skin segmentation in color images[C]//Proceedings of the IEEE International Conference on Image Processing.IEEE,2010:2225-2228.
[8]JONES M J,REHG J M.Statistical color models with application to skin detection[J].International Journal of Computer Vision,2002,46(1):81-96.
[9]PHUNG S L,BOUZERDOUM A,CHAI D.Skin segmentation using color pixel classification:analysis and comparison[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2005,27(1):148-154.
[10]TAN W R,CHAN C S,YOGARAJAH P,et al.A fusion approach for efficient human skin detection[J].IEEE Transactions on Industrial Informatics,2011,8(1):138-147.
[11]HWANG I,KIM Y,CHO N I.Skin detection based on multi-seed propagation in a multi-layer graph for regional and color consistency[C]// Proceedings of the IEEE International Confe-rence on Acoustics,Speech and Signal Processing.IEEE,2017:1273-1277.
[12]PARACCHINI M B M,MARCON M,VILLA F,et al.Fast Skin Segmentation on Low Resolution Grayscale Images for Remote PhotoPlethysmoGraphy[J].IEEE MultiMedia,2022,29(1):28-35.
[13]ZUO H,FAN H,BLASCH E,et al.Combining convolutional and recurrent neural networks for human skin detection[J].IEEE Signal Processing Letters,2017,24(3):289-293.
[14]KIM Y,HWANG I,CHO N I.Convolutional neural networks and training strategies for skin detection[C]//Proceedings of the IEEE International Conference on Image Processing(ICIP).IEEE,2017:3919-3923.
[15]HE Y,SHI J,WANG C,et al.Semi-supervised skin detection by network with mutual guidance [C]//Proceedings of the IEEE/CVF International Conference on Computer Vision.2019:2111-2120.
[16]TARASIEWICZ T,NALEPA J,KAWULOK M.Skinny:Alightweight U-net for skin detection and segmentation[C]//Proceedings of the IEEE International Conference on Image Processing(ICIP).IEEE,2020:2386-2390.
[17]PANDEY P,TYAGI A K,AMBEKAR S,et al.Unsupervised domain adaptation for semantic segmentation of NIR images through generative latent search[C]//Proceedings of the European Conference on Computer Vision.Cham:Springer,2020:413-429.
[18]HU J,SHEN L,SUN G.Squeeze-and-excitation networks[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:7132-7141.
[19]CHEN L C,ZHU Y,PAPANDREOU G,et al.Encoder-decoder with atrous separable convolution for semantic image segmentation[C]// Proceedings of the European Conference on Computer Vision(ECCV).2018:801-818.
[20]LI P,XU Y,WEI Y,et al.Self-correction for human parsing[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2022,44(6):3260-3271.
[21]YAMAGUCHI K,KIAPOUR M H,ORTIZ L E,et al.Parsing clothing in fashion photographs [C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.IEEE,2012:3570-3577.
[22]CHEN X,MOTTAGHI R,LIU X,et al.Detect what you can:Detecting and representing objects using holistic models and body parts[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2014:1971-1978.
[23]LIANG X,LIU S,SHEN X,et al.Deep human parsing with active template regression[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2015,37(12):2402-2414.
[24]GONG K,LIANG X,ZHANG D,et al.Look into person:Self-supervised structure-sensitive learning and a new benchmark for human parsing[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2017:932-940.
[25]GONG K,LIANG X,LI Y,et al.Instance-level human parsing via part grouping network[C]// Proceedings of the European Conference on Computer Vision(ECCV).2018:770-785.
[26]LI J,ZHAO J,WEI Y,et al.Multiple-human parsing in the wild[J].arXiv:1705.07206,2017.
[27]ZHAO J,LI J,CHENG Y,et al.Understanding humans incrowded scenes:Deep nested adversarial learning and a new benchmark for multi-human parsing[C]//Proceedings of the 26th ACM International Conference on Multimedia.2018:792-800.
[28]POMA X S,RIBA E,SAPPA A.Dense extreme inception network:Towards a robust cnn model for edge detection[C]//Proceedings of the IEEE/ CVF winter Conference on Applications of Computer Vision.2020:1923-1932.
[29]SU Z,LIU W,YU Z,et al.Pixel difference networks for efficient edge detection[C]// Proceedings of the IEEE/CVF Inter-national Conference on Computer Vision.2021:5117-5127.
[30]ZHANG G,LU X,TAN J,et al.Refinemask:Towards high-quality instance segmentation with fine-grained features[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2021:6861-6869.
[31]TAKIKAWA T,ACUNA D,JAMPANI V,et al.Gated-scnn:Gated shape cnns for semantic segmentation[C]//Proceedings of the IEEE/ CVF International Conference on Computer Vision.2019:5229-5238.
[32]ZHAO Y,LI J,ZHANG Y,et al.Multi-class part parsing with joint boundary-semantic awareness [C]//Proceedings of the IEEE/CVF International Conference on Computer Vision.2019:9177-9186.
[33]RUAN T,LIU T,HUANG Z,et al.Devil in the details:To-wards accurate single and multiple human parsing[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2019:4814-4821.
[34]QIN X,ZHANG Z,HUANG C,et al.Basnet:Boundary-aware salient object detection[C]//Proceedings of the IEEE/CVF Conference on computer Vision and Pattern Recognition.2019:7479-7489.
[35]ZHAO J X,LIU J J,FAN D P,et al.EGNet:Edge guidance network for salient object detection[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision.2019:8779-8788.
[36]HE K,ZHANG X,REN S,et al.Deep residual learning forimage recognition[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2016:770-778.
[1] 程成, 降爱莲.
基于多路径特征提取的实时语义分割方法
Real-time Semantic Segmentation Method Based on Multi-path Feature Extraction
计算机科学, 2022, 49(7): 120-126. https://doi.org/10.11896/jsjkx.210500157
[2] 杜丽君, 唐玺璐, 周娇, 陈玉兰, 程建.
基于注意力机制和多任务学习的阿尔茨海默症分类
Alzheimer's Disease Classification Method Based on Attention Mechanism and Multi-task Learning
计算机科学, 2022, 49(6A): 60-65. https://doi.org/10.11896/jsjkx.201200072
[3] 赵凯, 安卫超, 张晓宇, 王彬, 张杉, 相洁.
共享浅层参数多任务学习的脑出血图像分割与分类
Intracerebral Hemorrhage Image Segmentation and Classification Based on Multi-taskLearning of Shared Shallow Parameters
计算机科学, 2022, 49(4): 203-208. https://doi.org/10.11896/jsjkx.201000153
[4] 杨晓宇, 殷康宁, 候少麒, 杜文仪, 殷光强.
基于特征定位与融合的行人重识别算法
Person Re-identification Based on Feature Location and Fusion
计算机科学, 2022, 49(3): 170-178. https://doi.org/10.11896/jsjkx.210100132
[5] 宋龙泽, 万怀宇, 郭晟楠, 林友芳.
面向出租车空载时间预测的多任务时空图卷积网络
Multi-task Spatial-Temporal Graph Convolutional Network for Taxi Idle Time Prediction
计算机科学, 2021, 48(7): 112-117. https://doi.org/10.11896/jsjkx.201000089
[6] 郭文, 尹童灵, 张天柱, 徐常胜.
时间一致性保持的多任务稀疏深度表达视觉跟踪
Temporal Consistency Preserving Multi-Mask Sparse Deep Representation for Visual Tracking
计算机科学, 2021, 48(6): 110-117. https://doi.org/10.11896/jsjkx.200800212
[7] 宋昱, 孙文赟.
改进非线性结构张量的含噪图像边缘检测
Edge Detection in Images Corrupted with Noise Based on Improved Nonlinear Structure Tensor
计算机科学, 2021, 48(6): 138-144. https://doi.org/10.11896/jsjkx.200600017
[8] 刘小龙, 韩芳, 王直杰.
基于知识表示的联合问答模型
Joint Question Answering Model Based on Knowledge Representation
计算机科学, 2021, 48(6): 241-245. https://doi.org/10.11896/jsjkx.200600011
[9] 周晓进, 徐陈铭, 阮彤.
面向中文电子病历的多粒度医疗实体识别
Multi-granularity Medical Entity Recognition for Chinese Electronic Medical Records
计算机科学, 2021, 48(4): 237-242. https://doi.org/10.11896/jsjkx.200100036
[10] 张春云, 曲浩, 崔超然, 孙皓亮, 尹义龙.
基于过程监督的序列多任务法律判决预测方法
Process Supervision Based Sequence Multi-task Method for Legal Judgement Prediction
计算机科学, 2021, 48(3): 227-232. https://doi.org/10.11896/jsjkx.200700056
[11] 朱戎, 叶宽, 杨博, 谢欢, 赵蕾.
基于改进DeeplabV3+的地物分类方法研究
Feature Classification Method Based on Improved DeeplabV3+
计算机科学, 2021, 48(11A): 382-385. https://doi.org/10.11896/jsjkx.201100184
[12] 王体爽, 李培峰, 朱巧明.
基于数据增强的中文隐式篇章关系识别方法
Chinese Implicit Discourse Relation Recognition Based on Data Augmentation
计算机科学, 2021, 48(10): 85-90. https://doi.org/10.11896/jsjkx.200800115
[13] 潘祖江, 刘宁, 张伟, 王建勇.
基于层次注意力机制的多任务疾病进展模型
MTHAM:Multitask Disease Progression Modeling Based on Hierarchical Attention Mechanism
计算机科学, 2020, 47(9): 185-189. https://doi.org/10.11896/jsjkx.190900001
[14] 刘俊琦, 李智, 张学阳.
基于视觉显著性的海面船只候选区域检测方法
Candidate Region Detection Method for Maritime Ship Based on Visual Saliency
计算机科学, 2020, 47(6A): 237-241. https://doi.org/10.11896/JsJkx.191000196
[15] 周子钦, 严华.
基于多任务学习的有限样本多视角三维形状识别算法
3D Shape Recognition Based on Multi-task Learning with Limited Multi-view Data
计算机科学, 2020, 47(4): 125-130. https://doi.org/10.11896/jsjkx.190700163
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!