计算机科学 ›› 2022, Vol. 49 ›› Issue (11): 141-147.doi: 10.11896/jsjkx.220600012
郑顺源, 胡良校, 吕晓倩, 孙鑫, 张盛平
ZHENG Shun-yuan, HU Liang-xiao, LYU Xiao-qian, SUN Xin, ZHANG Sheng-ping
摘要: 皮肤检测作为计算机视觉领域中的研究热点多年来被广泛研究,且仍然是一项具有挑战性的任务。尽管目前的方法在许多常规场景下取得了成功,但仍然存在预测不完整和泛化能力差等问题。针对该问题,提出了一种基于边缘引导的神经网络,并且由大量经过自校正的皮肤检测数据驱动网络训练,实现鲁棒的皮肤检测。首先,提出一种基于多任务学习的网络,对皮肤检测和边缘检测两个任务进行联合优化。进一步,提出边缘注意力模块,将预测所得的边缘检测结果通过该模块重新融合到皮肤检测支路中。最后,提出一种自校正算法,通过借助人体解析任务中的大量低质量数据以增强皮肤检测模型的泛化能力。通过自校正算法对带噪声标签的优化,逐步消除使用带噪声标签进行监督训练的副作用。实验结果表明,所提皮肤检测方法优于现有的其他方法。
中图分类号:
[1]VELUSAM S,PARIHAR R,KINI R,et al.FabSoften:Facebeautification via dynamic skin smoothing,guided feathering,and texture restoration[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops.2020:530-531. [2]CHEN W,WANG K,JIANG H,et al.Skin color modeling for face detection and segmentation:a review and a new approach[J].Multimedia Tools and Applications,2016,75(2):839-862. [3]RAUTARAY S S,AGRAWAL A.Vision based hand gesturerecognition for human computer interaction:a survey[J].Artificial Intelligence Review,2015,43(1):1-54. [4]QIN X,GUO H,HE C,et al.Lightweight human pose estimation:CVC-net[J].Multimedia Tools and Applications,2022,81(13):17615-17637. [5]GOMEZ G,MORALES E.Automatic feature construction and a simple rule induction algorithm for skin detection[C]//Procee-dings of the ICML workshop on Machine Learning in Computer Vision.2002,31-38. [6]CHEDDAD A,CONDELL J,CURRAN K,et al.A new colour space for skin tone detection [C]//Proceedings of the IEEE International Conference on Image Processing(ICIP).IEEE,2009:497-500. [7]YOGARAJAH P,CONDELL J,CURRAN K,et al.A dynamic threshold approach for skin segmentation in color images[C]//Proceedings of the IEEE International Conference on Image Processing.IEEE,2010:2225-2228. [8]JONES M J,REHG J M.Statistical color models with application to skin detection[J].International Journal of Computer Vision,2002,46(1):81-96. [9]PHUNG S L,BOUZERDOUM A,CHAI D.Skin segmentation using color pixel classification:analysis and comparison[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2005,27(1):148-154. [10]TAN W R,CHAN C S,YOGARAJAH P,et al.A fusion approach for efficient human skin detection[J].IEEE Transactions on Industrial Informatics,2011,8(1):138-147. [11]HWANG I,KIM Y,CHO N I.Skin detection based on multi-seed propagation in a multi-layer graph for regional and color consistency[C]// Proceedings of the IEEE International Confe-rence on Acoustics,Speech and Signal Processing.IEEE,2017:1273-1277. [12]PARACCHINI M B M,MARCON M,VILLA F,et al.Fast Skin Segmentation on Low Resolution Grayscale Images for Remote PhotoPlethysmoGraphy[J].IEEE MultiMedia,2022,29(1):28-35. [13]ZUO H,FAN H,BLASCH E,et al.Combining convolutional and recurrent neural networks for human skin detection[J].IEEE Signal Processing Letters,2017,24(3):289-293. [14]KIM Y,HWANG I,CHO N I.Convolutional neural networks and training strategies for skin detection[C]//Proceedings of the IEEE International Conference on Image Processing(ICIP).IEEE,2017:3919-3923. [15]HE Y,SHI J,WANG C,et al.Semi-supervised skin detection by network with mutual guidance [C]//Proceedings of the IEEE/CVF International Conference on Computer Vision.2019:2111-2120. [16]TARASIEWICZ T,NALEPA J,KAWULOK M.Skinny:Alightweight U-net for skin detection and segmentation[C]//Proceedings of the IEEE International Conference on Image Processing(ICIP).IEEE,2020:2386-2390. [17]PANDEY P,TYAGI A K,AMBEKAR S,et al.Unsupervised domain adaptation for semantic segmentation of NIR images through generative latent search[C]//Proceedings of the European Conference on Computer Vision.Cham:Springer,2020:413-429. [18]HU J,SHEN L,SUN G.Squeeze-and-excitation networks[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:7132-7141. [19]CHEN L C,ZHU Y,PAPANDREOU G,et al.Encoder-decoder with atrous separable convolution for semantic image segmentation[C]// Proceedings of the European Conference on Computer Vision(ECCV).2018:801-818. [20]LI P,XU Y,WEI Y,et al.Self-correction for human parsing[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2022,44(6):3260-3271. [21]YAMAGUCHI K,KIAPOUR M H,ORTIZ L E,et al.Parsing clothing in fashion photographs [C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.IEEE,2012:3570-3577. [22]CHEN X,MOTTAGHI R,LIU X,et al.Detect what you can:Detecting and representing objects using holistic models and body parts[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2014:1971-1978. [23]LIANG X,LIU S,SHEN X,et al.Deep human parsing with active template regression[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2015,37(12):2402-2414. [24]GONG K,LIANG X,ZHANG D,et al.Look into person:Self-supervised structure-sensitive learning and a new benchmark for human parsing[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2017:932-940. [25]GONG K,LIANG X,LI Y,et al.Instance-level human parsing via part grouping network[C]// Proceedings of the European Conference on Computer Vision(ECCV).2018:770-785. [26]LI J,ZHAO J,WEI Y,et al.Multiple-human parsing in the wild[J].arXiv:1705.07206,2017. [27]ZHAO J,LI J,CHENG Y,et al.Understanding humans incrowded scenes:Deep nested adversarial learning and a new benchmark for multi-human parsing[C]//Proceedings of the 26th ACM International Conference on Multimedia.2018:792-800. [28]POMA X S,RIBA E,SAPPA A.Dense extreme inception network:Towards a robust cnn model for edge detection[C]//Proceedings of the IEEE/ CVF winter Conference on Applications of Computer Vision.2020:1923-1932. [29]SU Z,LIU W,YU Z,et al.Pixel difference networks for efficient edge detection[C]// Proceedings of the IEEE/CVF Inter-national Conference on Computer Vision.2021:5117-5127. [30]ZHANG G,LU X,TAN J,et al.Refinemask:Towards high-quality instance segmentation with fine-grained features[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2021:6861-6869. [31]TAKIKAWA T,ACUNA D,JAMPANI V,et al.Gated-scnn:Gated shape cnns for semantic segmentation[C]//Proceedings of the IEEE/ CVF International Conference on Computer Vision.2019:5229-5238. [32]ZHAO Y,LI J,ZHANG Y,et al.Multi-class part parsing with joint boundary-semantic awareness [C]//Proceedings of the IEEE/CVF International Conference on Computer Vision.2019:9177-9186. [33]RUAN T,LIU T,HUANG Z,et al.Devil in the details:To-wards accurate single and multiple human parsing[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2019:4814-4821. [34]QIN X,ZHANG Z,HUANG C,et al.Basnet:Boundary-aware salient object detection[C]//Proceedings of the IEEE/CVF Conference on computer Vision and Pattern Recognition.2019:7479-7489. [35]ZHAO J X,LIU J J,FAN D P,et al.EGNet:Edge guidance network for salient object detection[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision.2019:8779-8788. [36]HE K,ZHANG X,REN S,et al.Deep residual learning forimage recognition[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2016:770-778. |
[1] | 程成, 降爱莲. 基于多路径特征提取的实时语义分割方法 Real-time Semantic Segmentation Method Based on Multi-path Feature Extraction 计算机科学, 2022, 49(7): 120-126. https://doi.org/10.11896/jsjkx.210500157 |
[2] | 杜丽君, 唐玺璐, 周娇, 陈玉兰, 程建. 基于注意力机制和多任务学习的阿尔茨海默症分类 Alzheimer's Disease Classification Method Based on Attention Mechanism and Multi-task Learning 计算机科学, 2022, 49(6A): 60-65. https://doi.org/10.11896/jsjkx.201200072 |
[3] | 赵凯, 安卫超, 张晓宇, 王彬, 张杉, 相洁. 共享浅层参数多任务学习的脑出血图像分割与分类 Intracerebral Hemorrhage Image Segmentation and Classification Based on Multi-taskLearning of Shared Shallow Parameters 计算机科学, 2022, 49(4): 203-208. https://doi.org/10.11896/jsjkx.201000153 |
[4] | 杨晓宇, 殷康宁, 候少麒, 杜文仪, 殷光强. 基于特征定位与融合的行人重识别算法 Person Re-identification Based on Feature Location and Fusion 计算机科学, 2022, 49(3): 170-178. https://doi.org/10.11896/jsjkx.210100132 |
[5] | 宋龙泽, 万怀宇, 郭晟楠, 林友芳. 面向出租车空载时间预测的多任务时空图卷积网络 Multi-task Spatial-Temporal Graph Convolutional Network for Taxi Idle Time Prediction 计算机科学, 2021, 48(7): 112-117. https://doi.org/10.11896/jsjkx.201000089 |
[6] | 郭文, 尹童灵, 张天柱, 徐常胜. 时间一致性保持的多任务稀疏深度表达视觉跟踪 Temporal Consistency Preserving Multi-Mask Sparse Deep Representation for Visual Tracking 计算机科学, 2021, 48(6): 110-117. https://doi.org/10.11896/jsjkx.200800212 |
[7] | 宋昱, 孙文赟. 改进非线性结构张量的含噪图像边缘检测 Edge Detection in Images Corrupted with Noise Based on Improved Nonlinear Structure Tensor 计算机科学, 2021, 48(6): 138-144. https://doi.org/10.11896/jsjkx.200600017 |
[8] | 刘小龙, 韩芳, 王直杰. 基于知识表示的联合问答模型 Joint Question Answering Model Based on Knowledge Representation 计算机科学, 2021, 48(6): 241-245. https://doi.org/10.11896/jsjkx.200600011 |
[9] | 周晓进, 徐陈铭, 阮彤. 面向中文电子病历的多粒度医疗实体识别 Multi-granularity Medical Entity Recognition for Chinese Electronic Medical Records 计算机科学, 2021, 48(4): 237-242. https://doi.org/10.11896/jsjkx.200100036 |
[10] | 张春云, 曲浩, 崔超然, 孙皓亮, 尹义龙. 基于过程监督的序列多任务法律判决预测方法 Process Supervision Based Sequence Multi-task Method for Legal Judgement Prediction 计算机科学, 2021, 48(3): 227-232. https://doi.org/10.11896/jsjkx.200700056 |
[11] | 朱戎, 叶宽, 杨博, 谢欢, 赵蕾. 基于改进DeeplabV3+的地物分类方法研究 Feature Classification Method Based on Improved DeeplabV3+ 计算机科学, 2021, 48(11A): 382-385. https://doi.org/10.11896/jsjkx.201100184 |
[12] | 王体爽, 李培峰, 朱巧明. 基于数据增强的中文隐式篇章关系识别方法 Chinese Implicit Discourse Relation Recognition Based on Data Augmentation 计算机科学, 2021, 48(10): 85-90. https://doi.org/10.11896/jsjkx.200800115 |
[13] | 潘祖江, 刘宁, 张伟, 王建勇. 基于层次注意力机制的多任务疾病进展模型 MTHAM:Multitask Disease Progression Modeling Based on Hierarchical Attention Mechanism 计算机科学, 2020, 47(9): 185-189. https://doi.org/10.11896/jsjkx.190900001 |
[14] | 刘俊琦, 李智, 张学阳. 基于视觉显著性的海面船只候选区域检测方法 Candidate Region Detection Method for Maritime Ship Based on Visual Saliency 计算机科学, 2020, 47(6A): 237-241. https://doi.org/10.11896/JsJkx.191000196 |
[15] | 周子钦, 严华. 基于多任务学习的有限样本多视角三维形状识别算法 3D Shape Recognition Based on Multi-task Learning with Limited Multi-view Data 计算机科学, 2020, 47(4): 125-130. https://doi.org/10.11896/jsjkx.190700163 |
|