计算机科学 ›› 2026, Vol. 53 ›› Issue (5): 207-217.doi: 10.11896/jsjkx.251100057
陈琦1,2, 陈星凯1,2, 张辉煌1,2, 苏一平3, 胡海根1,2
CHEN Qi1,2, CHEN Xingkai1,2, ZHANG Huihuang1,2, SU Yiping3, HU Haigen1,2
摘要: 雄激素性脱发的临床诊断主要依赖于人工毛发计数,该方法不仅繁琐耗时,且主观性强,其严重制约了临床诊断的效率与一致性。因此,开发能够实现毛发自动识别与计数的智能辅助诊断技术具有重要的临床意义。然而,由于毛发结构纤细,且缺乏像素级标注的高质量训练数据,现有的监督检测方法仍面临巨大挑战。为应对该挑战,提出了一种名为 uHairDet 的毛发检测新方法,其核心思路是通过合成具有标注信息的毛发图像来降低对手工标注数据的依赖。该方法包含3个部分:1)提出了自标注毛发数据生成器(HBDG),用于合成具有像素级标注信息、结构合理的毛发图像;2)提出了结合语义感知自适应纠错损失(HEE Loss) 的结构稳定风格迁移模型(BS-GAN),以提升在风格迁移过程中图像结构的稳定性;3)引入了基于Mean-Teacher的框架的FCOS+PSC检测模型,配合提出的适用于毛发的OBB标注范式对模型进行训练,解决了传统毛发标注范式丢失大量毛发有效信息的问题。结果表明,在完全无须手工标注数据的情况下,该方法取得了 56.9% 的平均精度,显著优于多种基线模型,有望为皮肤科毛发疾病的智能辅助诊断建立一种新范式。
中图分类号:
| [1]KIM M,KANG S,LEE B D.Evaluation of automated measurement of hair density using deep neural networks[J].Sensors,2022,22(2):650. [2]KIM J H,KWON S,FU J,et al.Hair follicle classification and hair loss severity estimation using mask R-CNN[J].Journal of Imaging,2022,8(10):283. [3]WANG Y F,HSU M H,WANG M Y F.Comparative analysis of automatic hair count estimation methods[J].Archives of Dermatological Research,2025,317(1):361. [4]CHEN M C,CHANG W J,XIAO Y X,et al.A hair volumeanalysis system using deep learning and image processing technologies[C]//2021 IEEE International Conference on Consumer Electronics-Taiwan(ICCE-TW).IEEE,2021:1-2. [5]BOCHKOVSKIY A,WANG C Y,LIAO H Y M.Yolov4:Opti-mal speed and accuracy of object detection[J].arXiv:2004.10934,2020. [6]ZHU J Y,PARK T,ISOLA P,et al.Unpaired image-to-imagetranslation using cycle-consistent adversarial networks[C]//Proceedings of the IEEE International Conference on Computer Vision.2017:2223-2232. [7]WANG X,YU J.Learning to cartoonize using white-box cartoon representations[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2020:8090-8099. [8]CHOI Y,UH Y,YOO J,et al.v2:Diverse image synthesis for multiple domains.In 2020 IEEE[C]//CVF Conference on Computer Vision and Pattern Recognition(CVPR).2020:8185-8194. [9]PARK T,EFROS A A,ZHANG R,et al.Contrastive learning for unpaired image-to-image translation[C]//European Confe-rence on Computer Vision.Cham:Springer,2020:319-345. [10]YI R,LIU Y J,LAI Y K,et al.Unpaired portrait drawing generation via asymmetric cycle mapping[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2020:8217-8225. [11]TARVAINEN A,VALPOLA H.Mean teachers are better role models:Weight-averaged consistency targets improve semi-supervised deep learning results[C]//Proceedings of the 31st International Conference on Neural Information Processing Systems.2017:1195-1204. [12]SHIH H C.An unsupervised hair segmentation and countingsystem in microscopy images[J].IEEE Sensors Journal,2014,15(6):3565-3572. [13]JÄHNE B.Digital image processing[M].Berlin:Springer,2005. [14]SHIH H C,LIU E R.Adaptive region merging approach formorphological color image segmentation[C]//Proceedings of the ACCV.2014:1-14. [15]OTSU N.A threshold selection method from gray-level histograms[J].Automatica,1975,11(285-296):23-27. [16]ZHANG Q,EUN S J.Design and implementation of an automatic hair counting system[J].Journal of Digital Art Enginee-ring and Multimedia,2014,1(2):75. [17]HOUGH P V C,POWELL B W.A method for faster analysis of bubble chamber photographs[J].Il Nuovo Cimento(1955-1965),1960,18(6):1184-1191. [18]KIM W,KIM H,REW J,et al.A hair density measuring scheme using smartphone[C]//Annual Conference of KIPS.Korea Information Processing Society,2015:1416-1419. [19]GATYS L A,ECKER A S,BETHGE M.A neural algorithm of artistic style[J].arXiv:1508.06576,2015. [20]HUANG X,BELONGIE S.Arbitrary style transfer inreal-time with adaptive instance normalization[C]//Proceedings of the IEEE International Conference on Computer Vision.2017:1501-1510. [21]GOODFELLOW I J,POUGET-ABADIE J,MIRZA M,et al.Generative adversarial nets[C]//Proceedings of the 28st International Conference on Neural Information Processing Systems.2014. [22]CHO H,LEE J,CHANG S,et al.One-shot structure-awarestylized image synthesis[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2024:8302-8311. [23]WANG Z,ZHAO L,XING W.Stylediffusion:Controllable disentangled style transfer via diffusion models[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision.2023:7677-7689. [24]EVERAERT M N,BOCCHIO M,ARPA S,et al.Diffusion instyle[C]//Proceedings of the IEEE/CVF International Confe-rence on Computer Vision.2023:2251-2261. [25]YANG S,HWANG H,YE J C.Zero-shot contrastive loss fortext-guided diffusion image style transfer[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision.2023:22873-22882. [26]KWON G,YE J C.Diffusion-based image translation using disentangled style and content representation[J].arXiv:2209.15264,2022. [27]ZHANG Y,HUANG N,TANG F,et al.Inversion-based creativity transfer with diffusion models[J].arXiv:2211.13203,2022. [28]ZHANG Y,DONG W,TANG F,et al.Prospect:Expanded conditioning for the personalization of attribute-aware image gene-ration[C]//CoRR.2023. [29]MOU C,WANG X,XIE L,et al.T2i-adapter:Learning adapters to dig out more controllable ability for text-to-image diffusion models[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2024:4296-4304. [30]ZHANG Z,ZHANG Q,XING W,et al.Artbank:Artistic style transfer with pre-trained diffusion model and implicit style prompt bank[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2024:7396-7404. [31]DALAL N,TRIGGS B.Histograms of oriented gradients forhuman detection[C]//2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition(CVPR’05).IEEE,2005:886-893. [32]LOWE D G.Distinctive image features from scale-invariant keypoints[J].International Journal of Computer Vision,2004,60(2):91-110. [33]MA J,SHAO W,YE H,et al.Arbitrary-oriented scene text detection via rotation proposals[J].IEEE Transactions on Multimedia,2018,20(11):3111-3122. [34]YANG X,ZHOU Y,ZHANG G,et al.The KFIoU loss for rotated object detection[J].arXiv:2201.12558,2022. [35]TIAN Z,SHEN C,CHEN H,et al.FCOS:A simple and strong anchor-free object detector[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2020,44(4):1922-1933. [36]YU Y,DA F.Phase-shifting coder:Predicting accurate orientation in oriented object detection[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2023:13354-13363. [37]GANIN Y,USTINOVA E,AJAKAN H,et al.Domain-adversarial training of neural networks[J].Journal of Machine Lear-ning Research,2016,17(59):1-35. [38]GANIN Y,LEMPITSKY V.Unsupervised domain adaptation by backpropagation[C]//International Conference on Machine Learning.PMLR,2015:1180-1189. [39]ZHANG W,OUYANG W,LI W,et al.Collaborative and adversarial network for unsupervised domain adaptation[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:3801-3809. [40]PEI Z,CAO Z,LONG M,et al.Multi-adversarial domain adaptation[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2018. [41]TZENG E,HOFFMAN J,ZHANG N,et al.Deep domain confusion:Maximizing for domain invariance[J].arXiv:1412.3474,2014. [42]LONG M,CAO Y,WANG J,et al.Learning transferable fea-tures with deep adaptation networks[C]//International Confe-rence on Machine Learning.PMLR,2015:97-105. [43]LONG M,ZHU H,WANG J,et al.Unsupervised domain adaptation with residual transfer networks[C]//Proceedings of the 30st International Conference on Neural Information Processing Systems.2016:136-144. [44]LONG M,ZHU H,WANG J,et al.Deep transfer learning with joint adaptation networks[C]//International Conference on Machine Learning.PMLR,2017:2208-2217. [45]GHIFARY M,KLEIJN W B,ZHANG M,et al.Deep recon-struction-classification networks for unsupervised domain adaptation[C]//European Conference on Computer Vision.Cham:Springer,2016:597-613. [46]BOUSMALIS K,TRIGEORGIS G,SILBERMAN N,et al.Domain separation networks[C]//NIPS 2016.2016. [47]LI Y J,DAI X,MA C Y,et al.Cross-domain adaptive teacher for object detection[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2022:7581-7590. [48]KENNERLEY M,WANG J G,VEERAVALLI B,et al.2pcnet:Two-phase consistency training for day-to-night unsupervised domain adaptive object detection[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2023:11484-11493. [49]LIN T Y,GOYAL P,GIRSHICK R,et al.Focal loss for dense object detection[C]//Proceedings of the IEEE International Conference on Computer Vision.2017:2980-2988. [50]YANG Z,LIU S,HU H,et al.Reppoints:Point set representation for object detection[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision.2019:9657-9666. [51]LI W,CHEN Y,HU K,et al.Oriented reppoints for aerial object detection[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2022:1829-1838. [52]HOU L,LU K,XUE J,et al.Shape-adaptive selection and mea-surement for oriented object detection[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2022:923-932. [53]WEN L,GAO C,ZOU C.CAP-VSTNet:Content affinity preserved versatile style transfer[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2023:18300-18309. [54]ZHOU Y,YANG X,ZHANG G,et al.Mmrotate:A rotated object detection benchmark using pytorch[C]//Proceedings of the 30th ACM International Conference on Multimedia.2022:7331-7334. [55]XIA G S,BAI X,DING J,et al.DOTA:A large-scale dataset for object detection in aerial images[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:3974-3983. [56]ZHANG R,ISOLA P,EFROS A A,et al.The unreasonable effectiveness of deep features as a perceptual metric[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.2018:586-595. [57]HEUSEL M,RAMSAUER H,UNTERTHINER T,et al.GANs trained by a two time-scale update rule converge to a local nash equilibrium[C]//NIPS 2017.2017. [58]BROOKS T,HOLYNSKI A,EFROS A A.Instructpix2pix:Learning to follow image editing instructions[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2023:18392-18402. [59]XU Z.SkinDualGen:Prompt-Driven Diffusion for Simultaneous Image-Mask Generation in Skin Lesions[J].arXiv:2507.19970,2025. [60]CAO S,JOSHI D,GUI L Y,et al.Contrastive mean teacher for domain adaptive object detectors[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2023:23839-23848. [61]TRANHEDEN W,OLSSON V,PINTO J,et al.Dacs:Domainadaptation via cross-domain mixed sampling[C]//Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision.2021:1379-1389. |
|
||