Computer Science (计算机科学), 2025, Vol. 52, Issue (11A): 241200160-7. doi: 10.11896/jsjkx.241200160
许莹, 厉小明, 于丰豪
XU Ying, LI Xiaoming, YU Fenghao
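Abstract: In an intelligent pharmacy, drugs can be picked efficiently and precisely only if the robot identifies each drug correctly before retrieving it. This paper focuses on drug-name recognition and proposes CRAFT-OCR, an algorithm that combines the CRAFT algorithm with OCR technology to recognize drug names efficiently. The CRAFT algorithm detects the text regions on the medicine box. To improve recognition accuracy, a drug-name region localization method based on sorting rules is designed to determine which detected region contains the drug name. Finally, an advanced OCR technique performs the text recognition. In drug-name recognition experiments on a collected dataset of medicine-box images, CRAFT-OCR detects the drug-name region with an accuracy of 96.43% and recognizes the text with an accuracy of 96.00%, outperforming existing algorithms and providing an effective solution for intelligent drug-name recognition.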
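The localization step described in the abstract, choosing one drug-name region from the detected text regions by sorting them, might be sketched as follows. This is only an illustration under stated assumptions: the abstract does not give the paper's actual sorting rule, so the rule used here (tallest text first, ties broken by area) is a guess, and `TextBox`, `rank_boxes`, and `pick_name_region` are hypothetical names rather than part of CRAFT or any OCR library.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class TextBox:
    """Axis-aligned text region, in the form a detector such as CRAFT might yield."""
    x: float  # left edge, pixels
    y: float  # top edge, pixels
    w: float  # width, pixels
    h: float  # height, pixels

    @property
    def area(self) -> float:
        return self.w * self.h


def rank_boxes(boxes: List[TextBox]) -> List[TextBox]:
    """Sort candidates by an ASSUMED rule: taller text first, then larger area.

    The heuristic that drug names are printed in the largest type is an
    assumption for illustration, not the rule used in the paper.
    """
    return sorted(boxes, key=lambda b: (b.h, b.area), reverse=True)


def pick_name_region(boxes: List[TextBox]) -> Optional[TextBox]:
    """Return the top-ranked box, or None when no text was detected."""
    ranked = rank_boxes(boxes)
    return ranked[0] if ranked else None


# Hypothetical detections from one medicine-box image.
boxes = [
    TextBox(x=10, y=10, w=120, h=14),   # small line, e.g. manufacturer
    TextBox(x=10, y=40, w=200, h=48),   # large text, candidate drug name
    TextBox(x=10, y=100, w=180, h=20),  # medium line, e.g. dosage
]
best = pick_name_region(boxes)
```

In a full pipeline, the selected region would presumably be cropped from the image and passed to the OCR stage; that step is omitted here because the abstract does not name the OCR engine.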