Computer Science (计算机科学) ›› 2025, Vol. 52 ›› Issue (11A): 241200160-7. doi: 10.11896/jsjkx.241200160

• Computer Graphics & Multimedia •

Drug Name Recognition Method Based on CRAFT and OCR Technology

XU Ying, LI Xiaoming, YU Fenghao

  1. College of Computer Science and Electronic Engineering, Hunan University, Changsha 410082, China
  • Online: 2025-11-15 Published: 2025-11-10
  • Corresponding author: LI Xiaoming (2337475945@qq.com)
  • About author: hnxy@hnu.edu.cn

Abstract: In intelligent pharmacies, robots must accurately identify and retrieve drugs for drug selection to be efficient and precise. This study focuses on drug name recognition and proposes CRAFT-OCR, an algorithm that combines the CRAFT algorithm with OCR technology to recognize drug names efficiently. The CRAFT algorithm detects the text regions on the medicine box. To improve recognition accuracy, a drug name region localization method based on sorting rules then determines which detected region contains the drug name, and advanced OCR technology finally performs the text recognition. Drug name recognition experiments conducted on a collected dataset of medicine box images show that CRAFT-OCR detects the drug name region with an accuracy of 96.43% and recognizes the text with an accuracy of 96.00%. This performance is better than that of existing algorithms and provides an effective solution for intelligent drug name recognition.

Key words: Deep learning, Image processing, Text detection, Text recognition, Drug name recognition
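The abstract does not state the sorting rules used to locate the drug name region, so the following is only a minimal sketch of what such a step might look like. It assumes, purely for illustration, that the drug name is the text region with the tallest print, with ties broken by area and then by vertical position. The `TextBox` type, the `locate_drug_name` function, and the sample coordinates are all hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class TextBox:
    """Axis-aligned text region, as a detector such as CRAFT might return."""
    x: float       # left edge, in pixels
    y: float       # top edge, in pixels
    width: float
    height: float


def locate_drug_name(boxes: List[TextBox]) -> TextBox:
    """Pick the candidate drug name region using an assumed sorting rule.

    Assumed rule (not from the paper): tallest box first, then largest
    area, then closest to the top of the image.
    """
    if not boxes:
        raise ValueError("no text regions detected")
    ranked = sorted(
        boxes,
        key=lambda b: (-b.height, -(b.width * b.height), b.y),
    )
    return ranked[0]


# Hypothetical detections for one medicine box image.
detections = [
    TextBox(x=20, y=10, width=120, height=18),   # small print near the top
    TextBox(x=30, y=60, width=260, height=48),   # large print
    TextBox(x=30, y=120, width=200, height=22),  # medium print
]
name_box = locate_drug_name(detections)
```

In a full pipeline, the selected region would be cropped from the image and passed to the OCR stage. Ranking the boxes instead of thresholding them means exactly one region is returned even when print sizes vary between medicine boxes.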

CLC Number: 

  • TP317