Computer Science ›› 2025, Vol. 52 ›› Issue (11A): 241200160-7.doi: 10.11896/jsjkx.241200160

• Image Processing & Multimedia Technology • Previous Articles     Next Articles

Drug Name Recognition Method Based on CRAFT and OCR Technology

XU Ying, LI Xiaoming, YU Fenghao   

  1. College of Computer Science and Electronic Engineering,Hunan University,Changsha 410082,China
  • Online:2025-11-15 Published:2025-11-10

Abstract: In the operation of intelligent pharmacies,it is crucial for robots to accurately identify and retrieve drugs in order to achieve efficient and precise drug selection tasks.This study focuses on drug name recognition methods and proposes a CRAFT-OCR algorithm that integrates CRAFT algorithm and OCR technology to achieve efficient recognition of drug names.Among them,the CRAFT algorithm is used to detect the text area of the medicine box.To improve recognition accuracy,a drug name area localization method based on sorting rules is designed to determine the drug name area,and advanced OCR technology is finally used to complete text recognition.The drug name recognition experiments conduct on the collected dataset of medicine box images show that the accuracy of the CRAFT-OCR method in detecting drug name areas is 96.43%,and the accuracy of text re-cognition is 96.00%.The performance is better than existing algorithms in the literature,providing an effective solution for intelligent drug name recognition.

Key words: Deep learning, Image processing, Text detection, Text recognition, Drug name recognition

CLC Number: 

  • TP317
[1]XIE J L,CHEN M Y,XIE Y Y.Application of Hospital Logistics Robots in Intelligent Pharmacy[J].China Health Standard Management,2020,11(11):20-22.
[2]LIU D Y,ZHANG F S,MENG T,et al.Research on Drug Name Recognition Technology for Medication Boxes Based on Deep Learning[J].Journal of Qingdao University(Natural Science Edition),2021.
[3]BAEK Y,LEE B,HAN D,et al.Character region awareness for text detection[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2019:9365-9374.
[4]SMITH R.An overview of the Tesseract OCR engine[C]//Ninth International Conference on Document Analysis Rnd recognition(ICDAR 2007).IEEE,2007,2:629-633.
[5]LIU Q X,HAN Y W,MING Z.Pill Box Text IdentificationUsing DBNet-CRNN[J].International Journal of Environmental Research and Public Health,2023,20(5):3881.
[6]LIAO M,WAN Z,YAO C,et al.Real-time scene text detection with differentiable binarization[C]//Proceedings of the AAAI Conference on Artificial Intelligence.2020:11474-11481.
[7]LIAO M H,ZOU Z S,WAN Z Y,et al.Real-time scene text detection with differentiable binarization and adaptive scale fusion[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2022,45(1):919-931.
[8]SHI B,BAI X,YAO C.An end-to-end trainable neural network for image-based sequence recognition and its application to scene text recognition[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2016,39(11):2298-2304.
[9]TIAN Z,HUANG W,HE T,et al.Detecting text in natural image with connectionist text proposal network[C]//Computer Vision-ECCV 2016:14th European Conference,Amsterdam,the Netherlands.Springer International Publishing,2016:56-72.
[10]VASWANI A,SHAZEER N,PARMAR N,et al.Attention isall you need[C]//Proceedings of the 31st International Conference on Neural Information Processing Systems.2017:6000-6010.
[11]GUO J,HAN K,WU H,et al.Cmt:Convolutional neural networks meet vision transformers[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2022:12175-12185.
[12]SONG P P,ZENG X J,ZHENG A Y,et al.Natural Scene Text Detection Based on Attention Mechanism[J].Electronic Measurement Technology,2021,44(4):6.
[13]HUANG M,LIU Y,PENG Z,et al.Swintextspotter:Scene text spotting via better synergy between text detection and text recognition[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition.2022:4593-4603.
[14]ZHOU D,ZHANG J,LI C.DiZNet:An end-to-end text detection and recognition algorithm with detail in text zone[J].Journal of Visual Communication and Image Representation,2024,104:104261.
[15]SIMONYAN K,ZISSERMAN A.Very deep convolutional networks for large-scale image recognition[J].arXiv:1409.1556,2014.
[16]IOFFE S,SZEGEDY C.Batch normalization:Accelerating deep network training by reducing internal covariate shift[C]//International Conference on Machine Learning.PMLR,2015:448-456.
[17]RONNEBERGER O,FISCHER P,BROX T.U-net:Convolu-tional networks for biomedical image segmentation[C]//18th International Conference Medical Image Computing and Computer-assisted Intervention(MICCAI 2015).Springer International Publishing,2015:234-241.
[18]HUANG X,SHEN T,WANG R,et al.Text detection and re-cognition in natural scene images[C]//2015 International Conference on Estimation,Detection and Information Fusion(ICEDIF).IEEE,2015:44-49.
[19]WANG J X,WANG Z Y,TIAN X.Review of natural scene text detection and recognition based on deep learning[J].Journal of Software,2020,31(5):1465-1496.
[20]ZHANG G H,FENG Y B,LU W D.Grayscale processing ofimages and acquisition of feature regions[J].Journal of Qiqihar University:Natural Science Edition,2007,23(4):49-52.
[21]DE QUEIROZ R L,BRAUN K M.Color to gray and back:color embedding into textured gray images[J].IEEE transactions on image processing,2006,15(6):1464-1470.
[22]LI Y H,HUANG Z H,XU X,et al.Tilt image correction technique based on Hough transform[J].Journal of Hunan University of Engineering(Natural Science Edition),2019,29(3):30-32.
[23]LIU D Y.Research on Machine Vision based Medicine Box Detection and Drug Inventory Management System[D].Qingdao:Qingdao University,2021.
[24]HAN X,GAO J,YANG C,et al.Focus entirety and perceive en-vironment for arbitrary-shaped text detection[J].arXiv:2409.16827,2024.
[25]HOCHREITER S,SCHMIDHUBER J.Long short-term memory[J].Neural Computation,1997,9(8):1735-1780.
[26]ASTON.Hands on Deep Learning[M].People’s Posts and Tele-communications Press,2019.
[27]LECUN Y,BOTTOU L,BENGIO Y,et al.Gradient-basedlearning applied to document recognition[J].Proceedings of the IEEE,1998,86(11):2278-2324.
[28]ZAREMBA W,SUTSKEVER I,VINYALS O.Recurrent neural network regularization[J].arXiv:1409.2329,2014.
[29]LI Y.Building Atlas Retrieval System Based on OCR Recognition Technology[D].Shijiazhuang:Hebei Normal University,2020.
[1] LIU Wei, XU Yong, FANG Juan, LI Cheng, ZHU Yujun, FANG Qun, HE Xin. Multimodal Air-writing Gesture Recognition Based on Radar-Vision Fusion [J]. Computer Science, 2025, 52(9): 259-268.
[2] YIN Shi, SHI Zhenyang, WU Menglin, CAI Jinyan, YU De. Deep Learning-based Kidney Segmentation in Ultrasound Imaging:Current Trends and Challenges [J]. Computer Science, 2025, 52(9): 16-24.
[3] ZENG Lili, XIA Jianan, LI Shaowen, JING Maike, ZHAO Huihui, ZHOU Xuezhong. M2T-Net:Cross-task Transfer Learning Tongue Diagnosis Method Based on Multi-source Data [J]. Computer Science, 2025, 52(9): 47-53.
[4] LI Yaru, WANG Qianqian, CHE Chao, ZHU Deheng. Graph-based Compound-Protein Interaction Prediction with Drug Substructures and Protein 3D Information [J]. Computer Science, 2025, 52(9): 71-79.
[5] LUO Chi, LU Lingyun, LIU Fei. Partial Differential Equation Solving Method Based on Locally Enhanced Fourier NeuralOperators [J]. Computer Science, 2025, 52(9): 144-151.
[6] LIU Leyuan, CHEN Gege, WU Wei, WANG Yong, ZHOU Fan. Survey of Data Classification and Grading Studies [J]. Computer Science, 2025, 52(9): 195-211.
[7] LIU Zhengyu, ZHANG Fan, QI Xiaofeng, GAO Yanzhao, SONG Yijing, FAN Wang. Review of Research on Deep Learning Compiler [J]. Computer Science, 2025, 52(8): 29-44.
[8] TANG Boyuan, LI Qi. Review on Application of Spatial-Temporal Graph Neural Network in PM2.5 ConcentrationForecasting [J]. Computer Science, 2025, 52(8): 71-85.
[9] ZHENG Cheng, YANG Nan. Aspect-based Sentiment Analysis Based on Syntax,Semantics and Affective Knowledge [J]. Computer Science, 2025, 52(7): 218-225.
[10] CHEN Shijia, YE Jianyuan, GONG Xuan, ZENG Kang, NI Pengcheng. Aircraft Landing Gear Safety Pin Detection Algorithm Based on Improved YOlOv5s [J]. Computer Science, 2025, 52(6A): 240400189-7.
[11] GAO Junyi, ZHANG Wei, LI Zelin. YOLO-BFEPS:Efficient Attention-enhanced Cross-scale YOLOv10 Fire Detection Model [J]. Computer Science, 2025, 52(6A): 240800134-9.
[12] ZHANG Hang, WEI Shoulin, YIN Jibin. TalentDepth:A Monocular Depth Estimation Model for Complex Weather Scenarios Based onMultiscale Attention Mechanism [J]. Computer Science, 2025, 52(6A): 240900126-7.
[13] HUANG Hong, SU Han, MIN Peng. Small Target Detection Algorithm in UAV Images Integrating Multi-scale Features [J]. Computer Science, 2025, 52(6A): 240700097-5.
[14] WANG Baohui, GAO Zhan, XU Lin, TAN Yingjie. Research and Implementation of Mine Gas Concentration Prediction Algorithm Based on Deep Learning [J]. Computer Science, 2025, 52(6A): 240400188-7.
[15] LIU Chengming, LI Haixia, LI Shaochuan, LI Yinghao. Ensemble Learning Model for Stock Manipulation Detection Based on Multi-scale Data [J]. Computer Science, 2025, 52(6A): 240700108-8.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!