Computer Science ›› 2024, Vol. 51 ›› Issue (11A): 240300150-6.doi: 10.11896/jsjkx.240300150

• Big Data & Data Science • Previous Articles     Next Articles

BEML:A Blended Learning Analysis Paradigm for Hidden Space Representation of Commodities

ZHENG Qijian, LIU Feng   

  1. School of Computer Science and Technology,East China Normal University,Shanghai 200062,China
  • Online:2024-11-16 Published:2024-11-13
  • About author:ZHENG Qijian,born in 2003,master,is a student member of CCF(No.N9988G).His main research interests include deep learning technology and so on.
    LIU Feng,born in 1988,Ph.D,is a se-nior member of CCF(No.93542S).His main research interests include deep learning technology and blockchain technology.
  • Supported by:
    Research Project of Shanghai Science and Technology Commission(20dz2260300) and Special Fund for Talent Cultivation of Artificial Intelligence Enabled Psychology/Education Interdisciplinary Cross-disciplinary Talents(2024JCRC-10),School of Computer Science and Technology,East China Normal University.

Abstract: With the advent of the Internet economy era,the efficient management of e-commerce platforms has garnered widespread attention from both academia and industry.Among various factors,the accuracy and automation level of product classification directly impact users' experience and the optimization of operational efficiency.In light of this,this study delves into the latent space representation of product information,proposing a blended learning analysis paradigm for product latent space representation(BEML).This framework integrates advanced bidirectional encoder representations from transformers(BERT) techno-logy with traditional machine learning methods,aiming to significantly enhance the efficiency and accuracy of automated product classification through detailed analysis of the latent space of product information.By conducting comparative analysis with current mainstream deep learning and machine learning algorithms,this study validates the exceptional performance of the BEML framework in product classification tasks.Experimental results demonstrate that the BEML framework achieves a macro F1 score of 85.79% and a micro F1 score of 84.73%.Both exceed the current best F1 score of 83.3%,reaching a state of the art.Moreover,this framework not only represents a theoretical innovation but also holds significant practical application value in the realm of information management and automation processing within the e-commerce sector,providing an efficient and reliable blended lear-ning analysis paradigm for the field of technology and business.

Key words: Latent space representation, Pre-trained BERT model, Automated commodity classification, Intelligent commodity classification, Sci-tech driven business

CLC Number: 

  • TP311
[1]LANDAUER T K,FOLTZ P W,LAHAM D.An introduction to latent semantic analysis[J].Discourse Processes,1998,25(2/3):259-284.
[2]CHANG T Z,WILDT A R.Price,product information,and purchase intention:An empirical study[J].Journal of the Academy of Marketing Science,1994,22:16-27.
[3]DEVLIN J,CHANG M W,LEE K,et al.Bert:Pre-training of deep bidirectional transformers for language understanding[J].arXiv:1810.04805,2018.
[4]YANG L,SHIJIA E,XU S,et al.Bert with Dynamic Masked Softmax and Pseudo Labeling for Hierarchical Product Classification[C]//MWPD@ ISWC.2020.
[5]BELTAGY I,LO K,COHAN A.SciBERT:A pretrained language model for scientific text[J].arXiv:1903.10676,2019.
[6]LEE J,YOON W,KIM S,et al.BioBERT:a pre-trained biome-dical language representation model for biomedical text mining[J].Bioinformatics,2020,36(4):1234-1240.
[7]PEETERS R,BIZER C.Dual-objective fine-tuning of BERT for entity matching[J].Proceedings of the VLDB Endowment,2021,14:1913-1921.
[8]ZAHERA H M,SHERIF M.ProBERT:Product Data Classification with Fine-tuning BERT Model[C]//MWPD@ ISWC.2020.
[9]MEUSEL R,PRIMPELI A,MEILICKE C,et al.Exploiting microdata annotations to consistently categorize product offers at web scale[C]//International Conference on Electronic Commerce and Web Technologies.Cham:Springer International Publishing,2015:83-99.
[10]YU H F,HO C H,ARUNACHALAM P,et al.Product titleclassification versus text classification[J].Csie.Ntu.Edu.Tw,2012:1-25.
[11]ZHANG Z,SONG X.An exploratory study on utilising the web of linked data for product data mining[J].SN Computer Science,2022,4(1):15.
[12]LOUIZOS C,SWERSKY K,LI Y,et al.The variational fair autoencoder[J].arXiv:1511.00830,2015.
[13]CHAVALTADA C,PASUPA K,HARDOOND R.A comparative study of machine learning techniques for automatic product categorisation[C]//Advances in Neural Networks(ISNN 2017),Part I 14.Springer International Bublishing,2017:10-17.
[14]RISTOSKI P,PETROVSKI P,MIKAP,et al.A machine lear-ning approach for product matching and categorization[J].Semantic web,2018,9(5):707-728.
[15]LANDAUER T K,DUMAIS S T.A solution to Plato's problem:The latent semantic analysis theory of acquisition,induction,and representation of knowledge[J].Psychological Review,1997,104(2):211-240.
[16]LEE H,YOON Y.Engineering doc2vec for automatic classification of product descriptions on O2O applications[J].Electronic Commerce Research,2018,18:433-456.
[17]ZHANG Z,PARAMITA M.Product classification using microdata annotations[C]//The Semantic Web-ISWC 2019:18th International Semantic Web Conference,Auckland,New Zealand,Part I 18.Springer International Publishing,2019:716-732.
[18]REDDY B,RAMAKANTHA R,LOKESH K.Classification of health care products using hybrid CNN-LSTM model[J].Soft Computing,2023,27:9199-9126.
[19]JAHANSHAHI H,OZYEGEN O,CEVIK M,et al.Text Classification for Predicting Multi-level Product Categories[C]//Proceedings of the 31st Annual International Conference on Computer Science and Software Engineering.2021:33-42.
[20]HEUNG B,HO H C,ZHANG J,et al.An overview and com-parison of machine-learning techniques for classification purposes in digital soil mapping[J].Geoderma,2016,265:62-77.
[1] MENG Xiangfu, LI Tianshuo, ZHANG Xiaoyan. Hybrid Index Structure for Trajectory Range Query Combined with Spatio-Temporal Keywords [J]. Computer Science, 2024, 51(11A): 240200114-8.
[2] YAN Xinyu, HUANG Zengfeng. Robust Distributed Monitoring Algorithm Under Limited Interference [J]. Computer Science, 2024, 51(11A): 240200050-7.
[3] LI Wenwang, ZHOU Haohao, DENG Su, MA Wubin, WU Yahui. Joint Optimization of Delay and Energy Consumption of Tasks Offloading for Vehicular EdgeComputing [J]. Computer Science, 2024, 51(11A): 231000080-7.
[4] ZHONG Linhui, YANG Chaoyi, XIA Zihao, HUANG Qixuan, QU Qiaoqiao, LI Fangyun, SUN Wenbin. Style-oriented Software Architecture Evolution Path Generation Method [J]. Computer Science, 2024, 51(11A): 240100130-9.
[5] PEI Xue, WEI Shuai, SHAO Yangxue, YU Hong, GE Chenyang. Compilation Optimization and Implementation of High-order Cryptographic Operators on FPGA [J]. Computer Science, 2024, 51(11A): 231200184-11.
[6] ZHONG Linhui, ZHU Yanxia, HUANG Qixuan, QU Qiaoqiao, XIA Zihao, ZHENG Yi. Study on Information Enhancement Method of API Structural Pattern Based on EBRCG [J]. Computer Science, 2024, 51(11A): 230900121-10.
[7] WANG Qiong, LU Yue, LIU Shun, LI Qingtao, LIU Yang, WANG Hongbiao, LIU Weiliang. Real-time Collaborative Pricing Mechanism of Between Vehicle and Power Grid Based on Bi-levelOptimization [J]. Computer Science, 2024, 51(11A): 240300013-7.
[8] DING Guangming, ZHAO Yuzhong, ZHENG Yong. Data Exchange and Decision Optimization for Intelligent Maintenance of Xinjiang Ship Locks [J]. Computer Science, 2024, 51(11A): 240800116-4.
[9] LU Yue, WANG Qiong, LIU Shun, LI Qingtao, LIU Yang, WANG Hongbiao. Reinforcement Learning Algorithm for Charging/Discharging Control of Electric Vehicles Considering Battery Loss [J]. Computer Science, 2024, 51(11A): 231200147-7.
[10] YANG Dongsheng, WANG Guiling, ZHENG Xin. Hierarchical Hypergraph-based Attention Neural Network for Service Recommendation [J]. Computer Science, 2024, 51(11): 103-111.
[11] MAO Xinjun, LU Yao. Crowdsourced Learning Method and Its Applications in Course Teaching [J]. Computer Science, 2024, 51(10): 50-55.
[12] WANG Shaopeng, FENG Chunkai. All-chain Sets Mining Algorithm for Multi-scale Nearest Time Series [J]. Computer Science, 2024, 51(10): 247-260.
[13] YAN Xiaoting, WANG Xiaoning, DONG Sheng, ZHAO Yining, XIAO Haili. Review on the Development and Application of Checkpointing Technology in High-performanceComputing [J]. Computer Science, 2024, 51(9): 1-14.
[14] GUO Shuaizhe, GAO Jianhua, JI Weixing. Optimizing Distributed GMRES Algorithm with Mixed Precision [J]. Computer Science, 2024, 51(9): 15-22.
[15] LIU Renyu, CHEN Xin, SHANG Honghui, ZHANG Yunquan. Optimization of Atomic Kinetics Monte Carlo Program TensorKMC Based on Machine Learning Atomic Potential Functions [J]. Computer Science, 2024, 51(9): 23-30.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!