Computer Science ›› 2021, Vol. 48 ›› Issue (4): 97-103.doi: 10.11896/jsjkx.200900053

• Database & Big Data & Data Science •

Customs Commodity HS Code Classification Integrating Text Sequence and Graph Information

DU Shao-hua1, WAN Huai-yu1, WU Zhi-hao1,2, LIN You-fang1,2   

  1. School of Computer and Information Technology, Beijing Jiaotong University, Beijing 100044, China
    2. Key Laboratory of Transport Industry of Big Data Application Technologies for Comprehensive Transport, Beijing 100044, China
  • Received: 2020-06-24  Revised: 2020-10-04  Online: 2021-04-15  Published: 2021-04-09
  • About author: DU Shao-hua, born in 1996, postgraduate. Her main research interests include text mining. (18120357@bjtu.edu.cn)
    WAN Huai-yu, born in 1981, Ph.D, associate professor, Ph.D supervisor, is a member of China Computer Federation. His main research interests include social network mining, text mining, user behavior analysis and spatial-temporal data mining.

Abstract: Customs commodity HS code classification is an important procedure in cross-border trade for enterprises and individuals. HS code classification can be regarded as a text classification problem: given a paragraph of description for a commodity, determine the commodity's category, which is represented by an HS code. However, this task is more challenging than general text classification tasks. First, commodity description texts are organized with a special hierarchical structure. Second, commodity description texts present sequential features at two levels. In addition, the key information in a commodity description text is scattered and the description forms are diverse. Most existing classification methods cannot comprehensively consider all of the above factors to capture the key information in commodity description texts. In this paper, we propose a Text Sequence and Graph Information combination Neural Network (TSGINN) to solve the problem of customs commodity HS code classification. TSGINN formulates HS code classification as a subgraph classification problem over a word co-occurrence network, models associations between non-contiguous words through a graph attention network, and captures multi-level sequential information through a hierarchical long short-term memory network. Experiments on real-world customs datasets show that TSGINN outperforms other classification methods.
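The abstract describes the TSGINN architecture only at a high level. Below is a minimal, illustrative sketch in PyTorch of how graph information from a word co-occurrence subgraph (a single graph attention layer) could be fused with two-level sequential information (word-level and field-level LSTMs) for HS code classification; all class names, dimensions and the fusion strategy are assumptions made for illustration, not the paper's exact implementation.

# Illustrative sketch of a TSGINN-style model (assumed names and dimensions).
import torch
import torch.nn as nn
import torch.nn.functional as F


class SimpleGATLayer(nn.Module):
    # Single-head graph attention over a word co-occurrence subgraph.
    def __init__(self, in_dim, out_dim):
        super().__init__()
        self.W = nn.Linear(in_dim, out_dim, bias=False)
        self.a = nn.Linear(2 * out_dim, 1, bias=False)

    def forward(self, x, adj):
        # x: (n_nodes, in_dim) node features; adj: (n_nodes, n_nodes) 0/1
        # co-occurrence adjacency (self-loops keep the softmax well defined).
        h = self.W(x)
        n = h.size(0)
        h_i = h.unsqueeze(1).expand(n, n, -1)
        h_j = h.unsqueeze(0).expand(n, n, -1)
        e = F.leaky_relu(self.a(torch.cat([h_i, h_j], dim=-1)).squeeze(-1))
        e = e.masked_fill(adj == 0, float('-inf'))
        alpha = torch.softmax(e, dim=-1)        # attention over neighbours
        return F.elu(alpha @ h)


class TSGINNSketch(nn.Module):
    # Combines graph information (attention over the word co-occurrence subgraph)
    # with two-level sequential information (word-level and field-level LSTMs),
    # then classifies the fused representation into HS code categories.
    def __init__(self, vocab_size, emb_dim, hid_dim, n_classes):
        super().__init__()
        self.emb = nn.Embedding(vocab_size, emb_dim)
        self.gat = SimpleGATLayer(emb_dim, hid_dim)
        self.word_lstm = nn.LSTM(emb_dim, hid_dim, batch_first=True)
        self.field_lstm = nn.LSTM(hid_dim, hid_dim, batch_first=True)
        self.classifier = nn.Linear(2 * hid_dim, n_classes)

    def forward(self, fields, node_ids, adj):
        # fields:   list of (1, field_len) word-id tensors, one per field of the
        #           hierarchically structured commodity description
        # node_ids: (n_nodes,) ids of the unique words forming the subgraph
        # adj:      (n_nodes, n_nodes) co-occurrence adjacency with self-loops
        field_reprs = []
        for f in fields:
            _, (h, _) = self.word_lstm(self.emb(f))      # word-level sequence
            field_reprs.append(h[-1])                    # (1, hid_dim)
        seq = torch.stack(field_reprs, dim=1)            # (1, n_fields, hid_dim)
        _, (h_doc, _) = self.field_lstm(seq)             # field-level sequence
        graph_nodes = self.gat(self.emb(node_ids), adj)  # (n_nodes, hid_dim)
        graph_repr = graph_nodes.mean(dim=0, keepdim=True)  # subgraph read-out
        return self.classifier(torch.cat([h_doc[-1], graph_repr], dim=-1))

In such a setup, each commodity description would be split into its hierarchical fields and paired with the subgraph its words induce on the co-occurrence network, and the model would be trained with a cross-entropy loss over HS code labels.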

Key words: Customs commodity, Graph attention network, HS code, Multi-level sequential information, Text classification

CLC Number: TP391