Computer Science ›› 2019, Vol. 46 ›› Issue (6A): 407-411.

• Big Data & Data Mining • Previous Articles     Next Articles

Linear Twin Support Vector Machine Based on Data Distribution Characteristics

SONG Rui-yang1, MENG Hua1,2, LONG Zhi-guo2   

  1. School of Mathematics,Southwest Jiaotong University,Chengdu 611756,China1;
    School of Information Science and Technology,Southwest Jiaotong University,Chengdu 611756,China2
  • Online:2019-06-14 Published:2019-07-02

Abstract: Twin Support Vector Machine(TWSVM) have been successfully applied in many fields.However,the standard TWSVM model have poor robustness when dealing with data classification problems involving distribution characteristics,especially when uncertainty in data fluctuates wildly,the standard classification model,which doesn’t consider the distribution characteristics,is no longer satisfactory for classification accuracy.Therefore,a weighted linear twin support vector machine model based on data distribution characteristics was proposed in this paper.The new model,denoted by TWSVM-U,further considers the influence of data distribution characteristics on the locations of classification hyperplanes,and constructs distance weights quantitatively according to data dispersity at the normal vector directions of classification hyperplanes.TWSVM-U is a generalization of TWSVM.In fact,when training samples do not have distribution characteristics,TWSVM-U model will degenerate to the standard TWSVM model.Experiments with 10-fold cross validation show that the TWSVM-U model performs better than the SVM and the TWSVM on classification problems with large data fluctuation range.

Key words: Binary classification, Twin support vector machine, Uncertain information, Weighted distance

CLC Number: 

  • TP301
[1]CORTES C,VAPNIK V.Support-vector networks[J].Machine Learning,1995,20(3):273-297.
[2]MANGASARIAN O L,WILD E W.Multisurface Proximal Support Vector Machine Classification via Generalized Eigenvalues[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2006,28(1):69-74.
[3]JAYADEVA,KHEMCHANDANI R,CHANDRA S.Twin Support Vector Machines for pattern classification[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2007,29(5):905-910.
[4]KUMAR M A,GOPAL M.Least squares twin support vector machines for pattern classification[J].Expert Systems with Applications,2009,36(4):7535-7543.
[5]PENG X J,XU D.Twin Mahalanobis distance-based support vector machines for pattern recognition[J].Information Sciences,2012,200:22-37.
[6]CHEN S G,WU X J.A new fuzzy twin support vector machine for pattern classification[J].International Journal of Machine Learning and Cybernetics,2018,9:1553-1564.
[7]KHEMCHANDANI R,GOYAL K,CHANDRA S.TWSVR: Regression via Twin Support Vector Machine[J].Neural Networks,2016,74:14-21.
[8]WANG Z,SHAO Y H,BAI L,et al.Twin support vector machine for clustering[J].IEEE Trans Neural Netw Learn Syst,2015,26(10):2583-2588.
[9]CURRAN J M,BUCKLETON J S.An investigation into the performance of methods for adjusting for sampling uncertainty in DNA likelihood ratio calculations[J].Forensic Science International Genetics,2011,5(5):512-516.
[10]MA X,DJOUADI S M,CHARALAMBOUS C D.Optimal Filtering Over Uncertain Wireless Communication Channels[J].IEEE Signal Processing Letters,2011,18(6):359-362.
[11]POWELL W B,BOUZAIENE-AYARI B,BERGER J,et al.The Effect of Robust Decisions on the Cost of Uncertainty in Military Airlift Operations[J].Acm Transactions on Modeling & Computer Simulation,2011,22(1):1-19.
[12]BI J,ZHANG T.Support Vector Classification with Input Data Uncertainty[J].Proc.of Neural Inf.proc.systems,2004,17:161-168.
[13]WENZEL F,GALY-FAJOU T,DEUTSCH M,et al.Bayesian Nonlinear Support Vector Machines for Big Data[C]∥Joint European Conference on Machine Learning and Knowledge Discovery in Databases.Cham,Springer,2017:307-322.
[14]DEISENROTH M P,FOX D,RASMUSSEN C E.Gaussian Processes for Data-Efficient Learning in Robotics and Control[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2015,37(2):408-423.
[15]LANCKRIET G R G,GHAOUI L E,BHATTACHARYYA C,et al.A robust minimax approach to classification[J].Journal of Machine Learning Research,2003,3(3):555-582.
[16]TZELEPIS C,MEZARIS V,PATRAS I.Linear Maximum Margin Classifier for Learning from Uncertain Data[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2017,40(12):2948-2962.
[17]Han R J,Cao Q L.Fuzzy chance constrained least squares twin support vector machine for uncertain classification[J].Journal of Intelligent & Fuzzy Systems,2017,33(5):3041-3049.
[18]JAYADEVA,KHEMCHANDANI R,CHANDRA S.Twin Support Vector Machines:Models,Extensions and Applications [M].Cham,Springer,2017:43-53.
[19]李航.统计学习方法[M].北京:清华大学出版社,2012:225-228.
[20]DUA D,EFI K T.UCI Machine Learning Repository[EB/ OL].http://archive.ics.uci.edu/ml.
[1] ZHU Wei-jun, WANG Xin, ZHONG Ying-hui, FAN Yong-wen, CHEN Yong-hua. Habitability Prediction of Exoplanets Based on GBRT Algorithm [J]. Computer Science, 2019, 46(6A): 71-73.
[2] YU Cheng, ZHU Wan-ning. Tactical Analysis of MOBA Games Based on Hotspot Map of Battlefield [J]. Computer Science, 2018, 45(11A): 149-151.
[3] AN Yue-xuan, DING Shi-fei, HU Ji-pu. Twin Support Vector Machine:A Review [J]. Computer Science, 2018, 45(11): 29-36.
[4] LI Kai, GU Li-feng and HU Shao-fang. Regularized Fuzzy Twin Support Vector Machine [J]. Computer Science, 2017, 44(8): 260-264.
[5] ZHANG Xie-kai and DING Shi-fei. Mahalanobis Distance-based Twin Multi-class Classification Support Vector Machine [J]. Computer Science, 2016, 43(3): 49-53.
[6] . Incremental SVM Intrusion Detection Algorithm Based on Distance Weighted Template Reduction and Attribute Information Entropyc [J]. Computer Science, 2012, 39(12): 76-78.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!