Computer Science ›› 2019, Vol. 46 ›› Issue (11A): 212-215.

• Data Science • Previous Articles     Next Articles

Feature Selection Method Based on Ant Colony Optimization and Random Forest

LI Guang-hua1, LI Jun-qing1,2, ZHANG Liang1, XIN Yan-sen1, DENG Hua-wei1   

  1. (School of Information Science and Engineering,Shandong Agricultural University,Tai’an,Shandong 271018,China)1;
    (Agricultural Big Data Research Center,Shandong Agricultural University,Tai’an,Shandong 271018,China)2
  • Online:2019-11-10 Published:2019-11-20

Abstract: In the face of massive high-dimensional data,eliminating redundant features for feature selection has become one of the important issues faced by information and science and technology today.Traditional feature selection methods are not suitable for searching the whole feature space,and their performance and accuracy are low.In this paper,a me-thod of feature selection based on ant colony optimization and random forest was proposed.This method takes the importance score of random forest as the heuristic factor of ant colony optimization,uses ant colony optimization to search intelligently,and uses the result of feature selection as the evaluation index to feedback the pheromone of ant colony in real time.Experiments show that this feature selection method can effectively reduce the number of features in data sets and improve the accuracy of data classification compared with traditional feature selection methods.

Key words: Ant colony optimization, Feature selection, Random forest

CLC Number: 

  • TP391
[1]姚登举,杨静,詹晓娟.基于随机森林的特征选择算法[J].吉林大学学报(工学版),2014,44(1):137-141.
[2]刘飞飞.特征选择算法及应用综述[J].办公自动化,2018,23(21):47-49.
[3]张翠军,陈贝贝,周冲,等.基于多目标骨架粒子群优化的特征选择算法[J].计算机应用,2018,38(11):3156-3160,3166.
[4]刘依恋.模式分类中特征选择算法研究[D].哈尔滨:哈尔滨理工大学,2014.
[5]BREIMEN L.Random Forests [J].Machine Learning,2001,45(1):5-32.
[6]徐少成,李东喜.基于随机森林的加权特征选择算法[J].统计与决策,2018,34(18):25-28.
[7]杨凯,侯艳,李康.随机森林变量重要性评分及其研究进展[OL].http://www.paper.edu.cn/releasepaper/content/201507-212.
[8]ALBERTO C,MANIEZZO D.Distributed optimization by ant colonies[C]∥Proc of the First European Conf on Artificial Life.Paris:Elsevier Publishing.1991:134-142.
[9]黄丹凤,祁云嵩,许姗娜.基于粗糙集和蚁群算法的特征基因选择方法[J].计算机技术与发展,2012,22(6):68-70,74.
[10]马军建,董增川,王春霞,等.蚁群算法研究进展[J].河海大学学报(自然科学版),2005(2):139-143.
[11]杨丽.基于ReliefF和蚁群算法的特征基因选择方法分析[J].电脑知识与技术,2017,13(32):199-200.
[12]MURPHY P M,AHA D W.UCI repository of machine learning database [DB/OL].(2006-05-12).http://www.ics.uci.edu/mlearn/MLRepository.html.
[13]KIRA K,RENDELL L A.The feature selection problem:Traditional methods and a new algorithm[C]∥AAAI.1992:129-134.
[14]卜华龙,夏静,韩俊波.特征选择算法综述及进展研究[J].巢湖学院学报,2008(6):41-44.
[15]许行,张凯,王文剑.一种小样本数据的特征选择方法[J].计算机研究与发展,2018,55(10):2321-2330.
[16]朱振国,赵凯旋,刘民康.基于强化学习的特征选择算法[J].计算机系统应用,2018,27(10):214-218.
[17]闫春,李亚琪,孙海棠.基于蚁群算法优化随机森林模型的汽车保险欺诈识别研究[J].保险研究,2017(6):114-127.
[19]雷海锐,高秀峰,刘辉.基于机器学习的混合式特征选择算法[J].电子测量技术,2018,41(16):42-46.
[20]邱宁佳,周稳,王鹏,等.一种结合改进CHI和RFFS的特征选择算法研究[J].计算机工程与应用,2018,54(21):133-140.
[21]叶志伟,郑肇葆,万幼川,等.基于蚁群优化的特征选择新方法[J].武汉大学学报(信息科学版),2007(12):1127-1130.
[22]张文杰,蒋烈辉.一种基于遗传算法优化的大数据特征选择方法[J].计算机应用研究,2019:1-5.
[23]李晓岚.基于Relief特征选择算法的研究与应用[D].大连:大连理工大学,2013.
[24]蔡萌萌,张巍巍,王泓霖.大数据时代的数据挖掘综述[J].价值工程,2019,38(5):155-157.
[25]魏茂胜.数据挖掘中的分类算法综述[J].网络安全技术与应用,2017(6):65-66.
[1] LI Bin, WAN Yuan. Unsupervised Multi-view Feature Selection Based on Similarity Matrix Learning and Matrix Alignment [J]. Computer Science, 2022, 49(8): 86-96.
[2] GAO Zhen-zhuo, WANG Zhi-hai, LIU Hai-yang. Random Shapelet Forest Algorithm Embedded with Canonical Time Series Features [J]. Computer Science, 2022, 49(7): 40-49.
[3] HU Yan-yu, ZHAO Long, DONG Xiang-jun. Two-stage Deep Feature Selection Extraction Algorithm for Cancer Classification [J]. Computer Science, 2022, 49(7): 73-78.
[4] KANG Yan, WANG Hai-ning, TAO Liu, YANG Hai-xiao, YANG Xue-kun, WANG Fei, LI Hao. Hybrid Improved Flower Pollination Algorithm and Gray Wolf Algorithm for Feature Selection [J]. Computer Science, 2022, 49(6A): 125-132.
[5] QUE Hua-kun, FENG Xiao-feng, LIU Pan-long, GUO Wen-chong, LI Jian, ZENG Wei-liang, FAN Jing-min. Application of Grassberger Entropy Random Forest to Power-stealing Behavior Detection [J]. Computer Science, 2022, 49(6A): 790-794.
[6] WANG Wen-qiang, JIA Xing-xing, LI Peng. Adaptive Ensemble Ordering Algorithm [J]. Computer Science, 2022, 49(6A): 242-246.
[7] CHU An-qi, DING Zhi-jun. Application of Gray Wolf Optimization Algorithm on Synchronous Processing of Sample Equalization and Feature Selection in Credit Evaluation [J]. Computer Science, 2022, 49(4): 134-139.
[8] SUN Lin, HUANG Miao-miao, XU Jiu-cheng. Weak Label Feature Selection Method Based on Neighborhood Rough Sets and Relief [J]. Computer Science, 2022, 49(4): 152-160.
[9] ZHANG Xiao-qing, FANG Jian-sheng, XIAO Zun-jie, CHEN Bang, Risa HIGASHITA, CHEN Wan, YUAN Jin, LIU Jiang. Classification Algorithm of Nuclear Cataract Based on Anterior Segment Coherence Tomography Image [J]. Computer Science, 2022, 49(3): 204-210.
[10] LI Zong-ran, CHEN XIU-Hong, LU Yun, SHAO Zheng-yi. Robust Joint Sparse Uncorrelated Regression [J]. Computer Science, 2022, 49(2): 191-197.
[11] LIU Zhen-yu, SONG Xiao-ying. Multivariate Regression Forest for Categorical Attribute Data [J]. Computer Science, 2022, 49(1): 108-114.
[12] ZHANG Ye, LI Zhi-hua, WANG Chang-jie. Kernel Density Estimation-based Lightweight IoT Anomaly Traffic Detection Method [J]. Computer Science, 2021, 48(9): 337-344.
[13] YANG Lei, JIANG Ai-lian, QIANG Yan. Structure Preserving Unsupervised Feature Selection Based on Autoencoder and Manifold Regularization [J]. Computer Science, 2021, 48(8): 53-59.
[14] YANG Xiao-qin, LIU Guo-jun, GUO Jian-hui, MA Wen-tao. Full Reference Color Image Quality Assessment Method Based on Spatial and Frequency Domain Joint Features with Random Forest [J]. Computer Science, 2021, 48(8): 99-105.
[15] ZHENG Jian-hua, LI Xiao-min, LIU Shuang-yin, LI Di. Improved Random Forest Imbalance Data Classification Algorithm Combining Cascaded Up-sampling and Down-sampling [J]. Computer Science, 2021, 48(7): 145-154.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!