基于 DQN 增强遗传算法的 Plateaued 函数高效构造研究

doi:10.11896/jsjkx.251100083

摘要/Abstract

摘要： 作为Bent函数的重要推广,Plateaued 函数继承了很多Bent函数的优良密码学性质,具有重要的应用价值。由于传统构造Plateaued 函数的方法存在计算复杂度高、灵活性不足等问题,因此提出一种基于深度 Q 网络(Deep Q-Network,DQN)增强的自适应遗传算法。该算法深度融合 DQN 与遗传算法,构建多维状态空间感知种群进化特征,通过群体共识机制智能选择6种交叉与变异策略组合,实现遗传参数的自适应调控。实验结果表明,该算法的适应度提升幅度达 0.20~0.35,收敛速度更快,稳定性更高,平均可生成 230~300 个有效 Plateaued 函数真值序列,显著优于标准遗传算法和基础 Q-learning 遗传算法。算法能智能调节变异率(0.235~0.276)与交叉操作使用率(70%~90%),在优化 Walsh 谱分布的同时保持种群多样性。尽管计算开销略有增加,但所提算法在解的质量、收敛性能和策略自适应能力上具有显著优势,验证了深度强化学习在密码学函数构造中的有效性,为布尔函数智能化设计提供了新方案。

关键词: Plateaued函数, 真值序列, Q-learning, 深度Q网络, 遗传算法, Walsh谱, 非线性度

Abstract: Plateaued functions,as an important generalization of Bent functions,inherit many of the desirable cryptographic pro-perties of Bent functions and hold significant application value.However,traditional methods for constructing plateaued functions suffer from issues such as high computational complexity and limited flexibility.To address these challenges,this paper proposes an adaptive genetic algorithm enhanced by a deep Q-Network(DQN).This algorithm deeply integrates DQN with the genetic algorithm,constructing a multi-dimensional state space to perceive population evolutionary characteristics.Through a group consensus mechanism,it intelligently selects from six combinations of crossover and mutation strategies,enabling adaptive control of genetic parameters.Experimental results demonstrate that the proposed algorithm achieves a fitness improvement of 0.20~0.35,exhibits faster convergence speed and higher stability,and can generate an average of 230~300 valid Plateaued function truth sequences,significantly outperforming both the standard genetic algorithm and the basic Q-learning-enhanced genetic algorithm.The algorithm intelligently adjusts the mutation rate within the range of 0.235~0.276 and maintains the crossover operation usage rate between 70% and 90%,effectively optimizing the Walsh spectrum distribution while preserving population diversity.Although computational overhead increases slightly,its significant advantages in solution quality,convergence performance,and strategic adaptability validate the effectiveness of deep reinforcement learning in the construction of cryptographic functions,providing a novel approach for the intelligent design of Boolean functions.

Key words: Plateaued function, Truth sequence, Q-learning, Deep Q-Network, Genetic algorithm, Walsh spectrum, Nonlinearity

中图分类号:

TP181

吴严生, 曹心怡, 樊卫北. 基于 DQN 增强遗传算法的 Plateaued 函数高效构造研究[J]. 计算机科学, 2026, 53(4): 57-65. https://doi.org/10.11896/jsjkx.251100083

WU Yansheng, CAO Xinyi, FAN Weibei. Research on Efficient Construction of Plateaued Functions Based on DQN-enhanced Genetic Algorithm[J]. Computer Science, 2026, 53(4): 57-65. https://doi.org/10.11896/jsjkx.251100083

参考文献

[1]SHANNON C E.A mathematical theory of communication[J].The Bell System Technical Journal,1948,27(3):379-423.
[2]HAMMING R W.Error detecting and error correcting codes[J].The Bell System Technical Journal,1950,29(2):147-160.
[3]BOSE R C,RAY-CHAUDHURI D K.On a class of error correcting binary group codes[J].Information and Control,1960,3(1):68-79.
[4]MACWILLIAMS F J,SLOANE N J A.The theory of error-correcting codes[M].Elsevier,1977.
[5]CARLET C.Partially-bent functions[J].Designs,Codes andCryptography,1993,3(2):135-145.
[6]ZHENG Y,ZHANG X M.Plateaued functions[C]//International Conference on Information and Communications Security.Berlin,Springer,1999:284-300.
[7]HYUN J Y,LEE J,LEE Y.Explicit criteria for construction of plateaued functions[J].IEEE Transactions on Information Theo-ry,2016,62(12):7555-7565.
[8]STANKOVIĆ M,MORAGA C,STANKOVIĆ R.Construction of ternary plateaued functions from quadratic forms for ternary bent functions[C]//2021 IEEE 51st International Symposium on Multiple-Valued Logic(ISMVL).IEEE,2021:1-6.
[9]SUN T F,HU B,YANG Y.Research on the Construction ofPlateaued Functions[J].Journal of Electronics & Information Technology,2018,40(10):2352-2357.
[10]CHEN R,YANG B,LI S,et al.A self-learning genetic algorithm based on reinforcement learning for flexible job-shop scheduling problem[J].Computers & Industrial Engineering,2020,149:106778.
[11]CHENG L,TANG Q,ZHANG L,et al.Scheduling flexiblemanufacturing cell with no-idle flow-lines and job-shop via Q-learning-based genetic algorithm[J].Computers & Industrial Engineering,2022,169:108293.
[12]YANG Q.Research on Adaptive Scheduling of AutomatedWarehousing System Based on Q-learning[D].Hangzhou:Hangzhou Dianzi University,2025.
[13]MIRALLES-PECHUÁN L,JIMÉNEZ F,PONCE H,et al.A methodology based on deep q-learning/genetic algorithms for optimizing covid-19 pandemic government actions[C]//Procee-dings of the 29th ACM International Conference on Information &Knowledge Management.2020:1135-1144.
[14]KUKKER A,SHARMA R.A genetic algorithm assisted fuzzy Q-learning epileptic seizure classifier[J].Computers & Electrical Engineering,2021,92:107154.
[15]LIU F,ZENG G.Study of genetic algorithm with reinforcement learning to solve the TSP[J].Expert Systems with Applications,2009,36(3):6995-7001.
[16]ZHANG Z Y,WANG L,CAI J C,et al.Application research of improved Q-learning genetic algorithm in path planning[J].CAAI Transactions on Intelligent Systems,2025,20(6):1493-1504.
[17]ASTHANA R,VERMA N,RATAN R.Generation of Boolean functions using Genetic Algorithm for cryptographic applications[C]//2014 IEEE International Advance Computing Conference(IACC).IEEE,2014:1361-1366.
[18]CARLET C.On the confusion and diffusion properties of Maiorana-McFarland’s and extended Maiorana-McFarland’s functions[J].Journal of Complexity,2004,20(2/3):182-204.
[19]HODŽIĆ S,PASALIC E,WEI Y,et al.Designing plateauedBoolean functions in spectral domain and their classification[J].IEEE Transactions on Information Theory,2019,65(9):5865-5879.
[20]HODŽIĆ S,PASALIC E,WEI Y.A general framework for secondary constructions of bent and plateaued functions[J].Designs,Codes and Cryptography,2020,88(10):2007-2035.
[21]MARIOT L,LEPORATI A.A genetic algorithm for evolving plateaued cryptographic boolean functions[C]//International Conference on Theory and Practice of Natural Computing.Cham:Springer,2015:33-45.
[22]PICEK S,CARLET C,GUILLEY S,et al.Evolutionary algo-rithms for boolean functions in diverse domains of cryptography[J].Evolutionary Computation,2016,24(4):667-694.
[23]JEONG J,LEE Y.Algorithms for constructing balanced pla-teaued functions with maximal algebraic degrees[J].IEEE Transactions on Information Theory,2023,70(2):1408-1421.
[24]BEHERA P K,GANGOPADHYAY S.An improved hybrid genetic algorithm to construct balanced Boolean function with optimal cryptographic properties[J].Evolutionary Intelligence,2022,15(1):639-653.
[25]PICEK S,MARCHIORI E,BATINA L,et al.Combining evolutionary computation and algebraic constructions to find crypto-graphy-relevant boolean functions[C]//International Confe-rence on Parallel Problem Solving from Nature.Cham:Springer,2014:822-831.
[26]CLIFTON J,LABER E.Q-learning:Theory and applications[J].Annual Review of Statistics and Its Application,2020,7(1):279-301.
[27]FAN J,WANG Z,XIE Y,et al.A theoretical analysis of deep Q-learning[C]//Learning for Dynamics and Control.PMLR,2020:486-489.
[28]GU S,LILLICRAP T,SUTSKEVER I,et al.Continuous deep q-learning with model-based acceleration[C]//International Conference on Machine Learning.PMLR,2016:2829-2838.
[29]WU Y N,ZHANG J.An Improved NSGA-II Algorithm forBullet Distribution[J].Ship Electronic Engineering,2023,43(4):29-33.
[30]PANDA S K,LIU R,XIANG Y.Asymptotic Analysis of Sample-averaged Q-learning[J].IEEE Transactions on Information Theory,2025,71(7):5601-5619.
[31]HE K,CHEN C,CHEN S,et al.Reinforcement Learning for Multi-Objective Optimization:A Review[J/OL].Archives of Computational Methods in Engineering,2025:1-30.https://doi.org/10.1007/s11831-025-10389-3.
[32]MIRJALILI S.Evolutionary algorithms and neural networks[J].Studies in Computational Intelligence,2019,780(1):43-53.
[33]ZOJAJI Z,KAZEMI A.Adaptive reinforcement-based geneticalgorithm for combinatorial optimization[J].Journal of Computing and Security,2022,9(1):71-84.

相关文章 15

[1]	文佳, 吴舒霞, 于正欣, 苗旺, 陈哲毅. 基于多目标优化的大规模Hadoop集群虚拟机放置 Multi-objective Optimization for Virtual Machine Placement in Large-scale Hadoop Cluster 计算机科学, 2026, 53(2): 387-395. https://doi.org/10.11896/jsjkx.241200020
[2]	王维, 赵云龙, 彭小玉, 潘小东. 参数协同优化的TSVR增强型TSK模糊系统 TSK Fuzzy System Enhanced by TSVR with Cooperative Parameter Optimization 计算机科学, 2025, 52(7): 75-81. https://doi.org/10.11896/jsjkx.240500086
[3]	王翔, 韩青海, 梁家瑞, 余小莉, 吴麒, 卿利. 面向多用户的任务卸载和服务缓存策略研究 Research on Multi-user Task Offloading and Service Caching Strategies 计算机科学, 2025, 52(7): 307-314. https://doi.org/10.11896/jsjkx.240500036
[4]	黄傲, 李敏, 曾祥光, 潘云伟, 张加衡, 彭倍. 基于PPO的自适应杂交遗传算法求解旅行商问题 Adaptive Hybrid Genetic Algorithm Based on PPO for Solving Traveling Salesman Problem 计算机科学, 2025, 52(6A): 240600096-6. https://doi.org/10.11896/jsjkx.240600096
[5]	王思彤, 林荣恒. 面向异步混合流水车间排产的混合禁忌搜索遗传优化算法 Improved Genetic Algorithm with Tabu Search for Asynchronous Hybrid Flow Shop Scheduling 计算机科学, 2025, 52(4): 271-279. https://doi.org/10.11896/jsjkx.240600049
[6]	赵海霞, 李鑫, 韦永壮. 优良平衡布尔函数的Rank排序混合遗传搜索算法 Rank-sorting Hybrid Genetic Algorithm for Search High Quality Balanced Boolean Functions 计算机科学, 2025, 52(12): 351-357. https://doi.org/10.11896/jsjkx.241200039
[7]	宋日荣, 陈钦文, 陈星. 基于强化学习的分布式Android应用自动化测试方法 Distributed Automated Testing for Android Applications Based on Reinforcement Learning 计算机科学, 2025, 52(12): 40-47. https://doi.org/10.11896/jsjkx.241100054
[8]	李晓耕, 韩校, 肖海怡. 电力监控系统网络空间客体协同防御方法 Cooperative Defense Method for Network Space Object of Power Monitoring System 计算机科学, 2025, 52(11A): 241200158-7. https://doi.org/10.11896/jsjkx.241200158
[9]	稂奥奇, 黄伟杰, 於志勇, 黄昉菀. 城市空气质量数据的时空主动采样与联合推测 Spatiotemporal Active-sampling and Joint Inference of Urban Air Quality Data 计算机科学, 2025, 52(11A): 241000116-9. https://doi.org/10.11896/jsjkx.241000116
[10]	于萍, 颜辉, 鲍杰, 耿晓中. 基于优化模型的MEC网络任务卸载与迁移策略 MEC Network Task Offloading and Migration Strategy Based on Optimization Model 计算机科学, 2025, 52(11A): 241200215-6. https://doi.org/10.11896/jsjkx.241200215
[11]	赵英男, 冷重阳, 韩启龙, 俞程. 基于汤普森采样的自适应安卓程序测试方法 Adaptive Android Program Test Method Based on Thompson Sampling 计算机科学, 2025, 52(11): 330-338. https://doi.org/10.11896/jsjkx.240900150
[12]	徐海涛, 程海燕, 童名文. 基于深度强化学习的自学习排课遗传算法研究 Study on Genetic Algorithm of Course Scheduling Based on Deep Reinforcement Learning 计算机科学, 2024, 51(6A): 230600062-8. https://doi.org/10.11896/jsjkx.230600062
[13]	黄飞, 李永福, 高杨, 夏磊, 廖庆龙, 戴健, 向洪. 基于改进遗传算法的家庭用电调度优化方法 Scheduling Optimization Method for Household Electricity Consumption Based on Improved Genetic Algorithm 计算机科学, 2024, 51(6A): 230600096-6. https://doi.org/10.11896/jsjkx.230600096
[14]	钟雨昂, 袁伟伟, 关东海. 基于softmax的加权Double Q-Learning算法 Weighted Double Q-Learning Algorithm Based on Softmax 计算机科学, 2024, 51(6A): 230600235-5. https://doi.org/10.11896/jsjkx.230600235
[15]	李志博, 李清宝, 兰明敬. 基于ART优化选择策略的遗传算法生成测试数据方法 Method of Generating Test Data by Genetic Algorithm Based on ART Optimal Selection Strategy 计算机科学, 2024, 51(6): 95-103. https://doi.org/10.11896/jsjkx.230100012

Metrics

Viewed

Full text

Abstract

Cited

Shared

Discussed