计算机科学 ›› 2026, Vol. 53 ›› Issue (4): 57-65.doi: 10.11896/jsjkx.251100083

• 人工智能与理论计算机科学交叉融合 • 上一篇    下一篇

基于 DQN 增强遗传算法的 Plateaued 函数高效构造研究

吴严生, 曹心怡, 樊卫北   

  1. 南京邮电大学计算机学院、软件学院、网络空间安全学院 南京 210023
  • 收稿日期:2025-11-17 修回日期:2026-01-16 出版日期:2026-04-15 发布日期:2026-04-08
  • 通讯作者: 吴严生(yanshengwu@njupt.edu.cn)
  • 基金资助:
    国家自然科学基金(62372247);江苏省研究生科研与实践创新计划项目(SJCX24_0326)

Research on Efficient Construction of Plateaued Functions Based on DQN-enhanced Genetic Algorithm

WU Yansheng, CAO Xinyi, FAN Weibei   

  1. College of Computer Science, Nanjing University of Posts and Telecommunications, Nanjing 210023, China
  • Received:2025-11-17 Revised:2026-01-16 Published:2026-04-15 Online:2026-04-08
  • About author:WU Yansheng,born in 1989,postdoctoral,associate professor.His main research interests include coding theory and cryptographic functions.
  • Supported by:
    National Natural Science Foundation of China(62372247) and Jiangsu Province Postgraduate Research and Practice Innovation Program(SJCX24_0326).

摘要: 作为Bent函数的重要推广,Plateaued 函数继承了很多Bent函数的优良密码学性质,具有重要的应用价值。由于传统构造Plateaued 函数的方法存在计算复杂度高、灵活性不足等问题,因此提出一种基于深度 Q 网络(Deep Q-Network,DQN)增强的自适应遗传算法。该算法深度融合 DQN 与遗传算法,构建多维状态空间感知种群进化特征,通过群体共识机制智能选择6种交叉与变异策略组合,实现遗传参数的自适应调控。实验结果表明,该算法的适应度提升幅度达 0.20~0.35,收敛速度更快,稳定性更高,平均可生成 230~300 个有效 Plateaued 函数真值序列,显著优于标准遗传算法和基础 Q-learning 遗传算法。算法能智能调节变异率(0.235~0.276)与交叉操作使用率(70%~90%),在优化 Walsh 谱分布的同时保持种群多样性。尽管计算开销略有增加,但所提算法在解的质量、收敛性能和策略自适应能力上具有显著优势,验证了深度强化学习在密码学函数构造中的有效性,为布尔函数智能化设计提供了新方案。

关键词: Plateaued函数, 真值序列, Q-learning, 深度Q网络, 遗传算法, Walsh谱, 非线性度

Abstract: Plateaued functions,as an important generalization of Bent functions,inherit many of the desirable cryptographic pro-perties of Bent functions and hold significant application value.However,traditional methods for constructing plateaued functions suffer from issues such as high computational complexity and limited flexibility.To address these challenges,this paper proposes an adaptive genetic algorithm enhanced by a deep Q-Network(DQN).This algorithm deeply integrates DQN with the genetic algorithm,constructing a multi-dimensional state space to perceive population evolutionary characteristics.Through a group consensus mechanism,it intelligently selects from six combinations of crossover and mutation strategies,enabling adaptive control of genetic parameters.Experimental results demonstrate that the proposed algorithm achieves a fitness improvement of 0.20~0.35,exhibits faster convergence speed and higher stability,and can generate an average of 230~300 valid Plateaued function truth sequences,significantly outperforming both the standard genetic algorithm and the basic Q-learning-enhanced genetic algorithm.The algorithm intelligently adjusts the mutation rate within the range of 0.235~0.276 and maintains the crossover operation usage rate between 70% and 90%,effectively optimizing the Walsh spectrum distribution while preserving population diversity.Although computational overhead increases slightly,its significant advantages in solution quality,convergence performance,and strategic adaptability validate the effectiveness of deep reinforcement learning in the construction of cryptographic functions,providing a novel approach for the intelligent design of Boolean functions.

Key words: Plateaued function, Truth sequence, Q-learning, Deep Q-Network, Genetic algorithm, Walsh spectrum, Nonlinearity

中图分类号: 

  • TP181
[1]SHANNON C E.A mathematical theory of communication[J].The Bell System Technical Journal,1948,27(3):379-423.
[2]HAMMING R W.Error detecting and error correcting codes[J].The Bell System Technical Journal,1950,29(2):147-160.
[3]BOSE R C,RAY-CHAUDHURI D K.On a class of error correcting binary group codes[J].Information and Control,1960,3(1):68-79.
[4]MACWILLIAMS F J,SLOANE N J A.The theory of error-correcting codes[M].Elsevier,1977.
[5]CARLET C.Partially-bent functions[J].Designs,Codes andCryptography,1993,3(2):135-145.
[6]ZHENG Y,ZHANG X M.Plateaued functions[C]//International Conference on Information and Communications Security.Berlin,Springer,1999:284-300.
[7]HYUN J Y,LEE J,LEE Y.Explicit criteria for construction of plateaued functions[J].IEEE Transactions on Information Theo-ry,2016,62(12):7555-7565.
[8]STANKOVIĆ M,MORAGA C,STANKOVIĆ R.Construction of ternary plateaued functions from quadratic forms for ternary bent functions[C]//2021 IEEE 51st International Symposium on Multiple-Valued Logic(ISMVL).IEEE,2021:1-6.
[9]SUN T F,HU B,YANG Y.Research on the Construction ofPlateaued Functions[J].Journal of Electronics & Information Technology,2018,40(10):2352-2357.
[10]CHEN R,YANG B,LI S,et al.A self-learning genetic algorithm based on reinforcement learning for flexible job-shop scheduling problem[J].Computers & Industrial Engineering,2020,149:106778.
[11]CHENG L,TANG Q,ZHANG L,et al.Scheduling flexiblemanufacturing cell with no-idle flow-lines and job-shop via Q-learning-based genetic algorithm[J].Computers & Industrial Engineering,2022,169:108293.
[12]YANG Q.Research on Adaptive Scheduling of AutomatedWarehousing System Based on Q-learning[D].Hangzhou:Hangzhou Dianzi University,2025.
[13]MIRALLES-PECHUÁN L,JIMÉNEZ F,PONCE H,et al.A methodology based on deep q-learning/genetic algorithms for optimizing covid-19 pandemic government actions[C]//Procee-dings of the 29th ACM International Conference on Information &Knowledge Management.2020:1135-1144.
[14]KUKKER A,SHARMA R.A genetic algorithm assisted fuzzy Q-learning epileptic seizure classifier[J].Computers & Electrical Engineering,2021,92:107154.
[15]LIU F,ZENG G.Study of genetic algorithm with reinforcement learning to solve the TSP[J].Expert Systems with Applications,2009,36(3):6995-7001.
[16]ZHANG Z Y,WANG L,CAI J C,et al.Application research of improved Q-learning genetic algorithm in path planning[J].CAAI Transactions on Intelligent Systems,2025,20(6):1493-1504.
[17]ASTHANA R,VERMA N,RATAN R.Generation of Boolean functions using Genetic Algorithm for cryptographic applications[C]//2014 IEEE International Advance Computing Conference(IACC).IEEE,2014:1361-1366.
[18]CARLET C.On the confusion and diffusion properties of Maiorana-McFarland’s and extended Maiorana-McFarland’s functions[J].Journal of Complexity,2004,20(2/3):182-204.
[19]HODŽIĆ S,PASALIC E,WEI Y,et al.Designing plateauedBoolean functions in spectral domain and their classification[J].IEEE Transactions on Information Theory,2019,65(9):5865-5879.
[20]HODŽIĆ S,PASALIC E,WEI Y.A general framework for secondary constructions of bent and plateaued functions[J].Designs,Codes and Cryptography,2020,88(10):2007-2035.
[21]MARIOT L,LEPORATI A.A genetic algorithm for evolving plateaued cryptographic boolean functions[C]//International Conference on Theory and Practice of Natural Computing.Cham:Springer,2015:33-45.
[22]PICEK S,CARLET C,GUILLEY S,et al.Evolutionary algo-rithms for boolean functions in diverse domains of cryptography[J].Evolutionary Computation,2016,24(4):667-694.
[23]JEONG J,LEE Y.Algorithms for constructing balanced pla-teaued functions with maximal algebraic degrees[J].IEEE Transactions on Information Theory,2023,70(2):1408-1421.
[24]BEHERA P K,GANGOPADHYAY S.An improved hybrid genetic algorithm to construct balanced Boolean function with optimal cryptographic properties[J].Evolutionary Intelligence,2022,15(1):639-653.
[25]PICEK S,MARCHIORI E,BATINA L,et al.Combining evolutionary computation and algebraic constructions to find crypto-graphy-relevant boolean functions[C]//International Confe-rence on Parallel Problem Solving from Nature.Cham:Springer,2014:822-831.
[26]CLIFTON J,LABER E.Q-learning:Theory and applications[J].Annual Review of Statistics and Its Application,2020,7(1):279-301.
[27]FAN J,WANG Z,XIE Y,et al.A theoretical analysis of deep Q-learning[C]//Learning for Dynamics and Control.PMLR,2020:486-489.
[28]GU S,LILLICRAP T,SUTSKEVER I,et al.Continuous deep q-learning with model-based acceleration[C]//International Conference on Machine Learning.PMLR,2016:2829-2838.
[29]WU Y N,ZHANG J.An Improved NSGA-II Algorithm forBullet Distribution[J].Ship Electronic Engineering,2023,43(4):29-33.
[30]PANDA S K,LIU R,XIANG Y.Asymptotic Analysis of Sample-averaged Q-learning[J].IEEE Transactions on Information Theory,2025,71(7):5601-5619.
[31]HE K,CHEN C,CHEN S,et al.Reinforcement Learning for Multi-Objective Optimization:A Review[J/OL].Archives of Computational Methods in Engineering,2025:1-30.https://doi.org/10.1007/s11831-025-10389-3.
[32]MIRJALILI S.Evolutionary algorithms and neural networks[J].Studies in Computational Intelligence,2019,780(1):43-53.
[33]ZOJAJI Z,KAZEMI A.Adaptive reinforcement-based geneticalgorithm for combinatorial optimization[J].Journal of Computing and Security,2022,9(1):71-84.
[1] 文佳, 吴舒霞, 于正欣, 苗旺, 陈哲毅.
基于多目标优化的大规模Hadoop集群虚拟机放置
Multi-objective Optimization for Virtual Machine Placement in Large-scale Hadoop Cluster
计算机科学, 2026, 53(2): 387-395. https://doi.org/10.11896/jsjkx.241200020
[2] 王维, 赵云龙, 彭小玉, 潘小东.
参数协同优化的TSVR增强型TSK模糊系统
TSK Fuzzy System Enhanced by TSVR with Cooperative Parameter Optimization
计算机科学, 2025, 52(7): 75-81. https://doi.org/10.11896/jsjkx.240500086
[3] 王翔, 韩青海, 梁家瑞, 余小莉, 吴麒, 卿利.
面向多用户的任务卸载和服务缓存策略研究
Research on Multi-user Task Offloading and Service Caching Strategies
计算机科学, 2025, 52(7): 307-314. https://doi.org/10.11896/jsjkx.240500036
[4] 黄傲, 李敏, 曾祥光, 潘云伟, 张加衡, 彭倍.
基于PPO的自适应杂交遗传算法求解旅行商问题
Adaptive Hybrid Genetic Algorithm Based on PPO for Solving Traveling Salesman Problem
计算机科学, 2025, 52(6A): 240600096-6. https://doi.org/10.11896/jsjkx.240600096
[5] 王思彤, 林荣恒.
面向异步混合流水车间排产的混合禁忌搜索遗传优化算法
Improved Genetic Algorithm with Tabu Search for Asynchronous Hybrid Flow Shop Scheduling
计算机科学, 2025, 52(4): 271-279. https://doi.org/10.11896/jsjkx.240600049
[6] 赵海霞, 李鑫, 韦永壮.
优良平衡布尔函数的Rank排序混合遗传搜索算法
Rank-sorting Hybrid Genetic Algorithm for Search High Quality Balanced Boolean Functions
计算机科学, 2025, 52(12): 351-357. https://doi.org/10.11896/jsjkx.241200039
[7] 宋日荣, 陈钦文, 陈星.
基于强化学习的分布式Android应用自动化测试方法
Distributed Automated Testing for Android Applications Based on Reinforcement Learning
计算机科学, 2025, 52(12): 40-47. https://doi.org/10.11896/jsjkx.241100054
[8] 李晓耕, 韩校, 肖海怡.
电力监控系统网络空间客体协同防御方法
Cooperative Defense Method for Network Space Object of Power Monitoring System
计算机科学, 2025, 52(11A): 241200158-7. https://doi.org/10.11896/jsjkx.241200158
[9] 稂奥奇, 黄伟杰, 於志勇, 黄昉菀.
城市空气质量数据的时空主动采样与联合推测
Spatiotemporal Active-sampling and Joint Inference of Urban Air Quality Data
计算机科学, 2025, 52(11A): 241000116-9. https://doi.org/10.11896/jsjkx.241000116
[10] 于萍, 颜辉, 鲍杰, 耿晓中.
基于优化模型的MEC网络任务卸载与迁移策略
MEC Network Task Offloading and Migration Strategy Based on Optimization Model
计算机科学, 2025, 52(11A): 241200215-6. https://doi.org/10.11896/jsjkx.241200215
[11] 赵英男, 冷重阳, 韩启龙, 俞程.
基于汤普森采样的自适应安卓程序测试方法
Adaptive Android Program Test Method Based on Thompson Sampling
计算机科学, 2025, 52(11): 330-338. https://doi.org/10.11896/jsjkx.240900150
[12] 徐海涛, 程海燕, 童名文.
基于深度强化学习的自学习排课遗传算法研究
Study on Genetic Algorithm of Course Scheduling Based on Deep Reinforcement Learning
计算机科学, 2024, 51(6A): 230600062-8. https://doi.org/10.11896/jsjkx.230600062
[13] 黄飞, 李永福, 高杨, 夏磊, 廖庆龙, 戴健, 向洪.
基于改进遗传算法的家庭用电调度优化方法
Scheduling Optimization Method for Household Electricity Consumption Based on Improved Genetic Algorithm
计算机科学, 2024, 51(6A): 230600096-6. https://doi.org/10.11896/jsjkx.230600096
[14] 钟雨昂, 袁伟伟, 关东海.
基于softmax的加权Double Q-Learning算法
Weighted Double Q-Learning Algorithm Based on Softmax
计算机科学, 2024, 51(6A): 230600235-5. https://doi.org/10.11896/jsjkx.230600235
[15] 李志博, 李清宝, 兰明敬.
基于ART优化选择策略的遗传算法生成测试数据方法
Method of Generating Test Data by Genetic Algorithm Based on ART Optimal Selection Strategy
计算机科学, 2024, 51(6): 95-103. https://doi.org/10.11896/jsjkx.230100012
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!