Computer Science ›› 2022, Vol. 49 ›› Issue (5): 227-234. doi: 10.11896/jsjkx.210400179
YU Xin (喻昕), LIN Zhi-liang (林植良)
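Abstract: Optimization problems have long attracted the attention of researchers. Nonsmooth pseudoconvex optimization, a special class of nonconvex optimization, arises frequently in machine learning, signal processing, bioinformatics, and many other areas of science and engineering, and has become a focus of study. Based on the ideas of penalty functions and differential inclusions, this paper proposes a new neural network method for solving nonsmooth pseudoconvex optimization problems with both inequality and equality constraints. Under the given assumptions, the solution of the neural network enters the feasible region in finite time and remains there, and finally converges to the optimal solution set of the optimization problem. Compared with other neural network models, the proposed model has the following advantages: 1) it has a simple, single-layer structure; 2) it does not require an exact penalty factor to be computed in advance; 3) the initial point can be chosen arbitrarily. Numerical experiments in MATLAB show that the proposed network converges to an optimal solution in finite time in every case, whereas existing neural network models applied to the same optimization problems may fail to converge within a useful time, or fail to converge at all, when the initial point is poorly chosen. This further verifies the effectiveness of the proposed neural network and also shows that it has a wider range of application.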
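The penalty-plus-differential-inclusion idea in the abstract can be illustrated with a small numerical sketch. This is not the network proposed in the paper: the toy problem, the function names, the forward-Euler discretization, and in particular the hand-picked fixed penalty weight `sigma` are all assumptions made for illustration (the abstract states that the proposed model avoids precomputing such a factor). The sketch integrates dx/dt ∈ -∂f(x) - σ∂P(x) for the nonsmooth pseudoconvex objective f(x) = log(1 + |x1 - 2| + |x2 + 1|) subject to x1 ≤ 1 and x1 + x2 = 0, whose minimizer is (1, -1).

```python
import math


def sign(v):
    """Return -1, 0, or 1; 0 is one valid subgradient choice at a kink."""
    return (v > 0) - (v < 0)


def objective(x):
    # Hypothetical toy objective: nonsmooth and pseudoconvex, but not convex.
    return math.log(1.0 + abs(x[0] - 2.0) + abs(x[1] + 1.0))


def objective_subgrad(x):
    # One element of the subdifferential of the toy objective.
    scale = 1.0 / (1.0 + abs(x[0] - 2.0) + abs(x[1] + 1.0))
    return [scale * sign(x[0] - 2.0), scale * sign(x[1] + 1.0)]


def penalty_subgrad(x):
    # Subgradient of P(x) = max(0, x1 - 1) + |x1 + x2|, which penalizes
    # violation of the inequality x1 <= 1 and the equality x1 + x2 = 0.
    ineq = 1.0 if x[0] - 1.0 > 0 else 0.0
    eq = sign(x[0] + x[1])
    return [ineq + eq, eq]


def simulate(x0, sigma=2.0, step=5e-4, n_steps=120_000):
    """Forward-Euler integration of dx/dt in -(df + sigma * dP).

    sigma is a fixed, hand-picked penalty weight (an assumption of this
    sketch, not a feature of the paper's model).
    """
    x = list(x0)
    for _ in range(n_steps):
        df = objective_subgrad(x)
        dp = penalty_subgrad(x)
        x = [x[i] - step * (df[i] + sigma * dp[i]) for i in range(2)]
    return x


if __name__ == "__main__":
    for start in [(-3.0, 4.0), (5.0, 5.0)]:
        x = simulate(start)
        print(start, "->", [round(v, 3) for v in x], "f =", round(objective(x), 4))
```

In this toy setting, trajectories started from either infeasible point first move toward the constraint set and then slide along it toward the minimizer, with small chattering caused by the discrete step.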