计算机科学 ›› 2022, Vol. 49 ›› Issue (5): 227-234.doi: 10.11896/jsjkx.210400179

• 人工智能 • 上一篇    下一篇

解决一类非光滑伪凸优化问题的新型神经网络

喻昕, 林植良   

  1. 广西大学计算机与电子信息学院 南宁530004
  • 收稿日期:2021-04-19 修回日期:2021-05-16 出版日期:2022-05-15 发布日期:2022-05-06
  • 通讯作者: 林植良(616312528@qq.com)
  • 作者简介:(616312528@qq.com)
  • 基金资助:
    国家自然科学基金(61862004)

Novel Neural Network for Dealing with a Kind of Non-smooth Pseudoconvex Optimization Problems

YU Xin, LIN Zhi-liang   

  1. School of Computer,Electronics and Information,Guangxi University,Nanning 530004,China
  • Received:2021-04-19 Revised:2021-05-16 Online:2022-05-15 Published:2022-05-06
  • About author:YU Xin,born in 1973,Ph.D,professor,Ph.D supervisor,is a member of China Computer Federation.His main research interests include artificial neural network theory and optimization.
    LIN Zhi-liang,born in 1996,postgra-duate.His main research interests include neural network theory and so on.
  • Supported by:
    National Natural Science Foundation of China(61862004).

摘要: 对优化问题的研究一直以来深受科研工作者的关注。非光滑伪凸优化作为非凸优化中的一类特殊问题,频繁出现在机器学习、信号处理、生物信息学以及各类科学与工程领域中,成为学者们研究的重点。基于罚函数以及微分包含的思想,提出了一种解决带有不等式约束条件和等式约束条件的非光滑伪凸优化问题的新型神经网络方法。在给定的假设条件下,该神经网络的解可以在有限时间内进入可行域并永驻其中,最终收敛到优化问题的最优解集。相比其他神经网络模型,该模型具有以下优点:1)结构简单,为单层模型;2)不需要事先计算精确的惩罚因子;3)初始点可任意选取。在MATLAB环境下,通过数值实验得出,所提网络都能在有限时间内收敛到一个最优解;而用现有的神经网络模型解决同样的优化问题时,若初始点选取不恰当,则会导致状态解不能在有效时间内收敛甚至不能收敛。这不仅进一步地验证了所提神经网络的有效性,同时也说明其具有更广泛的应用范围。

关键词: 惩罚因子, 非光滑伪凸优化, 微分包含, 循环神经网络, 最优解集

Abstract: The research of optimization problem is favored by researchers.Nonsmooth pseudoconvex optimization problems are a special kind of nonconvex optimization problems,which often appear in machine learning,signal processing,bioinformatics and various scientific and engineering fields.Based on the idea of penalty function and differential inclusion,a new neural network me-thod is proposed to solve the non-smooth pseudoconvex optimization problems with inequality constraints and equality constraints.Under given assumptions,the solution of the RNN can enter in the feasible region in finite time and stay there there-after,at last converge to the optimal solution set of the optimization problem.Compared with other neural networks,the RNN has the following advantages:1)simple structure,it is a single-layer model;2)it is not need to compute an exact penalty parameter in advance;3)the initial point is chosed arbitrarily.Under the environment of MATLAB,mathematical simulation experiments show that state solution can converge to the optimal solution.At the same time,if the initial points are not selected properly,the state solution will not converge in limit time even can not converge.This not only verifies the effectiveness of the proposed RNN,but also shows that the proposed network has a wider range of applications.

Key words: Differential inclusion, Nonsmooth pseudoconvex optimization, Optimal solution set, Penalty parameter, Recurrent neural network(RNN)

中图分类号: 

  • TP183
[1]MOKHTAR S B,HANIF D S,SHETTY C M.Nonlinear Programming Theory and Algorithms[M].New York:Wiley,1993.
[2]FRANK H C.Optimization and Nonsmooth Analysis[M].New York:Wilek,1983.
[3]AUBIN J P,FRANKOWSKA H.Set-Valued Analysis[M].Berlin:Birkuser,1990.
[4]FRANK H C.Optimization and Non-Smooth Analysis[M].New York:Wiley,1969.
[5]TANK D W,HOPFIELD J.Simple ‘neural’ optimization networks:An A/D converter,signal decision circuit,and a linear programming circuit[J].IEEE Transactions on Circuits and Systems,1986,33(5):533-541.
[6]KENNEDY M P,CHUA L O.Neural networks for nonlinearprogramming[J].IEEE Transactions on Circuits and Systems,1988,35(5):554-562.
[7]ZHANG S,CONSTANTINIDES A G.Lagarange Programming Neural Networks[J].IEEE Transactions on Circuits and Systems.II,Analog Digit.Signal Process,1992,39(7):441-452.
[8]XIA Y,LEUNG H,WANG J.A projection neural network and its application to constrained optimization problems[J].IEEE Transactions on Circuits and Systems,2002,49(4):447-458.
[9]HU X,WANG J.An improved dual neural network for solving aclass of quadratic programming problems and its k-winners-take-all application[J].IEEE Transactions on Neural Networks,2008,19(12):2022-2031.
[10]LIU S,WANG J.A simplified dual neural network for quadratic programming with its KWTA application[J].IEEE Transactions on Neural Networks,2006,17(6):1500-1510.
[11]FORTI M,NISTRI P,QUINCAMPOIX M.Generalized neural network for nonsmooth nonlinear programming problems[J].IEEE Transactions on Circuits and Systems,2004,51(9):1741-1754.
[12]LI G,SONG S,WU C.Generalized gradient projection neuralnetworks for nonsmooth optimization problems[J].Science China on Information Sciences,2010,53(5):990-1005.
[13]XUE X P,BIAN W.Subgradient-based neural networks for nonsmooth convex optimization problems[J].IEEE Transactions on Circuits and Systems I:Regular Papers,2008,55(8):2378-2391.
[14]BIAN W,XUE X P.Subgradient-based neural networks for nonsmooth nonconvex optimization problems[J].IEEE Transactions on Neural Networks,2009,20(6):1024-1038.
[15]BIAN W,XUE X P.Neural network for solving constrainedconvex optimization problems with global attractivity[J].IEEE Transactions on Circuits and Systems,2013,60(3):710-723.
[16]QIN S T,FAN D,WU G,et al.Neural network for constrained nonsmooth optimization using Tikhonov regularization[J].Neural Networks,2015,63:272-281.
[17]QIN S T,XUE X P.A two-layer recurrent neural network for nonsmooth convex optimization problems[J].IEEE Transactions on Neural Networks and Learning Systems,2015,26(6):1149-1160.
[18]LIU Q,WANG J.A one-layer recurrent neural network for constrained nonsmooth optimization[J].IEEE Transactions on Systems,Man,and Cybernetics,Part B (Cybernetics),2011,41(5):1323-1333.
[19]MARECHAL P,YE J J.Optimizing condition numbers[J].SIAM Journal on Optimization,2009,20(2):935-947.
[20]HU X,WANG J.Solving pseudomonotone variational inequalities and pseudoconvex optimization problems using the projection neural network[J].IEEE Transactions on Neural Networks,2006,17(6):1487-1499.
[21]GUO Z,LIU Q,WANG J.A one-layer recurrent neural network for pseudoconvex optimization subject to linear equality constraints[J].IEEE Transactions on Neural Networks,2011,22(12):1892-1900.
[22]QIN S T,BIAN W,XUE X P.A new one layer recurrent neural network for nonsmooth pseudoconvex optimization[J].Neurocomputing,2013,120:655-662.
[23]LIU Q,GUO Z,WANG J.A one-layer recurrent neural network for constrained pseudoconvex optimization and its application for dynamic portfolio optimization[J].Neural Networks,2012,26:99-109.
[24]LI Q F,LIU Y Q,ZHU L K.Neural network for non-smooth pseudoconvex optimization with general constraints[J].Neurocomputing,2014,131:336-347.
[25]QIN S T,YANG X D,XUE X P,et al.A one layer recurrent neural network for pseudoconvex optimization problems with equality and inequality constraints[J].IEEE Transactions on Cybernetics,2017,47(10):3063-3074.
[26]BIAN W,MA L T,QIN S T,et al.Neural network for non-smooth pseudoconvex optimization with general convex constraints[J].Neural Networks,2018,101:1-14.
[27]HOSSEINI A,WANG J,HOSSEINI S M.A recurrent neuralnetwork for solving a class of generalized convex optimization problems[J].Neural Networks,2013,44:78-86.
[28]CHENG L,HOU Z G,LIN Y Z,et al.Recurrent neural network for non-smooth convex optimization problems with application to the identification of genetic regulatory networks[J].IEEE Transactions on Neural Networks,2011,22(5):714-726.
[29]YU X,WU L Z,XU C H,et al.A novel neural network for solving nonsmooth nonconvex optimization problems[J].IEEE Transactions on Neural Networks and Learning Systems,2020,31(5):1475-1488.
[30]LI W J,BIAN W,XUE X P.Projected neural network for a class of Non-Lipschitz optimization problems with linear constraints[J].IEEE Transactions on Neural Networks and Learning Systems,2020,31(9):3361-3373.
[31]XU C,CHAI Y Y,QIN S T,et al.A neurodynamic approach to nonsmooth constrained pseudoconvex optimization problem[J].Neural Networks,2020,124:180-192.
[32]XIA Y S,WANG J,GUO W Z.Two projection neural networks with reduced model complexity for nonlinear programming[J].IEEE Transactions on Neural Networks and Learning Systems,2020,31(6):2020-2029.
[1] 彭双, 伍江江, 陈浩, 杜春, 李军.
基于注意力神经网络的对地观测卫星星上自主任务规划方法
Satellite Onboard Observation Task Planning Based on Attention Neural Network
计算机科学, 2022, 49(7): 242-247. https://doi.org/10.11896/jsjkx.210500093
[2] 安鑫, 代子彪, 李阳, 孙晓, 任福继.
基于BERT的端到端语音合成方法
End-to-End Speech Synthesis Based on BERT
计算机科学, 2022, 49(4): 221-226. https://doi.org/10.11896/jsjkx.210300071
[3] 时雨涛, 孙晓.
一种会话理解模型的问题生成方法
Conversational Comprehension Model for Question Generation
计算机科学, 2022, 49(3): 232-238. https://doi.org/10.11896/jsjkx.210200153
[4] 李昊, 曹书瑜, 陈亚青, 张敏.
基于注意力机制的用户轨迹识别模型
User Trajectory Identification Model via Attention Mechanism
计算机科学, 2022, 49(3): 308-312. https://doi.org/10.11896/jsjkx.210300231
[5] 肖丁, 张玙璠, 纪厚业.
基于多头注意力机制的用户窃电行为检测
Electricity Theft Detection Based on Multi-head Attention Mechanism
计算机科学, 2022, 49(1): 140-145. https://doi.org/10.11896/jsjkx.210100177
[6] 曾友渝, 谢强.
基于改进RNN和VAR的船舶设备故障预测方法
Fault Prediction Method Based on Improved RNN and VAR for Ship Equipment
计算机科学, 2021, 48(6): 184-189. https://doi.org/10.11896/jsjkx.200700117
[7] 尹久, 池凯凯, 宦若虹.
基于ATT-DGRU的文本方面级别情感分析
Aspect-level Sentiment Analysis of Text Based on ATT-DGRU
计算机科学, 2021, 48(5): 217-224. https://doi.org/10.11896/jsjkx.200500076
[8] 王习, 张凯, 李军辉, 孔芳, 张熠天.
联合自注意力和循环网络的图像标题生成
Generation of Image Caption of Joint Self-attention and Recurrent Neural Network
计算机科学, 2021, 48(4): 157-163. https://doi.org/10.11896/jsjkx.200300146
[9] 陈千, 车苗苗, 郭鑫, 王素格.
一种循环卷积注意力模型的文本情感分类方法
Recurrent Convolution Attention Model for Sentiment Classification
计算机科学, 2021, 48(2): 245-249. https://doi.org/10.11896/jsjkx.200100078
[10] 吕明琪, 洪照雄, 陈铁明.
一种融合时空关联与社会事件的交通流预测方法
Traffic Flow Forecasting Method Combining Spatio-Temporal Correlations and Social Events
计算机科学, 2021, 48(2): 264-270. https://doi.org/10.11896/jsjkx.200300098
[11] 李亚男, 胡宇佳, 甘伟, 朱敏.
基于深度学习的miRNA靶位点预测研究综述
Survey on Target Site Prediction of Human miRNA Based on Deep Learning
计算机科学, 2021, 48(1): 209-216. https://doi.org/10.11896/jsjkx.191200111
[12] 庄世杰, 於志勇, 郭文忠, 黄昉菀.
基于Zoneout的跨尺度循环神经网络及其在短期电力负荷预测中的应用
Short Term Load Forecasting via Zoneout-based Multi-time Scale Recurrent Neural Network
计算机科学, 2020, 47(9): 105-109. https://doi.org/10.11896/jsjkx.190800030
[13] 游兰, 韩雪薇, 何正伟, 肖丝雨, 何渡, 潘筱萌.
基于改进Seq2Seq的短时AIS轨迹序列预测模型
Improved Sequence-to-Sequence Model for Short-term Vessel Trajectory Prediction Using AIS Data Streams
计算机科学, 2020, 47(9): 169-174. https://doi.org/10.11896/jsjkx.190800060
[14] 赫磊, 邵展鹏, 张剑华, 周小龙.
基于深度学习的行为识别算法综述
Review of Deep Learning-based Action Recognition Algorithms
计算机科学, 2020, 47(6A): 139-147. https://doi.org/10.11896/JsJkx.190900176
[15] 张志扬, 张凤荔, 陈学勤, 王瑞锦.
基于分层注意力的信息级联预测模型
Information Cascade Prediction Model Based on Hierarchical Attention
计算机科学, 2020, 47(6): 201-209. https://doi.org/10.11896/jsjkx.200200117
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!