强化学习
水准点(测量)
计算机科学
数学优化
过程(计算)
人工智能
最优化问题
约束(计算机辅助设计)
先验与后验
参考模型
任务(项目管理)
分解
机器学习
算法
数学
工程类
生态学
哲学
几何学
大地测量学
认识论
生物
地理
操作系统
软件工程
系统工程
作者
Lianbo Ma,Nan Li,Yinan Guo,Xingwei Wang,Shengxiang Yang,Min Huang,Hao Zhang
标识
DOI:10.1109/tcyb.2021.3086501
摘要
The performance of decomposition-based algorithms is sensitive to the Pareto front shapes since their reference vectors preset in advance are not always adaptable to various problem characteristics with no a priori knowledge. For this issue, this article proposes an adaptive reference vector reinforcement learning (RVRL) approach to decomposition-based algorithms for industrial copper burdening optimization. The proposed approach involves two main operations, that is: 1) a reinforcement learning (RL) operation and 2) a reference point sampling operation. Given the fact that the states of reference vectors interact with the landscape environment (quite often), the RL operation treats the reference vector adaption process as an RL task, where each reference vector learns from the environmental feedback and selects optimal actions for gradually fitting the problem characteristics. Accordingly, the reference point sampling operation uses estimation-of-distribution learning models to sample new reference points. Finally, the resultant algorithm is applied to handle the proposed industrial copper burdening problem. For this problem, an adaptive penalty function and a soft constraint-based relaxing approach are used to handle complex constraints. Experimental results on both benchmark problems and real-world instances verify the competitiveness and effectiveness of the proposed algorithm.
科研通智能强力驱动
Strongly Powered by AbleSci AI