
Primal-Dual Reinforcement Learning for Zero-Sum Games in the Optimal Tracking Control

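Keywords: Reinforcement learning; Zero (linguistics); Dual (grammatical number); Reinforcement; Tracking (education); Zero-sum game; Computer science; Control (management); Mathematical optimization; Mathematics; Artificial intelligence; Nash equilibrium; Psychology; Social psychology; Art; Philosophy; Linguistics; Pedagogy; Literature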

Authors

Xuejie Que, Zhenlei Wang

Source

Journal: IEEE Transactions on Circuits and Systems II: Express Briefs [Institute of Electrical and Electronics Engineers]
Date: 2024-01-25 Volume (issue): 71 (6): 3146-3150 Citations: 1

Identifier

DOI: 10.1109/tcsii.2024.3358676

Abstract

The two-player zero-sum game method for solving optimal tracking problems with external disturbance has been extensively explored. However, challenges such as the selection of initial admissible policies and learning errors diminish the accuracy of the Nash equilibrium, even limiting the method's application to some extent. The proposed model-free primal-dual reinforcement learning algorithm utilizes state-input trajectories generated by a set of linearly independent initial vectors to obtain Nash equilibrium without the need for probing noise. Admissible policies for both players are treated as a non-convex constraint and solved from a primal-dual perspective. Simulation results for an inverter confirm that the proposed unbiased learning method not only exhibits superior tracking performance but also demonstrates a faster convergence speed.
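For readers unfamiliar with the setting, the sketch below illustrates what a Nash equilibrium of a two-player zero-sum linear-quadratic game looks like in the simplest scalar case. It is not the paper's model-free primal-dual algorithm: it is plain model-based value iteration on the game Riccati equation, and it assumes known dynamics x[k+1] = a*x[k] + b*u[k] + d*w[k] with stage cost q*x^2 + r*u^2 - gamma^2*w^2. The function name and all parameter values are hypothetical and chosen only for illustration.

```python
def solve_scalar_game_riccati(a, b, d, q, r, gamma, tol=1e-12, max_iter=1000):
    """Value iteration for a scalar zero-sum LQ game (illustrative sketch).

    Assumed model: x[k+1] = a*x[k] + b*u[k] + d*w[k]
    Assumed cost:  sum of q*x^2 + r*u^2 - gamma^2*w^2
    The controller u minimizes the cost, the disturbance w maximizes it.
    Returns (P, K, L) with value P*x^2 and saddle-point policies
    u = -K*x and w = L*x.
    """
    # c collects the opposing influence of the two players on the value.
    c = b * b / r - d * d / (gamma * gamma)
    P = 0.0
    for _ in range(max_iter):
        # The maximizing player's problem is concave only if gamma^2 > d^2 * P.
        if gamma * gamma - d * d * P <= 0.0:
            raise ValueError("gamma is too small: no saddle point exists")
        # Scalar game Riccati recursion: P <- q + a^2 * P / (1 + c * P).
        P_next = q + a * a * P / (1.0 + c * P)
        converged = abs(P_next - P) < tol
        P = P_next
        if converged:
            break
    s = 1.0 + c * P
    K = b * P * a / (r * s)               # minimizing player's feedback gain
    L = d * P * a / (gamma * gamma * s)   # maximizing player's feedback gain
    return P, K, L


if __name__ == "__main__":
    P, K, L = solve_scalar_game_riccati(a=0.9, b=1.0, d=0.5, q=1.0, r=1.0, gamma=5.0)
    print("P =", P, "K =", K, "L =", L)
```

At the returned fixed point neither player can improve by unilaterally changing its gain, which is the equilibrium property the abstract refers to. The learning method described in the abstract aims to reach such an equilibrium from measured state-input trajectories rather than from a known model as assumed here.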




