Expectation Optimization with Probabilistic Guarantees in POMDPs with Discounted-Sum Objectives

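Stochastic game; Probabilistic logic; Mathematical optimization; Markov decision process; Computer science; Range (aeronautics); Markov process; Mathematical economics; Mathematics; Artificial intelligence; Statistics; Composite material; Materials science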

Authors

Krishnendu Chatterjee, Adrián Elgyütt, Petr Novotný, Owen Rouillé

Links

ijcai.org arxiv.org doi.org

Identifiers

DOI: 10.24963/ijcai.2018/652

Abstract

Partially-observable Markov decision processes (POMDPs) with discounted-sum payoff are a standard framework for modeling a wide range of problems related to decision making under uncertainty. Traditionally, the goal has been to obtain policies that optimize the expectation of the discounted-sum payoff. A key drawback of the expectation measure is that even low-probability events with extreme payoff can significantly affect the expectation, and thus the obtained policies are not necessarily risk-averse. An alternative approach is to optimize the probability that the payoff is above a certain threshold, which yields risk-averse policies but ignores optimization of the expectation. We consider the expectation optimization with probabilistic guarantee (EOPG) problem, where the goal is to optimize the expectation while ensuring that the payoff is above a given threshold with at least a specified probability. We present several results on the EOPG problem, including the first algorithm to solve it.
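To make the EOPG objective concrete, the following is a minimal Python sketch of the selection criterion only. It is not the algorithm from the paper and does not solve a POMDP. It assumes, purely for illustration, that each candidate policy is already summarized as a finite list of (probability, reward sequence) outcomes, that reward sequences are finite, and that "above the threshold" means greater than or equal to it. The names `discounted_sum`, `evaluate`, `best_feasible`, and the two toy policies are hypothetical.

```python
from typing import Dict, List, Optional, Tuple

# Hypothetical representation: a policy is summarized by a finite list of
# (probability, reward sequence) outcomes. This is an illustrative assumption.
Outcome = Tuple[float, List[float]]


def discounted_sum(rewards: List[float], gamma: float) -> float:
    """Discounted-sum payoff: sum over t of gamma**t * rewards[t]."""
    return sum(r * gamma ** t for t, r in enumerate(rewards))


def evaluate(outcomes: List[Outcome], gamma: float,
             threshold: float) -> Tuple[float, float]:
    """Return (expected payoff, probability that payoff >= threshold)."""
    expectation = 0.0
    prob_ok = 0.0
    for prob, rewards in outcomes:
        payoff = discounted_sum(rewards, gamma)
        expectation += prob * payoff
        if payoff >= threshold:
            prob_ok += prob
    return expectation, prob_ok


def best_feasible(policies: Dict[str, List[Outcome]], gamma: float,
                  threshold: float, alpha: float) -> Optional[str]:
    """Among policies meeting the guarantee P(payoff >= threshold) >= alpha,
    return the name of the one with the highest expectation (None if none)."""
    best_name, best_exp = None, float("-inf")
    for name, outcomes in policies.items():
        exp, prob = evaluate(outcomes, gamma, threshold)
        if prob >= alpha and exp > best_exp:
            best_name, best_exp = name, exp
    return best_name


# Toy data (made up): "risky" has a rare extreme payoff, "safe" is steady.
policies = {
    "risky": [(0.1, [100.0, 0.0]), (0.9, [0.0, 0.0])],
    "safe": [(1.0, [4.0, 4.0])],
}

# With a 0.9 guarantee on reaching payoff 5, only "safe" qualifies.
choice_with_guarantee = best_feasible(policies, gamma=0.5, threshold=5.0, alpha=0.9)
# With no guarantee (alpha = 0), pure expectation optimization picks "risky".
choice_expectation_only = best_feasible(policies, gamma=0.5, threshold=5.0, alpha=0.0)
```

In this toy setting the rare extreme payoff gives "risky" the higher expectation even though it reaches the threshold only 10% of the time, which is the drawback of the pure expectation measure that the abstract describes.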
