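Partially observable Markov decision process
Markov decision process
Observable
Computer science
Generalization
Decision maker
Markov process
Range (aeronautics)
Process (computing)
Mathematical optimization
Artificial intelligence
Markov model
Markov chain
Machine learning
Operations research
Mathematics
Mathematical analysis
Statistics
Physics
Materials science
Quantum mechanics
Composite material
Operating system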
Authors
Emine Yaylalı, Julie S. Ivy
Source
Journal: Wiley Encyclopedia of Operations Research and Management Science
Date: 2011-01-01
Citations: 5
Identifier
DOI: 10.1002/9780470400531.eorms0646
Abstract
A partially observable Markov decision process (POMDP) is a generalization of a Markov decision process where the states of the model are not completely observable by the decision maker. Noisy observations provide a belief regarding the underlying state, while the decision maker has some control over the progression of the model through the selection of actions. In this article, we introduce POMDPs and discuss the relationship between Markov models and POMDPs. A general POMDP formulation and a wide range of POMDP applications from the literature are also presented.
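
The idea that noisy observations yield a belief over the hidden state can be illustrated with the standard Bayesian belief update for a discrete POMDP. The sketch below is not taken from the article. The function name `belief_update`, the table layouts `T[a][s][s2]` and `O[a][s2][o]`, the two-state model, and the action name `"wait"` are all assumptions chosen for illustration.

```python
def belief_update(belief, action, observation, T, O):
    """Bayes-filter update: b'(s2) is proportional to
    O[a][s2][o] * sum_s T[a][s][s2] * b(s).

    belief: list of probabilities over states (assumed to sum to 1)
    T[a][s][s2]: probability of moving from s to s2 under action a (assumed layout)
    O[a][s2][o]: probability of observing o in state s2 after action a (assumed layout)
    """
    n = len(belief)
    # Prediction step: push the current belief through the transition model.
    predicted = [
        sum(T[action][s][s2] * belief[s] for s in range(n))
        for s2 in range(n)
    ]
    # Correction step: weight each predicted state by the observation likelihood.
    unnormalized = [O[action][s2][observation] * predicted[s2] for s2 in range(n)]
    total = sum(unnormalized)
    if total == 0:
        raise ValueError("Observation has zero probability under the current belief")
    # Normalize so the new belief is a probability distribution.
    return [p / total for p in unnormalized]


# Hypothetical two-state model (state 0 = "good", state 1 = "degraded").
T = {"wait": [[0.9, 0.1], [0.0, 1.0]]}
O = {"wait": [[0.8, 0.2], [0.3, 0.7]]}

belief = [0.5, 0.5]
new_belief = belief_update(belief, "wait", 1, T, O)
print(new_belief)
```

In this made-up example, observation `1` is more likely in the degraded state, so the updated belief shifts probability mass toward state 1. The decision maker would then choose the next action based on this belief rather than on the unobserved true state.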