Markov decision process
Computer science
Markov chain
Markov process
Artificial intelligence
Machine learning
Mathematics
Statistics
Identifier
DOI:10.1201/9781315140223
Abstract
This book provides a unified approach for the study of constrained Markov decision processes with a finite state space and unbounded costs. Unlike the single-objective case considered in many other books, the author considers a single controller with several objectives, such as minimizing delays and loss probabilities and maximizing throughput. The goal is to design a controller that minimizes one cost objective subject to inequality constraints on the other cost objectives. This framework describes dynamic decision problems that arise frequently in many engineering fields. A thorough overview of these applications is presented in the introduction. The book is then divided into three sections that build upon each other.