有界函数
期望效用假设
熵(时间箭头)
合理规划模型
信息论
决策论
计算机科学
有限理性
决策问题
最优决策
数理经济学
数学优化
管理科学
数学
经济
人工智能
决策树
物理
热力学
算法
微观经济学
数学分析
统计
管理
作者
Pedro A. Ortega,Daniel A. Braun
标识
DOI:10.1098/rspa.2012.0683
摘要
Perfectly rational decision-makers maximize expected utility, but crucially ignore the resource costs incurred when determining optimal actions. Here, we propose a thermodynamically inspired formalization of bounded rational decision-making where information processing is modelled as state changes in thermodynamic systems that can be quantified by differences in free energy. By optimizing a free energy, bounded rational decision-makers trade off expected utility gains and information-processing costs measured by the relative entropy. As a result, the bounded rational decision-making problem can be rephrased in terms of well-known variational principles from statistical physics. In the limit when computational costs are ignored, the maximum expected utility principle is recovered. We discuss links to existing decision-making frameworks and applications to human decision-making experiments that are at odds with expected utility theory. Since most of the mathematical machinery can be borrowed from statistical physics, the main contribution is to re-interpret the formalism of thermodynamic free-energy differences in terms of bounded rational decision-making and to discuss its relationship to human decision-making experiments.
科研通智能强力驱动
Strongly Powered by AbleSci AI