期刊:Journal of Applied Probability [Cambridge University Press] 日期:1970-12-01卷期号:7 (3): 649-656被引量:92
标识
DOI:10.2307/3211944
摘要
The semi-Markov decision model is considered under the criterion of long-run average cost. A new criterion, which for any policy considers the limit of the expected cost incurred during the first n transitions divided by the expected length of the first n transitions, is considered. Conditions guaranteeing that an optimal stationary (non-randomized) policy exist are then presented. It is also shown that the above criterion is equivalent to the usual one under certain conditions.