代表(政治)
计算机科学
过程(计算)
人工智能
价值(数学)
机器学习
政治学
政治
操作系统
法学
作者
Dwi H. Widyantoro,Yus G. Vembrina
标识
DOI:10.1109/iceei.2009.5254776
摘要
This paper reports our experiment on applying Q Learning algorithm for learning to play Tic-tac-toe. The original algorithm is modified by updating the Q value only when the game terminates, propagating the update process from the final move backward to the first move, and incorporating a new update rule. We evaluate the agent performance using full-board and partial-board representations. In this evaluation, the agent plays the tic-tac-toe game against human players. The evaluation results show that the performance of modified Q Learning algorithm with partial-board representation is comparable to that of human players.
科研通智能强力驱动
Strongly Powered by AbleSci AI