偏爱
强化学习
偏好学习
计算机科学
钢筋
旅游行为
增强学习
过程(计算)
运筹学
人工智能
微观经济学
工程类
经济
心理学
社会心理学
操作系统
作者
Xueqin Long,Jianxu Mao,Zhongbao Qiao,Peng Li,Wei He
标识
DOI:10.1080/19427867.2023.2231689
摘要
ABSTRACTABSTRACTTravelers always perform some preference during the decision-making process. The preference will affect the decision results and can be improved by continuously learning. In order to understand the influence of individual preference on travel behavior choice , two individual preferences, including indifference preference and compulsive preference are considered in the paper. Two updating mechanisms of compulsive preference are proposed to obtain the choosing probability of all alternatives. Reinforcement learning models are established integrating the gain stimulating and loss stimulating considering expected utility. Nguyen Dupuis network is adopted for numerical simulation to study the updating process. Simulation results denote that the equilibrium state is much more efficient when preference learning mechanism is considered comparing with the traditional stochastic user equilibrium model, and can decrease the total travel time greatly, which can be applied for urban traffic management. Personalized traffic guidance is the effective solution to traffic congestion in the futureKEYWORDS: Route choicereinforcement learninggeneralized travel timeindifference thresholdcompulsive preference AcknowledgmentsThis work was supported by the National Key Research and Development Program (2019YFB1600500); Science Program of Shaanxi Province (2021JQ-276).Disclosure statementNo potential conflict of interest was reported by the authors.Data availability statementNo data, models, or code were generated or used during the study.Additional informationFundingThe work was supported by the Science program of Shaanxi Province [2021JQ-276].
科研通智能强力驱动
Strongly Powered by AbleSci AI