强化学习
华法林
计算机科学
人工智能
机器学习
医学
内科学
心房颤动
作者
Hannah Ji,Matthew Gill,Evan Draper,David Liedl,David O. Hodge,Damon E. Houghton,Ana I. Casanegra
标识
DOI:10.1109/jbhi.2025.3545384
摘要
Warfarin is a commonly prescribed anticoagulant with a narrow therapeutic window, requiring frequent and specialized monitoring. This work aims to develop standardized optimal warfarin dose decision support using a machine learning model based on time series anticoagulation data and patient demographic characteristics. We propose an offline reinforcement learning model (RL) using a Batch-Constrained Q-Learning algorithm (BCQ) in the discrete action setting to predict the cumulative warfarin dose for the days until the next INR (International Normalized Ratio) test. Prior approaches utilized time-series supervised learning methods such as regression or Long Short Term Memory (LSTM) neural networks. The key advantage of reinforcement learning is its capacity to learn optimal dosing strategies from suboptimal clinical states in the data. To evaluate the model we compared the predicted warfarin doses with the physician-prescribed doses. Our BCQ model with a prediction accuracy of 98.6% significantly outperformed our baseline Long Short Term Memory (LSTM) model with a prediction accuracy of 71.09%. Further qualitative evaluation for explainability indicated that the model correctly adjusted the warfarin dose at time steps when patients had out-of-range INRs.
科研通智能强力驱动
Strongly Powered by AbleSci AI