强化学习
工业互联网
计算机科学
调度(生产过程)
分布式计算
物联网
马尔可夫决策过程
作业车间调度
互联网
马尔可夫过程
人工智能
数学优化
计算机网络
嵌入式系统
统计
布线(电子设计自动化)
数学
万维网
作者
Qiuyang Zhang,Ying Wang
出处
期刊:IEEE Transactions on Network Science and Engineering
[Institute of Electrical and Electronics Engineers]
日期:2024-01-01
卷期号:11 (1): 1065-1076
被引量:1
标识
DOI:10.1109/tnse.2023.3321048
摘要
The industrial Internet of Things (IIoT) has led to the emergence of various information-based industrial applications. Due to the uninterrupted and complex nature of production processes in industrial systems, these applications often run continuously and rely on information from multiple sensors. As a result, a single sensor can support multiple applications simultaneously, leading to complex correlations in the system. To address this challenge, we introduce the concept of the age of correlated information (AoCI) and formulate the scheduling problem as a Markov game problem to optimize the information freshness of industrial applications. To solve the problem, we propose a multi-heterogeneous-agent-reinforcement-learning (MHARL) scheme, which uses neural networks with different structures to represent agents participating in the game. Our numerical results demonstrate that the proposed MHARL scheme outperforms typical baselines, such as Qmix and VDN, in terms of AoCI and energy efficiency.
科研通智能强力驱动
Strongly Powered by AbleSci AI