亲爱的研友该休息了!由于当前在线用户较少,发布求助请尽量完整地填写文献信息,科研通机器人24小时在线,伴您度过漫漫科研夜!身体可是革命的本钱,早点休息,好梦!

Two-stage deep reinforcement learning method for agile optical satellite scheduling problem

强化学习 敏捷软件开发 阶段(地层学) 计算机科学 调度(生产过程) 卫星 人工智能 钢筋 计算智能 运筹学 工业工程 工程类 航空航天工程 运营管理 地质学 软件工程 结构工程 古生物学
作者
Zheng Liu,Wei Xiong,Zhuoya Jia,Chi Han
出处
期刊:Complex & Intelligent Systems 卷期号:11 (1)
标识
DOI:10.1007/s40747-024-01667-x
摘要

This paper investigates the agile optical satellite scheduling problem, which aims to arrange an observation sequence and observation actions for observation tasks. Existing research mainly aims to maximize the number of completed tasks or the total priorities of the completed tasks but ignores the influence of the observation actions on the imaging quality. Besides, the conventional exact methods and heuristic methods can hardly obtain a high-quality solution in a short time due to the complicated constraints and considerable solution space of this problem. Thus, this paper proposes a two-stage scheduling framework with two-stage deep reinforcement learning to address this problem. First, the scheduling process is decomposed into a task sequencing stage and an observation scheduling stage, and a mathematical model with complex constraints and two-stage optimization objectives is established to describe the problem. Then, a pointer network with a local selection mechanism and a rough pruning mechanism is constructed as the sequencing network to generate an executable task sequence in the task sequencing stage. Next, a decomposition strategy decomposes the executable task sequence into multiple sub-sequences in the observation scheduling stage, and the observation scheduling process of these sub-sequences is modeled as a concatenated Markov decision process. A neural network is designed as the observation scheduling network to determine observation actions for the sequenced tasks, which is well trained by the soft actor-critic algorithm. Finally, extensive experiments show that the proposed method, along with the designed mechanisms and strategy, is superior to comparison algorithms in terms of solution quality, generalization performance, and computation efficiency.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
王清亚完成签到,获得积分10
9秒前
完美世界应助yang采纳,获得10
14秒前
ChencanFang完成签到,获得积分10
16秒前
27秒前
yang完成签到,获得积分10
28秒前
小二郎应助xuan采纳,获得10
30秒前
CipherSage应助xuan采纳,获得10
30秒前
深情安青应助xuan采纳,获得30
30秒前
共享精神应助xuan采纳,获得10
30秒前
Owen应助xuan采纳,获得30
30秒前
NexusExplorer应助xuan采纳,获得30
30秒前
科研通AI6.1应助约翰森ner采纳,获得10
31秒前
yang发布了新的文献求助10
31秒前
55秒前
cxs发布了新的文献求助10
58秒前
星辰大海应助xuan采纳,获得10
58秒前
李爱国应助xuan采纳,获得10
58秒前
万能图书馆应助xuan采纳,获得30
58秒前
脑洞疼应助xuan采纳,获得30
58秒前
领导范儿应助xuan采纳,获得10
58秒前
SciGPT应助xuan采纳,获得30
58秒前
Jasper应助xuan采纳,获得10
58秒前
香蕉觅云应助xuan采纳,获得10
58秒前
天天快乐应助xuan采纳,获得10
58秒前
打打应助xuan采纳,获得10
58秒前
研友_ZlPolZ发布了新的文献求助10
59秒前
高兴魂幽发布了新的文献求助30
1分钟前
1分钟前
约翰森ner发布了新的文献求助10
1分钟前
1分钟前
WEileen完成签到 ,获得积分0
1分钟前
嘟嘟嘟嘟完成签到 ,获得积分10
1分钟前
1分钟前
1分钟前
科研通AI6.4应助Sober采纳,获得10
1分钟前
张欢馨应助科研通管家采纳,获得10
1分钟前
情怀应助科研通管家采纳,获得10
1分钟前
张欢馨应助科研通管家采纳,获得10
1分钟前
张欢馨应助科研通管家采纳,获得10
1分钟前
张欢馨应助科研通管家采纳,获得10
1分钟前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
PowerCascade: A Synthetic Dataset for Cascading Failure Analysis in Power Systems 2000
The Composition and Relative Chronology of Dynasties 16 and 17 in Egypt 1500
Picture this! Including first nations fiction picture books in school library collections 1500
Signals, Systems, and Signal Processing 610
Unlocking Chemical Thinking: Reimagining Chemistry Teaching and Learning 555
Scientific Writing and Communication: Papers, Proposals, and Presentations 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6371600
求助须知:如何正确求助?哪些是违规求助? 8185214
关于积分的说明 17271303
捐赠科研通 5426013
什么是DOI,文献DOI怎么找? 2870525
邀请新用户注册赠送积分活动 1847432
关于科研通互助平台的介绍 1694042