基于分布式元强化学习的多敏捷卫星多目标调度算法

doi:10.13195/j.kzyjc.2025.0112

首页 > 过刊浏览>2026年第41卷第4期 >1065-1076. DOI:10.13195/j.kzyjc.2025.0112

基于分布式元强化学习的多敏捷卫星多目标调度算法
DOI:
                        10.13195/j.kzyjc.2025.0112
                    
CSTR:
                        
                    
作者:
                        
                        
                    
作者单位:
作者简介:
通讯作者:
中图分类号:V19
基金项目:河北省自然科学基金项目(F2024204007)；西安交通大学机械制造系统工程国家重点实验室开放课题(sklms2023002).

Multiple agile satellites multi-objective scheduling algorithm based on distributed meta-reinforcement learning

Author:

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

随着地球观测领域相关技术的高速发展, 近年来具有先进姿态调整能力的敏捷地球观测卫星已经引起了广泛的关注. 敏捷卫星任务调度具有时间依赖性切换时间, 在多星、多轨道、多需求的卫星观测场景下, 产生了复杂的时间依赖性多敏捷卫星多目标调度问题. 针对该问题, 首先, 基于问题特征和优化目标建立问题的数学规划模型; 其次, 提出一种分布式元$ Q$学习协同进化框架, 包括预训练和进化搜索两个阶段, 预训练阶段通过分布式$ Q$学习提高训练效率, 进化搜索阶段通过训练好的分布式$ Q$学习模型实现多种群进化算子的自适应选择; 然后, 基于所提出的进化框架和问题特征, 设计多样化的进化算子和动态种群划分选择策略, 建立一种分布式元$Q $学习协同进化算法(DMCEA); 最后通过实验验证DMCEA求解问题的有效性.

Abstract:

With the rapid advancement of earth observation technologies in recent years, agile earth observation satellites (AEOS) equipped with advanced attitude adjustment capabilities have attracted widespread attention. The scheduling of AEOS involves time-dependent transition times, which generates a complex time-dependent multi-objective scheduling problem in multi-satellite, multi-orbit, and multi-demand observation scenarios. To solve this problem, we first formulate a mathematical programming model based on the problem characteristics and optimization objectives. Subsequently, a distributed meta-$Q $-learning co-evolutionary framework is proposed, which consists of a pretraining phase and an evolutionary search phase. In the pretraining phase, distributed $Q $-learning is used to enhance the pretraining efficiency, while the evolutionary search phase leverages the pretrained distributed $Q $-learning model to adaptively select the evolutionary operators. Based on the proposed evolutionary framework and problem characteristics, different evolutionary operators and a dynamic population division strategy are designed. Further, a distributed meta-$Q $-learning co-evolutionary algorithm (DMCEA) is constructed. Finally, computational experiments validate the effectiveness of the DMCEA in solving the problem under consideration.

参考文献

相似文献

引证文献

引用本文

张广辉,魏晨轩,冯彦翔,等.基于分布式元强化学习的多敏捷卫星多目标调度算法[J].控制与决策,2026,41(4):1065-1076

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2025-01-22
最后修改日期:
录用日期:
在线发布日期: 2026-03-24
出版日期: 2026-04-10

首页

期刊简介

编委会

作者中心

精选专辑

品牌联动

引用本文

相关视频

分享

文章指标

历史

文章二维码