
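Wang Xiao (王逍)

Lecturer  Faculty  Doctoral graduate

Department: College of Information Science and Technology  Research interests: flight vehicle dynamics and control; autonomous mission planning for spacecraft clusters

Email: 2021500089(at)buct.edu.cn  Office: Science and Technology Building (科技大厦)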

ORCID: 0000-0003-2305-2666


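Biography

Wang Xiao is a master's supervisor. Dr. Wang received a bachelor's degree from Beijing Institute of Technology in 2015 and a Ph.D. in engineering (flight vehicle design) from Beihang University in 2021. Dr. Wang's research covers flight vehicle dynamics and control and autonomous mission planning for spacecraft clusters. In these areas, Dr. Wang has published 18 academic papers, including 10 SCI-indexed papers as first author in journals such as IEEE Transactions on Aerospace and Electronic Systems, Neurocomputing, and Advances in Space Research, and 5 EI-indexed and core-journal papers. Dr. Wang has applied for 8 invention patents, of which 5 have been granted. Dr. Wang has led provincial- and ministerial-level government-funded projects, as well as government-funded and industry-funded projects including "Constellation Computing Power Aggregation and Mission Planning Software Development", "Research on Spacecraft Dynamic Game Confrontation Technology Based on Multi-Agent Reinforcement Learning", "Cislunar Space Autonomous Mission Planning Module Development", and "Research on Spatiotemporal Causal Reasoning and Decision-Making Methods for Dynamic Uncertain Space Targets". Dr. Wang maintains long-term, in-depth research collaborations with leading teams at the Academy of Military Sciences, the Chinese Academy of Sciences, Beihang University, and Beijing Institute of Technology.

Email: w_xiao@buct.edu.cn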


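Space is the final frontier of human exploration, and intelligence is the key to opening it. Our research group works at the intersection of aerospace control and intelligent autonomous systems, and is committed to solving the core scientific problems of intelligent on-orbit space operations. Here, you will gain:

1. Core competitiveness forged in a cutting-edge interdisciplinary field: you will master knowledge at the intersection of control theory, artificial intelligence, and astrodynamics, and grow into the kind of versatile, high-level specialist that is scarce in the field.

2. Comprehensive research support and academic training: we provide high-performance computing resources and an advanced semi-physical (hardware-in-the-loop) simulation and verification platform, and we fully support your attendance at top domestic and international conferences to exchange ideas with leading experts and scholars.


If you are drawn to the idea of "giving intelligence to satellites", so that they carry out missions in the vast universe with the intelligent core you designed, please get in touch and join us in exploring intelligent space control!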


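Education

Enrollment date  Graduation date  Degree-granting institution      Education level
2017-09-01       2021-06-30       Beihang University               Doctoral graduate
2015-09-01       2017-06-30       Beihang University               Master's graduate
2011-09-01       2015-06-30       Beijing Institute of Technology  Bachelor's graduate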

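Work Experience

Professional Service

Social Activities

Research Areas

Flight vehicle dynamics and control
Autonomous mission planning for spacecraft clusters
Multi-agent game technology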

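Summary of achievements:
1. Approach maneuver planning for non-cooperative targets

(1) Autonomous maneuver planning for space vehicles

  To address on-orbit intelligent game problems involving non-cooperative space targets, we build an autonomous decision-making technology chain that integrates dynamics modeling, intelligent planning, and simulation verification. We systematically compare mechanism-driven methods based on differential games with data-driven methods based on reinforcement learning. The comparison clarifies the underlying mechanisms, strengths, and weaknesses of the two approaches in terms of solution accuracy, computational efficiency, and environmental adaptability, and provides a solid theoretical basis for implementing space maneuver planning in practice.


(2) Intent analysis for non-cooperative targets

  For the problem of dynamically perceiving and predicting the flight intent of non-cooperative space targets, we construct a spatiotemporal causal interaction model. By designing different target situation-awareness functions and generating the hierarchical decision trees associated with them, the model supports dynamic, interpretable inference of concealed intent, so that every step of the assessment is traceable.

(3) On-orbit space operation technology

  For on-orbit operations on non-cooperative space targets, we study techniques such as close-approach orbit transfer, fly-around control, and hovering tracking. These techniques support close-range inspection, characteristic identification, and behavior assessment of space targets, and provide a verification basis for space intelligent autonomy and game-confrontation technologies.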



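2. Autonomous mission planning for vehicle clusters

(1) Inter-satellite trajectory safety assessment

  For trajectory safety in satellite clusters, we focus on autonomous safety early-warning techniques, including intelligent construction of dynamic safety corridors, fast collision-probability computation based on nonlinear relative dynamics, and a distributed cooperative early-warning mechanism outside the keep-out sphere. The goal is to solve the core problem of integrating real-time on-orbit perception, risk assessment, and avoidance decision-making, and to ensure the intrinsic safety of cluster flight.


(2) Multi-agent reinforcement learning game technology

  To address autonomous confrontation for satellite clusters in complex space environments, we adopt the multi-agent reinforcement learning paradigm and build policy and value networks that combine high accuracy with strong generalization. Using architectures such as centralized training with decentralized execution, cluster agents are trained in simulated, strongly adversarial scenarios to learn high-level game behaviors such as cooperation and competition. The ultimate aim is fast, autonomous, distributed decision-making by the cluster in real space environments.


(3) Generalization in reinforcement learning

  We tackle environment generalization and policy adaptation, a core challenge in multi-agent reinforcement learning, by exploring robust policy models that are insensitive to changes in environment parameters. The key is for agents to learn to perceive and understand the essential game dynamics underlying different tasks. The approach is validated systematically in heterogeneous environments ranging from MPE to Football, a key step in moving reinforcement learning from theory to real-world application.

3. Satellite resource modeling and management
(1) Resource management and task scheduling for remote-sensing satellites

  Onboard resources of remote-sensing satellites are tightly coupled, and task scheduling is highly dynamic. To address these core difficulties, we build an accurate multi-constraint coupled model that captures in detail the dynamic behavior and interactions of key resources such as storage, batteries, and momentum wheels. We focus on autonomous planning of highly dynamic task sequences under strong multi-resource constraints, aiming for a global optimum of resource utilization and mission return, and providing core technical support for autonomous mission management in next-generation intelligent remote-sensing satellites.

(2) Building a "cloud-edge-device" platform for heterogeneous satellite clusters

  To overcome the core verification bottleneck that precedes real on-orbit deployment of intelligent cluster algorithms, we build a high-fidelity "cloud-edge-device" digital twin platform for heterogeneous clusters on a Kubernetes cloud-native architecture. Through resource pooling across heterogeneous nodes, simulation of cross-host network communication, and integrated visual management and control, the platform reproduces the computing, communication, and resource constraints of the on-orbit environment, and supports functional and performance verification of autonomous mission planning algorithms on edge devices under near-realistic conditions.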


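Undergraduate Courses

Course name                          Academic year  Total hours  Enrollment  Course type
Principles of Automatic Control      2025           56                       Major required
Principles of Automatic Control      2024           56           28          Major required
Principles of Automatic Control      2023           48           32          Major required
Frontier Lectures in the Discipline  2022           16           27          Major elective
Principles of Automatic Control      2022           48           30          Major required
Principles of Automatic Control      2021           56           57          Major required

Graduate Courses

Course name                                  Academic year  Total hours  Delivery mode
Intelligent Control Theory and Applications  2025           32           D Major elective course
Intelligent Control Theory and Applications  2024           32           D Major elective course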

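University-Level Projects

  • 1. Research on Spacecraft Dynamic Game Confrontation Technology Based on Multi-Agent Reinforcement Learning, College of Information Science and Technology, project period: 2024-03-01 to 2024-12-01

Government-Funded (Vertical) Projects

Industry-Funded (Horizontal) Projects

  • 1. Experimental Research on Control Systems for Agile Maneuvering Satellites, Ministry of Science and Technology, project period: 2021-04-01 to 2022-02-28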

Publications

Representative papers
  • 1. Wang, Xiao; Li, Jiake; Cao, Lu; Ran, Dechao; Ji, Mingjiang; Sun, Kewu; Han, Yuying; Ma, Zhe
    A data-knowledge joint-driven reinforcement learning algorithm based on guided policy and state-prediction for satellite continuous-thrust tracking [Journal article], Advances in Space Research, 2024-10-15
  • 2. Wang, Xiao; Yang, Zhuo; Han, Yuying; Li, Hao; Shi, Peng
    Method of sequential intention inference for a space target based on meta-fuzzy decision tree [Journal article], Advances in Space Research, 2024-10-15
  • 3. Wang, Xiao; Li, Dazi
    Bioinspired actor-critic algorithm for reinforcement learning interpretation with Levy-Brown hybrid exploration strategy [Journal article], Neurocomputing, 2024-03-14
  • 4. Wang, Xiao; Ma, Zhe; Cao, Lu; Ran, Dechao; Ji, Mingjiang; Sun, Kewu; Han, Yuying; Li, Jiake
    A planar tracking strategy based on multiple-interpretable improved PPO algorithm with few-shot technique [Journal article], Scientific Reports, 2024-02-16
  • 5. Wang, Xiao; Zhu, Haijiang; Li, Zhiqing
    Decoupled Region of Interest Feature Pooling Diffusion Network for UAV Image Object Detection [Conference paper], Chinese Control Conference, CCC, 2024-01-01
  • 6. Wang, Xiao; Yang, Zhaohui; Bai, Xueqian; Ji, Mingjiang; Li, Hao; Ran, Dechao
    A Consistent Round-Up Strategy Based on PPO Path Optimization for the Leader–Follower Tracking Problem [Journal article], Sensors, 2023-11-01
  • 7. Wang, Xiao; Ma, Zhe; Mao, Lei; Sun, Kewu; Huang, Xuhui; Fan, Changchao; Li, Jiake
    Accelerating Fuzzy Actor-Critic Learning via Suboptimal Knowledge for a Multi-Agent Tracking Problem [Journal article], Electronics, 2023-04-01

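Software Copyrights

Patents

Honors and Awards

Admissions

We recruit master's students from automation, measurement and control, and related majors. Students interested in autonomous spacecraft control, swarm intelligence, and mission planning and decision-making are welcome to join. We look forward to exploring space science and technology together with you. Our goal is the sea of stars.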