王逍头像

王逍

讲师 教师 博士研究生毕业

部门: 信息科学与技术学院 研究方向: 强化学习、自动控制

电子邮箱: 2021500089@mail.buct.edu.cn 办公地址: 科技大厦

ORCID: 0000-0003-2305-2666 DBLP:

10 访问

个人简介

博士毕业于北京航空航天大学飞行器设计专业,主要研究航天器动力学与控制、多智能体博弈对抗等方向。围绕非合作目标近距离自主接近与跟踪控制问题,已发表相关学术论文18篇,其中以第一作者在IEEE Transactions on Aerospace and Electronic SystemsNeurocomputingAdvances in Space Research等期刊发表SCI学术论文10篇、EI及核心论文6篇;申请发明专利8项,其中已授权5项;主持专项纵向“XXXX自主规划软件”、中央高校基本科研业务费项目基于多智能体强化学习的航天器动态博弈对抗技术研究”、以及中国科学院空间应用工程与技术中心地月空间自主任务规划模块开发、军事科学院国防科技创新研究院星群算力聚合与任务规划软件开发、北京航空航天大学 “空间目标时空因果推理决策方法研究”、“敏捷机动卫星控制系统试验研究”等多项课题与项目。

教育经历

入学时间 毕业时间 学位授予单位 学历
2017-09-01 2021-06-30 北京航空航天大学 博士研究生毕业
2015-09-01 2017-06-30 北京航空航天大学 硕士研究生毕业
2011-09-01 2015-06-30 北京理工大学 大学本科毕业

工作经历

社会职务

社会活动

研究领域

针对空间非合作目标博弈对抗场景,开展有关微分博弈策略求解、集群协同飞行、自适应跟踪控制等相关研究;面向空间不确定环境,开展基于多智能体强化学习算法的卫星集群博弈方法、自主飞行任务规划方法、动态环境下的智能决策方法等相关研究

本科生课程

课程名称 开课学年 课程总学时 选课人数 课程性质
自动控制原理 2023 48 32 专业必修
学科前沿讲座 2022 16 27 专业选修
自动控制原理 2022 48 30 专业必修
自动控制原理 2021 56 57 专业必修

研究生课程

课程名称 开课学年 总学时 开课方式
智能控制理论及应用 2024 32 D专业选修课

校级项目

  • 1. 基于多智能体强化学习的航天器动态博弈对抗技术研究 ,信息科学与技术学院,项目时间:2024-03-01 至 2024-12-01

纵向项目

横向项目

  • 1. 敏捷机动卫星控制系统试验研究 ,国家科技部,项目时间:2021-04-01 至 2022-02-28

论文信息

[+][-]代表性论文
  • 1. DOI Wang, Xiao;Li, Jiake;Cao, Lu;Ran, Dechao;Ji, Mingjiang;Sun, Kewu;Han, Yuying;Ma, Zhe
    A data-knowledge joint-driven reinforcement learning algorithm based on guided policy and state-prediction for satellite continuous-thrust tracking[期刊论文],ADVANCES IN SPACE RESEARCH,2024-10-15
  • 2. DOI Wang, Xiao;Yang, Zhuo;Han, Yuying;Li, Hao;Shi, Peng
    Method of sequential intention inference for a space target based on meta-fuzzy decision tree[期刊论文],ADVANCES IN SPACE RESEARCH,2024-10-15
  • 3. DOI Wang, Xiao;Li, Dazi
    Bioinspired actor-critic algorithm for reinforcement learning interpretation with Levy-Brown hybrid exploration strategy[期刊论文],Neurocomputing,2024-03-14
  • 4. DOI Wang, Xiao;Ma, Zhe;Cao, Lu;Ran, Dechao;Ji, Mingjiang;Sun, Kewu;Han, Yuying;Li, Jiake
    A planar tracking strategy based on multiple-interpretable improved PPO algorithm with few-shot technique[期刊论文],SCIENTIFIC REPORTS,2024-02-16
  • 5. DOI Wang, Xiao;Zhu, Haijiang;Li, Zhiqing
    Decoupled Region of Interest Feature Pooling Diffusion Network for UAV Image Object Detection[会议论文],Chinese Control Conference, CCC,2024-01-01
  • 6. DOI Wang, Xiao;Yang, Zhaohui;Bai, Xueqian;Ji, Mingjiang;Li, Hao;Ran, Dechao
    A Consistent Round-Up Strategy Based on PPO Path Optimization for the Leader–Follower Tracking Problem[期刊论文],Sensors,2023-11-01
  • 7. DOI Wang, Xiao;Ma, Zhe;Mao, Lei;Sun, Kewu;Huang, Xuhui;Fan, Changchao;Li, Jiake
    Accelerating Fuzzy Actor-Critic Learning via Suboptimal Knowledge for a Multi-Agent Tracking Problem[期刊论文],ELECTRONICS,2023-04-01
[+][-] 2030年
  • 1. DOI Wang, Xiao;Li, Jiake;Cao, Lu;Ran, Dechao;Ji, Mingjiang;Sun, Kewu;Han, Yuying;Ma, Zhe
    A data-knowledge joint-driven reinforcement learning algorithm based on guided policy and state-prediction for satellite continuous-thrust tracking[期刊论文],ADVANCES IN SPACE RESEARCH,2024-10-15
  • 2. DOI Wang, Xiao;Yang, Zhuo;Han, Yuying;Li, Hao;Shi, Peng
    Method of sequential intention inference for a space target based on meta-fuzzy decision tree[期刊论文],ADVANCES IN SPACE RESEARCH,2024-10-15
  • 3. DOI Wang, Xiao;Li, Dazi
    Bioinspired actor-critic algorithm for reinforcement learning interpretation with Levy-Brown hybrid exploration strategy[期刊论文],Neurocomputing,2024-03-14
  • 4. DOI Wang, Xiao;Ma, Zhe;Cao, Lu;Ran, Dechao;Ji, Mingjiang;Sun, Kewu;Han, Yuying;Li, Jiake
    A planar tracking strategy based on multiple-interpretable improved PPO algorithm with few-shot technique[期刊论文],SCIENTIFIC REPORTS,2024-02-16
  • 5. DOI Wang, Xiao;Zhu, Haijiang;Li, Zhiqing
    Decoupled Region of Interest Feature Pooling Diffusion Network for UAV Image Object Detection[会议论文],Chinese Control Conference, CCC,2024-01-01
  • 6. DOI Wang, Xiao;Yang, Zhaohui;Bai, Xueqian;Ji, Mingjiang;Li, Hao;Ran, Dechao
    A Consistent Round-Up Strategy Based on PPO Path Optimization for the Leader–Follower Tracking Problem[期刊论文],Sensors,2023-11-01
  • 7. DOI Wang, Xiao;Ma, Zhe;Mao, Lei;Sun, Kewu;Huang, Xuhui;Fan, Changchao;Li, Jiake
    Accelerating Fuzzy Actor-Critic Learning via Suboptimal Knowledge for a Multi-Agent Tracking Problem[期刊论文],ELECTRONICS,2023-04-01
[+][-] 2029年
  • 1. DOI Wang, Xiao;Li, Jiake;Cao, Lu;Ran, Dechao;Ji, Mingjiang;Sun, Kewu;Han, Yuying;Ma, Zhe
    A data-knowledge joint-driven reinforcement learning algorithm based on guided policy and state-prediction for satellite continuous-thrust tracking[期刊论文],ADVANCES IN SPACE RESEARCH,2024-10-15
  • 2. DOI Wang, Xiao;Yang, Zhuo;Han, Yuying;Li, Hao;Shi, Peng
    Method of sequential intention inference for a space target based on meta-fuzzy decision tree[期刊论文],ADVANCES IN SPACE RESEARCH,2024-10-15
  • 3. DOI Wang, Xiao;Li, Dazi
    Bioinspired actor-critic algorithm for reinforcement learning interpretation with Levy-Brown hybrid exploration strategy[期刊论文],Neurocomputing,2024-03-14
  • 4. DOI Wang, Xiao;Ma, Zhe;Cao, Lu;Ran, Dechao;Ji, Mingjiang;Sun, Kewu;Han, Yuying;Li, Jiake
    A planar tracking strategy based on multiple-interpretable improved PPO algorithm with few-shot technique[期刊论文],SCIENTIFIC REPORTS,2024-02-16
  • 5. DOI Wang, Xiao;Zhu, Haijiang;Li, Zhiqing
    Decoupled Region of Interest Feature Pooling Diffusion Network for UAV Image Object Detection[会议论文],Chinese Control Conference, CCC,2024-01-01
  • 6. DOI Wang, Xiao;Yang, Zhaohui;Bai, Xueqian;Ji, Mingjiang;Li, Hao;Ran, Dechao
    A Consistent Round-Up Strategy Based on PPO Path Optimization for the Leader–Follower Tracking Problem[期刊论文],Sensors,2023-11-01
  • 7. DOI Wang, Xiao;Ma, Zhe;Mao, Lei;Sun, Kewu;Huang, Xuhui;Fan, Changchao;Li, Jiake
    Accelerating Fuzzy Actor-Critic Learning via Suboptimal Knowledge for a Multi-Agent Tracking Problem[期刊论文],ELECTRONICS,2023-04-01
[+][-] 2028年
  • 1. DOI Wang, Xiao;Li, Jiake;Cao, Lu;Ran, Dechao;Ji, Mingjiang;Sun, Kewu;Han, Yuying;Ma, Zhe
    A data-knowledge joint-driven reinforcement learning algorithm based on guided policy and state-prediction for satellite continuous-thrust tracking[期刊论文],ADVANCES IN SPACE RESEARCH,2024-10-15
  • 2. DOI Wang, Xiao;Yang, Zhuo;Han, Yuying;Li, Hao;Shi, Peng
    Method of sequential intention inference for a space target based on meta-fuzzy decision tree[期刊论文],ADVANCES IN SPACE RESEARCH,2024-10-15
  • 3. DOI Wang, Xiao;Li, Dazi
    Bioinspired actor-critic algorithm for reinforcement learning interpretation with Levy-Brown hybrid exploration strategy[期刊论文],Neurocomputing,2024-03-14
  • 4. DOI Wang, Xiao;Ma, Zhe;Cao, Lu;Ran, Dechao;Ji, Mingjiang;Sun, Kewu;Han, Yuying;Li, Jiake
    A planar tracking strategy based on multiple-interpretable improved PPO algorithm with few-shot technique[期刊论文],SCIENTIFIC REPORTS,2024-02-16
  • 5. DOI Wang, Xiao;Zhu, Haijiang;Li, Zhiqing
    Decoupled Region of Interest Feature Pooling Diffusion Network for UAV Image Object Detection[会议论文],Chinese Control Conference, CCC,2024-01-01
  • 6. DOI Wang, Xiao;Yang, Zhaohui;Bai, Xueqian;Ji, Mingjiang;Li, Hao;Ran, Dechao
    A Consistent Round-Up Strategy Based on PPO Path Optimization for the Leader–Follower Tracking Problem[期刊论文],Sensors,2023-11-01
  • 7. DOI Wang, Xiao;Ma, Zhe;Mao, Lei;Sun, Kewu;Huang, Xuhui;Fan, Changchao;Li, Jiake
    Accelerating Fuzzy Actor-Critic Learning via Suboptimal Knowledge for a Multi-Agent Tracking Problem[期刊论文],ELECTRONICS,2023-04-01
[+][-] 2027年
  • 1. DOI Wang, Xiao;Li, Jiake;Cao, Lu;Ran, Dechao;Ji, Mingjiang;Sun, Kewu;Han, Yuying;Ma, Zhe
    A data-knowledge joint-driven reinforcement learning algorithm based on guided policy and state-prediction for satellite continuous-thrust tracking[期刊论文],ADVANCES IN SPACE RESEARCH,2024-10-15
  • 2. DOI Wang, Xiao;Yang, Zhuo;Han, Yuying;Li, Hao;Shi, Peng
    Method of sequential intention inference for a space target based on meta-fuzzy decision tree[期刊论文],ADVANCES IN SPACE RESEARCH,2024-10-15
  • 3. DOI Wang, Xiao;Li, Dazi
    Bioinspired actor-critic algorithm for reinforcement learning interpretation with Levy-Brown hybrid exploration strategy[期刊论文],Neurocomputing,2024-03-14
  • 4. DOI Wang, Xiao;Ma, Zhe;Cao, Lu;Ran, Dechao;Ji, Mingjiang;Sun, Kewu;Han, Yuying;Li, Jiake
    A planar tracking strategy based on multiple-interpretable improved PPO algorithm with few-shot technique[期刊论文],SCIENTIFIC REPORTS,2024-02-16
  • 5. DOI Wang, Xiao;Zhu, Haijiang;Li, Zhiqing
    Decoupled Region of Interest Feature Pooling Diffusion Network for UAV Image Object Detection[会议论文],Chinese Control Conference, CCC,2024-01-01
  • 6. DOI Wang, Xiao;Yang, Zhaohui;Bai, Xueqian;Ji, Mingjiang;Li, Hao;Ran, Dechao
    A Consistent Round-Up Strategy Based on PPO Path Optimization for the Leader–Follower Tracking Problem[期刊论文],Sensors,2023-11-01
  • 7. DOI Wang, Xiao;Ma, Zhe;Mao, Lei;Sun, Kewu;Huang, Xuhui;Fan, Changchao;Li, Jiake
    Accelerating Fuzzy Actor-Critic Learning via Suboptimal Knowledge for a Multi-Agent Tracking Problem[期刊论文],ELECTRONICS,2023-04-01
[+][-] 2026年
  • 1. DOI Wang, Xiao;Li, Jiake;Cao, Lu;Ran, Dechao;Ji, Mingjiang;Sun, Kewu;Han, Yuying;Ma, Zhe
    A data-knowledge joint-driven reinforcement learning algorithm based on guided policy and state-prediction for satellite continuous-thrust tracking[期刊论文],ADVANCES IN SPACE RESEARCH,2024-10-15
  • 2. DOI Wang, Xiao;Yang, Zhuo;Han, Yuying;Li, Hao;Shi, Peng
    Method of sequential intention inference for a space target based on meta-fuzzy decision tree[期刊论文],ADVANCES IN SPACE RESEARCH,2024-10-15
  • 3. DOI Wang, Xiao;Li, Dazi
    Bioinspired actor-critic algorithm for reinforcement learning interpretation with Levy-Brown hybrid exploration strategy[期刊论文],Neurocomputing,2024-03-14
  • 4. DOI Wang, Xiao;Ma, Zhe;Cao, Lu;Ran, Dechao;Ji, Mingjiang;Sun, Kewu;Han, Yuying;Li, Jiake
    A planar tracking strategy based on multiple-interpretable improved PPO algorithm with few-shot technique[期刊论文],SCIENTIFIC REPORTS,2024-02-16
  • 5. DOI Wang, Xiao;Zhu, Haijiang;Li, Zhiqing
    Decoupled Region of Interest Feature Pooling Diffusion Network for UAV Image Object Detection[会议论文],Chinese Control Conference, CCC,2024-01-01
  • 6. DOI Wang, Xiao;Yang, Zhaohui;Bai, Xueqian;Ji, Mingjiang;Li, Hao;Ran, Dechao
    A Consistent Round-Up Strategy Based on PPO Path Optimization for the Leader–Follower Tracking Problem[期刊论文],Sensors,2023-11-01
  • 7. DOI Wang, Xiao;Ma, Zhe;Mao, Lei;Sun, Kewu;Huang, Xuhui;Fan, Changchao;Li, Jiake
    Accelerating Fuzzy Actor-Critic Learning via Suboptimal Knowledge for a Multi-Agent Tracking Problem[期刊论文],ELECTRONICS,2023-04-01
[+][-] 2025年
  • 1. DOI Wang, Xiao;Li, Jiake;Cao, Lu;Ran, Dechao;Ji, Mingjiang;Sun, Kewu;Han, Yuying;Ma, Zhe
    A data-knowledge joint-driven reinforcement learning algorithm based on guided policy and state-prediction for satellite continuous-thrust tracking[期刊论文],ADVANCES IN SPACE RESEARCH,2024-10-15
  • 2. DOI Wang, Xiao;Yang, Zhuo;Han, Yuying;Li, Hao;Shi, Peng
    Method of sequential intention inference for a space target based on meta-fuzzy decision tree[期刊论文],ADVANCES IN SPACE RESEARCH,2024-10-15
  • 3. DOI Wang, Xiao;Li, Dazi
    Bioinspired actor-critic algorithm for reinforcement learning interpretation with Levy-Brown hybrid exploration strategy[期刊论文],Neurocomputing,2024-03-14
  • 4. DOI Wang, Xiao;Ma, Zhe;Cao, Lu;Ran, Dechao;Ji, Mingjiang;Sun, Kewu;Han, Yuying;Li, Jiake
    A planar tracking strategy based on multiple-interpretable improved PPO algorithm with few-shot technique[期刊论文],SCIENTIFIC REPORTS,2024-02-16
  • 5. DOI Wang, Xiao;Zhu, Haijiang;Li, Zhiqing
    Decoupled Region of Interest Feature Pooling Diffusion Network for UAV Image Object Detection[会议论文],Chinese Control Conference, CCC,2024-01-01
  • 6. DOI Wang, Xiao;Yang, Zhaohui;Bai, Xueqian;Ji, Mingjiang;Li, Hao;Ran, Dechao
    A Consistent Round-Up Strategy Based on PPO Path Optimization for the Leader–Follower Tracking Problem[期刊论文],Sensors,2023-11-01
  • 7. DOI Wang, Xiao;Ma, Zhe;Mao, Lei;Sun, Kewu;Huang, Xuhui;Fan, Changchao;Li, Jiake
    Accelerating Fuzzy Actor-Critic Learning via Suboptimal Knowledge for a Multi-Agent Tracking Problem[期刊论文],ELECTRONICS,2023-04-01
[+][-] 2024年
  • 1. DOI Wang, Xiao;Li, Jiake;Cao, Lu;Ran, Dechao;Ji, Mingjiang;Sun, Kewu;Han, Yuying;Ma, Zhe
    A data-knowledge joint-driven reinforcement learning algorithm based on guided policy and state-prediction for satellite continuous-thrust tracking[期刊论文],ADVANCES IN SPACE RESEARCH,2024-10-15
  • 2. DOI Wang, Xiao;Yang, Zhuo;Han, Yuying;Li, Hao;Shi, Peng
    Method of sequential intention inference for a space target based on meta-fuzzy decision tree[期刊论文],ADVANCES IN SPACE RESEARCH,2024-10-15
  • 3. DOI Wang, Xiao;Li, Dazi
    Bioinspired actor-critic algorithm for reinforcement learning interpretation with Levy-Brown hybrid exploration strategy[期刊论文],Neurocomputing,2024-03-14
  • 4. DOI Wang, Xiao;Ma, Zhe;Cao, Lu;Ran, Dechao;Ji, Mingjiang;Sun, Kewu;Han, Yuying;Li, Jiake
    A planar tracking strategy based on multiple-interpretable improved PPO algorithm with few-shot technique[期刊论文],SCIENTIFIC REPORTS,2024-02-16
  • 5. DOI Wang, Xiao;Zhu, Haijiang;Li, Zhiqing
    Decoupled Region of Interest Feature Pooling Diffusion Network for UAV Image Object Detection[会议论文],Chinese Control Conference, CCC,2024-01-01
  • 6. DOI Wang, Xiao;Yang, Zhaohui;Bai, Xueqian;Ji, Mingjiang;Li, Hao;Ran, Dechao
    A Consistent Round-Up Strategy Based on PPO Path Optimization for the Leader–Follower Tracking Problem[期刊论文],Sensors,2023-11-01
  • 7. DOI Wang, Xiao;Ma, Zhe;Mao, Lei;Sun, Kewu;Huang, Xuhui;Fan, Changchao;Li, Jiake
    Accelerating Fuzzy Actor-Critic Learning via Suboptimal Knowledge for a Multi-Agent Tracking Problem[期刊论文],ELECTRONICS,2023-04-01
[+][-] 2023年
  • 1. DOI Wang, Xiao;Li, Jiake;Cao, Lu;Ran, Dechao;Ji, Mingjiang;Sun, Kewu;Han, Yuying;Ma, Zhe
    A data-knowledge joint-driven reinforcement learning algorithm based on guided policy and state-prediction for satellite continuous-thrust tracking[期刊论文],ADVANCES IN SPACE RESEARCH,2024-10-15
  • 2. DOI Wang, Xiao;Yang, Zhuo;Han, Yuying;Li, Hao;Shi, Peng
    Method of sequential intention inference for a space target based on meta-fuzzy decision tree[期刊论文],ADVANCES IN SPACE RESEARCH,2024-10-15
  • 3. DOI Wang, Xiao;Li, Dazi
    Bioinspired actor-critic algorithm for reinforcement learning interpretation with Levy-Brown hybrid exploration strategy[期刊论文],Neurocomputing,2024-03-14
  • 4. DOI Wang, Xiao;Ma, Zhe;Cao, Lu;Ran, Dechao;Ji, Mingjiang;Sun, Kewu;Han, Yuying;Li, Jiake
    A planar tracking strategy based on multiple-interpretable improved PPO algorithm with few-shot technique[期刊论文],SCIENTIFIC REPORTS,2024-02-16
  • 5. DOI Wang, Xiao;Zhu, Haijiang;Li, Zhiqing
    Decoupled Region of Interest Feature Pooling Diffusion Network for UAV Image Object Detection[会议论文],Chinese Control Conference, CCC,2024-01-01
  • 6. DOI Wang, Xiao;Yang, Zhaohui;Bai, Xueqian;Ji, Mingjiang;Li, Hao;Ran, Dechao
    A Consistent Round-Up Strategy Based on PPO Path Optimization for the Leader–Follower Tracking Problem[期刊论文],Sensors,2023-11-01
  • 7. DOI Wang, Xiao;Ma, Zhe;Mao, Lei;Sun, Kewu;Huang, Xuhui;Fan, Changchao;Li, Jiake
    Accelerating Fuzzy Actor-Critic Learning via Suboptimal Knowledge for a Multi-Agent Tracking Problem[期刊论文],ELECTRONICS,2023-04-01
[+][-]2023年之前
  • 1. DOI Wang, Xiao;Li, Jiake;Cao, Lu;Ran, Dechao;Ji, Mingjiang;Sun, Kewu;Han, Yuying;Ma, Zhe
    A data-knowledge joint-driven reinforcement learning algorithm based on guided policy and state-prediction for satellite continuous-thrust tracking[期刊论文],ADVANCES IN SPACE RESEARCH,2024-10-15
  • 2. DOI Wang, Xiao;Yang, Zhuo;Han, Yuying;Li, Hao;Shi, Peng
    Method of sequential intention inference for a space target based on meta-fuzzy decision tree[期刊论文],ADVANCES IN SPACE RESEARCH,2024-10-15
  • 3. DOI Wang, Xiao;Li, Dazi
    Bioinspired actor-critic algorithm for reinforcement learning interpretation with Levy-Brown hybrid exploration strategy[期刊论文],Neurocomputing,2024-03-14
  • 4. DOI Wang, Xiao;Ma, Zhe;Cao, Lu;Ran, Dechao;Ji, Mingjiang;Sun, Kewu;Han, Yuying;Li, Jiake
    A planar tracking strategy based on multiple-interpretable improved PPO algorithm with few-shot technique[期刊论文],SCIENTIFIC REPORTS,2024-02-16
  • 5. DOI Wang, Xiao;Zhu, Haijiang;Li, Zhiqing
    Decoupled Region of Interest Feature Pooling Diffusion Network for UAV Image Object Detection[会议论文],Chinese Control Conference, CCC,2024-01-01
  • 6. DOI Wang, Xiao;Yang, Zhaohui;Bai, Xueqian;Ji, Mingjiang;Li, Hao;Ran, Dechao
    A Consistent Round-Up Strategy Based on PPO Path Optimization for the Leader–Follower Tracking Problem[期刊论文],Sensors,2023-11-01
  • 7. DOI Wang, Xiao;Ma, Zhe;Mao, Lei;Sun, Kewu;Huang, Xuhui;Fan, Changchao;Li, Jiake
    Accelerating Fuzzy Actor-Critic Learning via Suboptimal Knowledge for a Multi-Agent Tracking Problem[期刊论文],ELECTRONICS,2023-04-01

软件著作

专利

荣誉及奖励

招生信息