个人简介
博士毕业于北京航空航天大学飞行器设计专业,主要研究航天器动力学与控制、多智能体博弈对抗等方向。围绕非合作目标近距离自主接近与跟踪控制问题,已发表相关学术论文18篇,其中以第一作者在IEEE Transactions on Aerospace and Electronic Systems、Neurocomputing、Advances in Space Research等期刊发表SCI学术论文10篇、EI及核心论文6篇;申请发明专利8项,其中已授权5项;主持专项纵向“XXXX自主规划软件”、中央高校基本科研业务费项目“基于多智能体强化学习的航天器动态博弈对抗技术研究”、以及中国科学院空间应用工程与技术中心“地月空间自主任务规划模块开发”、军事科学院国防科技创新研究院“星群算力聚合与任务规划软件开发”、北京航空航天大学 “空间目标时空因果推理决策方法研究”、“敏捷机动卫星控制系统试验研究”等多项课题与项目。
教育经历
入学时间 |
毕业时间 |
学位授予单位 |
学历 |
2017-09-01 |
2021-06-30 |
北京航空航天大学 |
博士研究生毕业 |
2015-09-01 |
2017-06-30 |
北京航空航天大学 |
硕士研究生毕业 |
2011-09-01 |
2015-06-30 |
北京理工大学 |
大学本科毕业 |
研究领域
针对空间非合作目标博弈对抗场景,开展有关微分博弈策略求解、集群协同飞行、自适应跟踪控制等相关研究;面向空间不确定环境,开展基于多智能体强化学习算法的卫星集群博弈方法、自主飞行任务规划方法、动态环境下的智能决策方法等相关研究
本科生课程
课程名称 |
开课学年 |
课程总学时 |
选课人数 |
课程性质 |
自动控制原理 |
2023 |
48 |
32 |
专业必修 |
学科前沿讲座 |
2022 |
16 |
27 |
专业选修 |
自动控制原理 |
2022 |
48 |
30 |
专业必修 |
自动控制原理 |
2021 |
56 |
57 |
专业必修 |
研究生课程
课程名称 |
开课学年 |
总学时 |
开课方式 |
智能控制理论及应用 |
2024 |
32 |
D专业选修课 |
校级项目
- 1. 基于多智能体强化学习的航天器动态博弈对抗技术研究 ,信息科学与技术学院,项目时间:2024-03-01 至 2024-12-01
横向项目
- 1. 敏捷机动卫星控制系统试验研究 ,国家科技部,项目时间:2021-04-01 至 2022-02-28
论文信息
[+][-]代表性论文
-
1. DOI
Wang, Xiao;Li, Jiake;Cao, Lu;Ran, Dechao;Ji, Mingjiang;Sun, Kewu;Han, Yuying;Ma, Zhe
A data-knowledge joint-driven reinforcement learning algorithm based on guided policy and state-prediction for satellite continuous-thrust tracking[期刊论文],ADVANCES IN SPACE RESEARCH,2024-10-15
-
2. DOI
Wang, Xiao;Yang, Zhuo;Han, Yuying;Li, Hao;Shi, Peng
Method of sequential intention inference for a space target based on meta-fuzzy decision tree[期刊论文],ADVANCES IN SPACE RESEARCH,2024-10-15
-
3. DOI
Wang, Xiao;Li, Dazi
Bioinspired actor-critic algorithm for reinforcement learning interpretation with Levy-Brown hybrid exploration strategy[期刊论文],Neurocomputing,2024-03-14
-
4. DOI
Wang, Xiao;Ma, Zhe;Cao, Lu;Ran, Dechao;Ji, Mingjiang;Sun, Kewu;Han, Yuying;Li, Jiake
A planar tracking strategy based on multiple-interpretable improved PPO algorithm with few-shot technique[期刊论文],SCIENTIFIC REPORTS,2024-02-16
-
5. DOI
Wang, Xiao;Zhu, Haijiang;Li, Zhiqing
Decoupled Region of Interest Feature Pooling Diffusion Network for UAV Image Object Detection[会议论文],Chinese Control Conference, CCC,2024-01-01
-
6. DOI
Wang, Xiao;Yang, Zhaohui;Bai, Xueqian;Ji, Mingjiang;Li, Hao;Ran, Dechao
A Consistent Round-Up Strategy Based on PPO Path Optimization for the Leader–Follower Tracking Problem[期刊论文],Sensors,2023-11-01
-
7. DOI
Wang, Xiao;Ma, Zhe;Mao, Lei;Sun, Kewu;Huang, Xuhui;Fan, Changchao;Li, Jiake
Accelerating Fuzzy Actor-Critic Learning via Suboptimal Knowledge for a Multi-Agent Tracking Problem[期刊论文],ELECTRONICS,2023-04-01
[+][-]
2030年
-
1. DOI
Wang, Xiao;Li, Jiake;Cao, Lu;Ran, Dechao;Ji, Mingjiang;Sun, Kewu;Han, Yuying;Ma, Zhe
A data-knowledge joint-driven reinforcement learning algorithm based on guided policy and state-prediction for satellite continuous-thrust tracking[期刊论文],ADVANCES IN SPACE RESEARCH,2024-10-15
-
2. DOI
Wang, Xiao;Yang, Zhuo;Han, Yuying;Li, Hao;Shi, Peng
Method of sequential intention inference for a space target based on meta-fuzzy decision tree[期刊论文],ADVANCES IN SPACE RESEARCH,2024-10-15
-
3. DOI
Wang, Xiao;Li, Dazi
Bioinspired actor-critic algorithm for reinforcement learning interpretation with Levy-Brown hybrid exploration strategy[期刊论文],Neurocomputing,2024-03-14
-
4. DOI
Wang, Xiao;Ma, Zhe;Cao, Lu;Ran, Dechao;Ji, Mingjiang;Sun, Kewu;Han, Yuying;Li, Jiake
A planar tracking strategy based on multiple-interpretable improved PPO algorithm with few-shot technique[期刊论文],SCIENTIFIC REPORTS,2024-02-16
-
5. DOI
Wang, Xiao;Zhu, Haijiang;Li, Zhiqing
Decoupled Region of Interest Feature Pooling Diffusion Network for UAV Image Object Detection[会议论文],Chinese Control Conference, CCC,2024-01-01
-
6. DOI
Wang, Xiao;Yang, Zhaohui;Bai, Xueqian;Ji, Mingjiang;Li, Hao;Ran, Dechao
A Consistent Round-Up Strategy Based on PPO Path Optimization for the Leader–Follower Tracking Problem[期刊论文],Sensors,2023-11-01
-
7. DOI
Wang, Xiao;Ma, Zhe;Mao, Lei;Sun, Kewu;Huang, Xuhui;Fan, Changchao;Li, Jiake
Accelerating Fuzzy Actor-Critic Learning via Suboptimal Knowledge for a Multi-Agent Tracking Problem[期刊论文],ELECTRONICS,2023-04-01
[+][-]
2029年
-
1. DOI
Wang, Xiao;Li, Jiake;Cao, Lu;Ran, Dechao;Ji, Mingjiang;Sun, Kewu;Han, Yuying;Ma, Zhe
A data-knowledge joint-driven reinforcement learning algorithm based on guided policy and state-prediction for satellite continuous-thrust tracking[期刊论文],ADVANCES IN SPACE RESEARCH,2024-10-15
-
2. DOI
Wang, Xiao;Yang, Zhuo;Han, Yuying;Li, Hao;Shi, Peng
Method of sequential intention inference for a space target based on meta-fuzzy decision tree[期刊论文],ADVANCES IN SPACE RESEARCH,2024-10-15
-
3. DOI
Wang, Xiao;Li, Dazi
Bioinspired actor-critic algorithm for reinforcement learning interpretation with Levy-Brown hybrid exploration strategy[期刊论文],Neurocomputing,2024-03-14
-
4. DOI
Wang, Xiao;Ma, Zhe;Cao, Lu;Ran, Dechao;Ji, Mingjiang;Sun, Kewu;Han, Yuying;Li, Jiake
A planar tracking strategy based on multiple-interpretable improved PPO algorithm with few-shot technique[期刊论文],SCIENTIFIC REPORTS,2024-02-16
-
5. DOI
Wang, Xiao;Zhu, Haijiang;Li, Zhiqing
Decoupled Region of Interest Feature Pooling Diffusion Network for UAV Image Object Detection[会议论文],Chinese Control Conference, CCC,2024-01-01
-
6. DOI
Wang, Xiao;Yang, Zhaohui;Bai, Xueqian;Ji, Mingjiang;Li, Hao;Ran, Dechao
A Consistent Round-Up Strategy Based on PPO Path Optimization for the Leader–Follower Tracking Problem[期刊论文],Sensors,2023-11-01
-
7. DOI
Wang, Xiao;Ma, Zhe;Mao, Lei;Sun, Kewu;Huang, Xuhui;Fan, Changchao;Li, Jiake
Accelerating Fuzzy Actor-Critic Learning via Suboptimal Knowledge for a Multi-Agent Tracking Problem[期刊论文],ELECTRONICS,2023-04-01
[+][-]
2028年
-
1. DOI
Wang, Xiao;Li, Jiake;Cao, Lu;Ran, Dechao;Ji, Mingjiang;Sun, Kewu;Han, Yuying;Ma, Zhe
A data-knowledge joint-driven reinforcement learning algorithm based on guided policy and state-prediction for satellite continuous-thrust tracking[期刊论文],ADVANCES IN SPACE RESEARCH,2024-10-15
-
2. DOI
Wang, Xiao;Yang, Zhuo;Han, Yuying;Li, Hao;Shi, Peng
Method of sequential intention inference for a space target based on meta-fuzzy decision tree[期刊论文],ADVANCES IN SPACE RESEARCH,2024-10-15
-
3. DOI
Wang, Xiao;Li, Dazi
Bioinspired actor-critic algorithm for reinforcement learning interpretation with Levy-Brown hybrid exploration strategy[期刊论文],Neurocomputing,2024-03-14
-
4. DOI
Wang, Xiao;Ma, Zhe;Cao, Lu;Ran, Dechao;Ji, Mingjiang;Sun, Kewu;Han, Yuying;Li, Jiake
A planar tracking strategy based on multiple-interpretable improved PPO algorithm with few-shot technique[期刊论文],SCIENTIFIC REPORTS,2024-02-16
-
5. DOI
Wang, Xiao;Zhu, Haijiang;Li, Zhiqing
Decoupled Region of Interest Feature Pooling Diffusion Network for UAV Image Object Detection[会议论文],Chinese Control Conference, CCC,2024-01-01
-
6. DOI
Wang, Xiao;Yang, Zhaohui;Bai, Xueqian;Ji, Mingjiang;Li, Hao;Ran, Dechao
A Consistent Round-Up Strategy Based on PPO Path Optimization for the Leader–Follower Tracking Problem[期刊论文],Sensors,2023-11-01
-
7. DOI
Wang, Xiao;Ma, Zhe;Mao, Lei;Sun, Kewu;Huang, Xuhui;Fan, Changchao;Li, Jiake
Accelerating Fuzzy Actor-Critic Learning via Suboptimal Knowledge for a Multi-Agent Tracking Problem[期刊论文],ELECTRONICS,2023-04-01
[+][-]
2027年
-
1. DOI
Wang, Xiao;Li, Jiake;Cao, Lu;Ran, Dechao;Ji, Mingjiang;Sun, Kewu;Han, Yuying;Ma, Zhe
A data-knowledge joint-driven reinforcement learning algorithm based on guided policy and state-prediction for satellite continuous-thrust tracking[期刊论文],ADVANCES IN SPACE RESEARCH,2024-10-15
-
2. DOI
Wang, Xiao;Yang, Zhuo;Han, Yuying;Li, Hao;Shi, Peng
Method of sequential intention inference for a space target based on meta-fuzzy decision tree[期刊论文],ADVANCES IN SPACE RESEARCH,2024-10-15
-
3. DOI
Wang, Xiao;Li, Dazi
Bioinspired actor-critic algorithm for reinforcement learning interpretation with Levy-Brown hybrid exploration strategy[期刊论文],Neurocomputing,2024-03-14
-
4. DOI
Wang, Xiao;Ma, Zhe;Cao, Lu;Ran, Dechao;Ji, Mingjiang;Sun, Kewu;Han, Yuying;Li, Jiake
A planar tracking strategy based on multiple-interpretable improved PPO algorithm with few-shot technique[期刊论文],SCIENTIFIC REPORTS,2024-02-16
-
5. DOI
Wang, Xiao;Zhu, Haijiang;Li, Zhiqing
Decoupled Region of Interest Feature Pooling Diffusion Network for UAV Image Object Detection[会议论文],Chinese Control Conference, CCC,2024-01-01
-
6. DOI
Wang, Xiao;Yang, Zhaohui;Bai, Xueqian;Ji, Mingjiang;Li, Hao;Ran, Dechao
A Consistent Round-Up Strategy Based on PPO Path Optimization for the Leader–Follower Tracking Problem[期刊论文],Sensors,2023-11-01
-
7. DOI
Wang, Xiao;Ma, Zhe;Mao, Lei;Sun, Kewu;Huang, Xuhui;Fan, Changchao;Li, Jiake
Accelerating Fuzzy Actor-Critic Learning via Suboptimal Knowledge for a Multi-Agent Tracking Problem[期刊论文],ELECTRONICS,2023-04-01
[+][-]
2026年
-
1. DOI
Wang, Xiao;Li, Jiake;Cao, Lu;Ran, Dechao;Ji, Mingjiang;Sun, Kewu;Han, Yuying;Ma, Zhe
A data-knowledge joint-driven reinforcement learning algorithm based on guided policy and state-prediction for satellite continuous-thrust tracking[期刊论文],ADVANCES IN SPACE RESEARCH,2024-10-15
-
2. DOI
Wang, Xiao;Yang, Zhuo;Han, Yuying;Li, Hao;Shi, Peng
Method of sequential intention inference for a space target based on meta-fuzzy decision tree[期刊论文],ADVANCES IN SPACE RESEARCH,2024-10-15
-
3. DOI
Wang, Xiao;Li, Dazi
Bioinspired actor-critic algorithm for reinforcement learning interpretation with Levy-Brown hybrid exploration strategy[期刊论文],Neurocomputing,2024-03-14
-
4. DOI
Wang, Xiao;Ma, Zhe;Cao, Lu;Ran, Dechao;Ji, Mingjiang;Sun, Kewu;Han, Yuying;Li, Jiake
A planar tracking strategy based on multiple-interpretable improved PPO algorithm with few-shot technique[期刊论文],SCIENTIFIC REPORTS,2024-02-16
-
5. DOI
Wang, Xiao;Zhu, Haijiang;Li, Zhiqing
Decoupled Region of Interest Feature Pooling Diffusion Network for UAV Image Object Detection[会议论文],Chinese Control Conference, CCC,2024-01-01
-
6. DOI
Wang, Xiao;Yang, Zhaohui;Bai, Xueqian;Ji, Mingjiang;Li, Hao;Ran, Dechao
A Consistent Round-Up Strategy Based on PPO Path Optimization for the Leader–Follower Tracking Problem[期刊论文],Sensors,2023-11-01
-
7. DOI
Wang, Xiao;Ma, Zhe;Mao, Lei;Sun, Kewu;Huang, Xuhui;Fan, Changchao;Li, Jiake
Accelerating Fuzzy Actor-Critic Learning via Suboptimal Knowledge for a Multi-Agent Tracking Problem[期刊论文],ELECTRONICS,2023-04-01
[+][-]
2025年
-
1. DOI
Wang, Xiao;Li, Jiake;Cao, Lu;Ran, Dechao;Ji, Mingjiang;Sun, Kewu;Han, Yuying;Ma, Zhe
A data-knowledge joint-driven reinforcement learning algorithm based on guided policy and state-prediction for satellite continuous-thrust tracking[期刊论文],ADVANCES IN SPACE RESEARCH,2024-10-15
-
2. DOI
Wang, Xiao;Yang, Zhuo;Han, Yuying;Li, Hao;Shi, Peng
Method of sequential intention inference for a space target based on meta-fuzzy decision tree[期刊论文],ADVANCES IN SPACE RESEARCH,2024-10-15
-
3. DOI
Wang, Xiao;Li, Dazi
Bioinspired actor-critic algorithm for reinforcement learning interpretation with Levy-Brown hybrid exploration strategy[期刊论文],Neurocomputing,2024-03-14
-
4. DOI
Wang, Xiao;Ma, Zhe;Cao, Lu;Ran, Dechao;Ji, Mingjiang;Sun, Kewu;Han, Yuying;Li, Jiake
A planar tracking strategy based on multiple-interpretable improved PPO algorithm with few-shot technique[期刊论文],SCIENTIFIC REPORTS,2024-02-16
-
5. DOI
Wang, Xiao;Zhu, Haijiang;Li, Zhiqing
Decoupled Region of Interest Feature Pooling Diffusion Network for UAV Image Object Detection[会议论文],Chinese Control Conference, CCC,2024-01-01
-
6. DOI
Wang, Xiao;Yang, Zhaohui;Bai, Xueqian;Ji, Mingjiang;Li, Hao;Ran, Dechao
A Consistent Round-Up Strategy Based on PPO Path Optimization for the Leader–Follower Tracking Problem[期刊论文],Sensors,2023-11-01
-
7. DOI
Wang, Xiao;Ma, Zhe;Mao, Lei;Sun, Kewu;Huang, Xuhui;Fan, Changchao;Li, Jiake
Accelerating Fuzzy Actor-Critic Learning via Suboptimal Knowledge for a Multi-Agent Tracking Problem[期刊论文],ELECTRONICS,2023-04-01
[+][-]
2024年
-
1. DOI
Wang, Xiao;Li, Jiake;Cao, Lu;Ran, Dechao;Ji, Mingjiang;Sun, Kewu;Han, Yuying;Ma, Zhe
A data-knowledge joint-driven reinforcement learning algorithm based on guided policy and state-prediction for satellite continuous-thrust tracking[期刊论文],ADVANCES IN SPACE RESEARCH,2024-10-15
-
2. DOI
Wang, Xiao;Yang, Zhuo;Han, Yuying;Li, Hao;Shi, Peng
Method of sequential intention inference for a space target based on meta-fuzzy decision tree[期刊论文],ADVANCES IN SPACE RESEARCH,2024-10-15
-
3. DOI
Wang, Xiao;Li, Dazi
Bioinspired actor-critic algorithm for reinforcement learning interpretation with Levy-Brown hybrid exploration strategy[期刊论文],Neurocomputing,2024-03-14
-
4. DOI
Wang, Xiao;Ma, Zhe;Cao, Lu;Ran, Dechao;Ji, Mingjiang;Sun, Kewu;Han, Yuying;Li, Jiake
A planar tracking strategy based on multiple-interpretable improved PPO algorithm with few-shot technique[期刊论文],SCIENTIFIC REPORTS,2024-02-16
-
5. DOI
Wang, Xiao;Zhu, Haijiang;Li, Zhiqing
Decoupled Region of Interest Feature Pooling Diffusion Network for UAV Image Object Detection[会议论文],Chinese Control Conference, CCC,2024-01-01
-
6. DOI
Wang, Xiao;Yang, Zhaohui;Bai, Xueqian;Ji, Mingjiang;Li, Hao;Ran, Dechao
A Consistent Round-Up Strategy Based on PPO Path Optimization for the Leader–Follower Tracking Problem[期刊论文],Sensors,2023-11-01
-
7. DOI
Wang, Xiao;Ma, Zhe;Mao, Lei;Sun, Kewu;Huang, Xuhui;Fan, Changchao;Li, Jiake
Accelerating Fuzzy Actor-Critic Learning via Suboptimal Knowledge for a Multi-Agent Tracking Problem[期刊论文],ELECTRONICS,2023-04-01
[+][-]
2023年
-
1. DOI
Wang, Xiao;Li, Jiake;Cao, Lu;Ran, Dechao;Ji, Mingjiang;Sun, Kewu;Han, Yuying;Ma, Zhe
A data-knowledge joint-driven reinforcement learning algorithm based on guided policy and state-prediction for satellite continuous-thrust tracking[期刊论文],ADVANCES IN SPACE RESEARCH,2024-10-15
-
2. DOI
Wang, Xiao;Yang, Zhuo;Han, Yuying;Li, Hao;Shi, Peng
Method of sequential intention inference for a space target based on meta-fuzzy decision tree[期刊论文],ADVANCES IN SPACE RESEARCH,2024-10-15
-
3. DOI
Wang, Xiao;Li, Dazi
Bioinspired actor-critic algorithm for reinforcement learning interpretation with Levy-Brown hybrid exploration strategy[期刊论文],Neurocomputing,2024-03-14
-
4. DOI
Wang, Xiao;Ma, Zhe;Cao, Lu;Ran, Dechao;Ji, Mingjiang;Sun, Kewu;Han, Yuying;Li, Jiake
A planar tracking strategy based on multiple-interpretable improved PPO algorithm with few-shot technique[期刊论文],SCIENTIFIC REPORTS,2024-02-16
-
5. DOI
Wang, Xiao;Zhu, Haijiang;Li, Zhiqing
Decoupled Region of Interest Feature Pooling Diffusion Network for UAV Image Object Detection[会议论文],Chinese Control Conference, CCC,2024-01-01
-
6. DOI
Wang, Xiao;Yang, Zhaohui;Bai, Xueqian;Ji, Mingjiang;Li, Hao;Ran, Dechao
A Consistent Round-Up Strategy Based on PPO Path Optimization for the Leader–Follower Tracking Problem[期刊论文],Sensors,2023-11-01
-
7. DOI
Wang, Xiao;Ma, Zhe;Mao, Lei;Sun, Kewu;Huang, Xuhui;Fan, Changchao;Li, Jiake
Accelerating Fuzzy Actor-Critic Learning via Suboptimal Knowledge for a Multi-Agent Tracking Problem[期刊论文],ELECTRONICS,2023-04-01
[+][-]2023年之前
-
1. DOI
Wang, Xiao;Li, Jiake;Cao, Lu;Ran, Dechao;Ji, Mingjiang;Sun, Kewu;Han, Yuying;Ma, Zhe
A data-knowledge joint-driven reinforcement learning algorithm based on guided policy and state-prediction for satellite continuous-thrust tracking[期刊论文],ADVANCES IN SPACE RESEARCH,2024-10-15
-
2. DOI
Wang, Xiao;Yang, Zhuo;Han, Yuying;Li, Hao;Shi, Peng
Method of sequential intention inference for a space target based on meta-fuzzy decision tree[期刊论文],ADVANCES IN SPACE RESEARCH,2024-10-15
-
3. DOI
Wang, Xiao;Li, Dazi
Bioinspired actor-critic algorithm for reinforcement learning interpretation with Levy-Brown hybrid exploration strategy[期刊论文],Neurocomputing,2024-03-14
-
4. DOI
Wang, Xiao;Ma, Zhe;Cao, Lu;Ran, Dechao;Ji, Mingjiang;Sun, Kewu;Han, Yuying;Li, Jiake
A planar tracking strategy based on multiple-interpretable improved PPO algorithm with few-shot technique[期刊论文],SCIENTIFIC REPORTS,2024-02-16
-
5. DOI
Wang, Xiao;Zhu, Haijiang;Li, Zhiqing
Decoupled Region of Interest Feature Pooling Diffusion Network for UAV Image Object Detection[会议论文],Chinese Control Conference, CCC,2024-01-01
-
6. DOI
Wang, Xiao;Yang, Zhaohui;Bai, Xueqian;Ji, Mingjiang;Li, Hao;Ran, Dechao
A Consistent Round-Up Strategy Based on PPO Path Optimization for the Leader–Follower Tracking Problem[期刊论文],Sensors,2023-11-01
-
7. DOI
Wang, Xiao;Ma, Zhe;Mao, Lei;Sun, Kewu;Huang, Xuhui;Fan, Changchao;Li, Jiake
Accelerating Fuzzy Actor-Critic Learning via Suboptimal Knowledge for a Multi-Agent Tracking Problem[期刊论文],ELECTRONICS,2023-04-01
|