高速铁路系统中无人机辅助通信的资源分配：一种基于多智能体强化学习方法

周兰兰; 余建国

0
0
浏览
下载

摘要
关键词
基金信息
论文图表
同行评议
相关论文
评论

Resource Allocation for UAV-aided Communication in High-Speed Railway Systems: A Multi-Agent Reinforcement Learning Approach

首发时间：2021-03-05

Zhou Lanlan ¹
Zhou Lanlan(1996-), female ,postgraduate student, main research: resource allocation, deep reinforcement learning
Yu Jianguo ¹
Yu Jianguo(1966-), male ,professor and PhD supervisor, main research: optical fiber communication, wireless mobile communication

1、Beijing Key Laboratory of Work Safety Intelligent Monitoring (Beijing University of Posts and Telecommunications), School of Electronic Engineering, Beijing University of Posts and Telecommunications, 100876

Abstract：The past decades have witnessed the rapid developments of high-speed railways (HSRs) communications. To provide seamless communication services between high-speed trains, both mobile edge computing (MEC) servers and UAVs are integrated into HSRs to provide on-demand resource access. However, the sensitive delay requirements of high-speed services pose significant challenges to the resource allocation in HSRs. This paper will formulate the UAV-aided resource allocation in high-speed railways (HSRs) as a distributed optimization problem to optimize the resource utilization while minimizing the path blocking probability. To address this problem, a multi-agent deep deterministic policy gradient (multi-agent DDPG) approach is proposed. The MEC servers are taken as the agents to make resource allocation decisions in the training phase. The simulation shows demonstrate that multi-agent DDPG outperforms the traditional single-agent method. The proposed multi-agent DDPG-based resource allocation algorithm can achieve satisfactory performance.?????

keywords： High-speed railway deep reinforcement learning unmanned aerial vehicle resource allocation multi-agent DDPG.

点击查看论文中文信息

高速铁路系统中无人机辅助通信的资源分配：一种基于多智能体强化学习方法

周兰兰 ¹
Zhou Lanlan(1996-), female ,postgraduate student, main research: resource allocation, deep reinforcement learning
余建国 ¹
Yu Jianguo(1966-), male ,professor and PhD supervisor, main research: optical fiber communication, wireless mobile communication

1、安全生产智能监控北京市重点实验室（北京邮电大学），电子工程学院，北京邮电大学，100876

摘要：在过去的几十年间，高速铁路（High-Speed Railway, HSR）系统得到了飞速发展。为了给高铁列车提供无缝的通信服务，移动边缘计算（Mobile Edge Computing, MEC）和无人机（Unmanned Aerial Vehicle, UAV）被集成到高铁系统中为用户提供按需的资源接入服务。然而，高铁通信对时延的超高需求给高铁系统的资源分配带来了重大挑战。本文将HSR系统中的无人机辅助资源分配问题制定为分布式优化问题，在优化资源利用的同时最小化路径阻塞概率。针对这一问题，本文提出了一种基于多智能体深度确定性策略梯度(Multi-Agent Deep Deterministic Policy Gradient, Multi-Agent DDPG)的资源分配方法。在训练阶段，将MEC服务器和UAVs作为代理进行资源分配决策。仿真结果表明，多智能体DDPG的学习性能优于传统的单智能体方法，并且资源分配的仿真实验中取得了优越的性能。

关键词：高速铁路深度强化学习无人机资源分配多智能体DDPG

基金：

论文图表：

引用

导出参考文献

.txt

.ris

.doc

Zhou Lanlan,Yu Jianguo. Resource Allocation for UAV-aided Communication in High-Speed Railway Systems: A Multi-Agent Reinforcement Learning Approach[EB/OL]. Beijing:Sciencepaper Online[2021-03-05]. https://www.paper.edu.cn/releasepaper/content/202103-75.

No.****

同行评议

共计0人参与

全部评论

0/1000

论文编号	202103-75
论文题目	高速铁路系统中无人机辅助通信的资源分配：一种基于多智能体强化学习方法
文献类型
收录期刊	上传封面中文期刊英文期刊期刊名称（中文）期刊名称（英文）年，卷（）上传封面中文专著英文专著书名（中文）书名（英文）出版地出版社出版年上传封面中文译著英文译著书名（中文）书名（英文）出版地出版社出版年上传封面中文论文集英文论文集编者.论文集名称（中文） [c]. 出版地出版社出版年， - 编者.论文集名称（英文） [c]. 出版地出版社出版年，- 上传封面中文文献英文文献期刊名称（中文）期刊名称（英文）日期-- 在线地址http:// 上传封面中文文献英文文献文题（中文）文题（英文）出版地出版社,出版日期-- 上传封面中文文献英文文献文题（中文）文题（英文）出版地出版社,出版日期--
英文作者写法：中外文作者均姓前名后，姓大写，名的第一个字母大写，姓全称写出，名可只写第一个字母，其后不加实心圆点“.”, 作者之间用逗号“，”分隔，最后为实心圆点“.”, 示例1：原姓名写法：Albert Einstein,编入参考文献时写法：Einstein A. 示例2：原姓名写法：李时珍；编入参考文献时写法：LI S Z. 示例3：YELLAND R L,JONES S C,EASTON K S,et al.