多智能体深度强化学习下多目标船舶避碰决策

公告通知

下载文档

联系方式

主管单位:: 中国船舶集团有限公司

主办单位:: 中国舰船研究院、中国船舶集团有限公司第七一四研究所

编辑出版:: 《舰船科学技术》编辑部

联系地址:: 北京市朝阳区科荟路55号院

邮编:: 100101

电话:: 陈老师：010-83027277
宋老师：010-83027276
李老师：010-83027269
梁老师：010-83027281

邮箱:: jckxjs@163.com

ISSN:: 1672-7649

CN:: 11-1885/U

友情链接

当前位置：首页 > 过刊浏览->2025年47卷23期

多智能体深度强化学习下多目标船舶避碰决策
Multi objective ship collision avoidance decision-making under multi-agent deep reinforcement learning

DOI:

作者:: 郭洪宇, 窦林涛, 贾贞, 顾浩
GUO Hongyu, DOU Lintao, JIA Zhen, GU Hao

作者单位:: 江苏自动化研究所，江苏连云港 222006
Jiangsu Automation Research Institute, Lianyungang 222006, China

关键词:: 多智能体深度强化学习;碰撞危险度;协同决策;多目标船舶避碰
multi agent deep reinforcement learning; collision risk level; collaborative decision-making; multi target ship collision avoidance

摘要:: 交通密集水域航行船舶较多，交通组成复杂，通航密度较大，多船会遇情况下由于船舶操纵性不同，难以准确预判他船动态轨迹与自身航线的冲突点，导致避碰策略失效。因此，提出多智能体深度强化学习下多目标船舶避碰决策方法。首先，基于人工势场法计算船舶的碰撞危险度，以确定存在碰撞风险的船舶；其次，基于船舶的状态、动作和奖励函数来构建船舶多智能体避碰决策系统，并将船舶的航行目标和安全性目标引入奖励函数设计中；最后，利用深度强化学习算法获得系统中的最优避碰策略。实验结果表明，该方法能够有效评估船舶碰撞风险，且避碰效果好，避碰路径短，具有实际应用价值。
There are many ships sailing in densely populated waters, with complex traffic composition and high navigation density. In situations where multiple ships encounter different maneuverability, it is difficult to accurately predict the conflict points between the dynamic trajectory of other ships and their own route, resulting in the failure of collision avoidance strategies. Therefore, a multi-objective ship collision avoidance decision-making method based on multi-agent deep reinforcement learning is proposed. Firstly, the collision risk of ships is calculated based on the artificial potential field method to determine the ships with collision risks; Secondly, based on the state, actions, and reward functions of the ship, a multi-agent collision avoidance decision-making system for the ship is constructed, and the navigation and safety objectives of the ship are introduced into the design of the reward function; Finally, the optimal collision avoidance strategy in the system is obtained using deep reinforcement learning algorithms. The experimental results show that this method can effectively evaluate the risk of ship collision, and has good collision avoidance effect, short collision avoidance path, and practical application value.

2025,47(23): 71-77 收稿日期：2025-5-8

DOI：10.3404/j.issn.1672-7649.2025.23.011

分类号：U675.96

作者简介：郭洪宇(1996-),男,博士,工程师,研究方向为人工智能决策技术、容器调度

参考文献：
[1] 廖诗管, 翁金贤. 基于贝叶斯时空log-logistic模型的船舶碰撞频率[J]. 中国航海, 2023, 46(1): 24-29+38.
LIAO S G, WENG J X. Ship collision frequency prediction with bayesian spatiotemporal log-logistic model[J]. Navigation of China, 2023, 46(1): 24-29+38.
[2] 杨琪森, 王慎执, 桑金楠, 等. 复杂开放水域下智能船舶路径规划与避障方法[J]. 计算机集成制造系统, 2022, 28(7): 2030-2040.
YANG Q S, WANG S Z, SANG J N, et al. Path planning and real-time obstacle avoidance methods of intelligent ships in complex open water environment[J]. Computer Integrated Manufacturing Systems, 2022, 28(7): 2030-2040.
[3] 宁君, 黄寓旸, 尤恽, 等. 基于混合粒子群算法的船舶避碰决策[J]. 大连海事大学学报, 2023, 49(1): 34-43.
NING J, HUANG Y Y, YOU Y, et al. Ship collision avoidance decision based on hybrid particle swarm algorithm[J]. Journal of Dalian Maritime University, 2023, 49(1): 34-43.
[4] 关巍, 王淼淼, 韩虎生, 等. 基于Dueling-DDQN的船舶智能避碰决策方法[J]. 大连海事大学学报, 2024, 50(4): 22-30.
GUAN W, WANG M M, HAN H S, et al. Intelligent collision avoidance decision-making method for ships based on Dueling DDQN[J]. Journal of Dalian Maritime University, 2024, 50(4): 22-30.
[5] 张伟龙, 单梁, 常路, 等. 基于改进DWA的多无人水面艇分布式避碰算法[J]. 控制与决策, 2023, 38(4): 951-962.
ZHANG W L, SHAN L, CHANG L, et al. Distributed collision avoidance algorithm for multiple unmanned surface vessels based on improved DWA[J]. Control and Decision, 2023, 38(4): 951-962.
[6] 谭智坤, 张隆辉, 刘正锋, 等. 融合改进动态窗口法与速度障碍法的无人船局部路径规划[J]. 船舶力学, 2023, 27(3): 311-322.
TAN Z K, ZHANG L H, LIU Z F, et al. Local path planning for USVs based on the fusion algorithm of improved dynamic window approach and velocity obstacle algorithm[J]. Journal of Ship Mechanics, 2023, 27(3): 311-322.
[7] 王庆禄, 吴冯国, 郑成辰, 等. 基于优化人工势场法的无人机航迹规划[J]. 系统工程与电子技术, 2023, 45(5): 1461-1468.
WANG Q L, WU F G, ZHENG C C, et al. UAV path planning based on optimized artificial potential field method[J]. Systems Engineering and Electronics, 2023, 45(5): 1461-1468.
[8] 郑维, 王昊, 王洪斌. 动态环境下基于自适应步长Informed-RRT*和人工势场法的机器人混合路径规划[J]. 计量学报, 2023, 44(1): 26-34.
ZHENG W, WANG H, WANG H B. Adaptive step size Informed-RRT*and artificial potential field algorithm for hybrid path planning of robot[J]. Acta Metrologica Sinica, 2023, 44(1): 26-34.
[9] 张智超, 张景峰, 杨栋, 等. 基于目标失效概率的桥梁船撞风险及防撞水准论证[J]. 公路交通科技, 2023, 40(2): 72-80.
ZHANG Z C, ZHANG J F, YANG D, et al. Demonstration of bridge-vessel collision risk and fortification criterion against vessel collision based on target failure probability[J]. Journal of Highway and Transportation Research and Development, 2023, 40(2): 72-80.
[10] 孔祥磊, 汪芳琴, 钟选明, 等. 具有方向约束的多智能体系统的反一致性研究[J]. 空间控制技术与应用, 2023, 49(1): 74-81.
KONG X L, WANG F Q, ZHONG X M, et al. Inverse consensus of multi-agent systems with directional constraints[J]. Aerospace Control and Application, 2023, 49(1): 74-81.
[11] 丁伟, 明振军, 王国新, 等. 基于多层次LSTM网络的多智能体攻防效能动态预测模型[J]. 兵工学报, 2023, 44(1): 176-192.
DING W, MING Z J, WANG G X, et al. Dynamic prediction model based on multi-level LSTM network for multi-agent attack and defense effectiveness[J]. Acta Armamentarii, 2023, 44(1): 176-192.
[12] 田维青, 彭雪飞, 王成军, 等. 基于深度神经网络的电厂跑冒滴漏智能识别方法研究[J]. 电子器件, 2024, 47(2): 524-529.
TIAN W Q, PENG X F, WANG C J, et al. Research on intelligent identification method of power plant leakage based on deep neural network[J]. Chinese Journal of Electron Devices, 2024, 47(2): 524-529.
[13] 杨茂桃, 梁爽, 易淼荣, 等. 基于深度神经网络的超声速隔离段湍流涡黏性系数辨识[J]. 航空动力学报, 2023, 38(2): 312-324.
YANG M T, LIANG S, YI M R, et al. Identification of turbulence eddy viscosity coefficient in supersonic isolation section based on deep neural network[J]. Journal of Aerospace Power, 2023, 38(2): 312-324.
[14] 武晨雨, 陶银罗, 曾九孙. 引入注意力机制时空深度神经网络的再热器温度偏差预测方法[J]. 中国测试, 2024, 50(1): 151-159+192.
WU C Y, TAO Y L, ZENG J S. Prediction method of reheater temperature deviation based on attention mechanism spatiotemporal deep neural network[J]. China Measurement & Test, 2024, 50(1): 151-159+192.
[15] 黄国良, 周毅, 郑坤, 等. 基于改进蚁群算法的全局船舶路径规划方法[J]. 船海工程, 2023, 52(2): 97-101+136.
HUANG G L, ZHOU Y, ZHENG K, et al. Ship path planning and collision avoidance based on improved ant colony algorithm[J]. Ship & Ocean Engineering, 2023, 52(2): 97-101+136.

多智能体深度强化学习下多目标船舶避碰决策 Multi objective ship collision avoidance decision-making under multi-agent deep reinforcement learning

多智能体深度强化学习下多目标船舶避碰决策
Multi objective ship collision avoidance decision-making under multi-agent deep reinforcement learning