[1]BAI Tao,DONG Qinhao,FENG Zikun,et al.Longitudinal motion control of underwater high-speed vehicles based on reinforcement learning[J].CAAI Transactions on Intelligent Systems,2023,18(5):902-916.[doi:10.11992/tis.202203024]
Copy
CAAI Transactions on Intelligent Systems[ISSN 1673-4785/CN 23-1538/TP] Volume:
18
Number of periods:
2023 5
Page number:
902-916
Column:
学术论文—机器学习
Public date:
2023-09-05
- Title:
-
Longitudinal motion control of underwater high-speed vehicles based on reinforcement learning
- Author(s):
-
BAI Tao; DONG Qinhao; FENG Zikun; LI Xuehua
-
College of Intelligent Systems Science and Engineering, Harbin Engineering University, Harbin 150001, China
-
- Keywords:
-
intelligent control; reinforcement learning; deep deterministic policy gradient (DDPG) algorithm; underwater high-speed vehicle; nonlinear system; longitudinal stability control; actuator saturation; diving
- CLC:
-
TP15
- DOI:
-
10.11992/tis.202203024
- Abstract:
-
Owing to cavitation characteristics, the mathematical model of a high-speed underwater vehicle has strong nonlinearity and uncertainty. Classical methods such as the linear quadratic regulator and switching control cannot achieve effective control. To address problems in the difficulty of decoupling or linearizing the underwater high-speed vehicle model accurately, the classical control method cannot fully consider the complexity and variability of the underwater environment, and the controller may be oversaturated when dealing with disturbances. Thus, the reinforcement learning algorithm in intelligent control was adopted in this study. It continuously explores and interacts with the environment to obtain control despite the absence of an accurate model and thereby completing the design of the deep deterministic policy gradient agent controller. The experimental results show that the designed controller can ensure stable control of the longitudinal motion of the high-speed underwater vehicle. Within the saturation range of the actuator, it can respond to disturbance and complete the diving control task, and the controller has strong robustness and better adaptability.