Scientific Literature

HDAO: A Hierarchical Curiosity-Driven Reinforcement Learning Approach for AUV Dynamic Obstacle Avoidance

杜华争, Qian Liu, Xu Liu, Na Xia

April 14, 2026

Published Date

Research Abstract & Technology Focus

Autonomous obstacle avoidance is a critical capability for Autonomous Underwater Vehicles (AUVs) to operate safely in dynamic and uncertain marine environments. Traditional AUV control methods rely on precise physical modeling and preset rules, yet they struggle to adapt to multiple sources of uncertainty, such as random initial states, dynamic obstacles, and varying currents. In recent years, deep reinforcement learning has provided a new avenue for data-driven adaptive policy learning. However, it remains insufficient for handling long-horizon tasks with sparse rewards. While hierarchical reinforcement learning can mitigate reward sparsity through temporal abstraction, it often faces challenges including exploration–exploitation imbalance, slow global convergence, and insufficient safety guarantees. Furthermore, most existing studies neglect dynamic environmental disturbances and task continuity, which further limits the practical application of these algorithms. To address these challenges, this paper proposes a hierarchical curiosity-driven AUV obstacle avoidance algorithm (HDAO), designed for autonomous obstacle avoidance in dynamic and uncertain underwater environments. The core design of HDAO incorporates several key innovations. Firstly, it introduces a Collision Threat Index for dynamic obstacles, which enables explicit risk perception and quantifies collision threats, thereby enhancing the policy’s generalization and robustness. Secondly, a task-decoupled hierarchical architecture is employed to synergistically optimize global path planning and local obstacle avoidance behaviors. This approach effectively manages long-horizon navigation tasks while alleviating high-dimensional training pressure. Finally, a novel reward mechanism is designed by integrating hierarchical active exploration with curiosity-driven passive exploration. This mechanism effectively incentivizes the agent to explore unvisited areas under sparse reward conditions and dynamically balances exploration and exploitation. Experimental results demonstrate that HDAO significantly outperforms existing methods in terms of obstacle avoidance success rate, training convergence speed and robustness against external disturbances.

Read Full Literature

Correlated Market Trend: Artificial Intelligence

Bridging academia to market: The 60-day public search velocity mapping directly to the core technology of this paper. Dashed line represents 7-day moving average.

AI Semantic Synergy Context

Connecting this academic literature to real-world market discussions and products.

Instantaneous Planning, Control and Safety for Navigation in Unknown Underwater Spaces

Navigating autonomous underwater vehicles (AUVs) in unknown environments is significantly challenging due to poor visibility, weak signal transmission, and dynamic water currents. These factors pos...

CD-HSSRL: Cross-Domain Hierarchical Safe Switching Reinforcement Learning Framework for Autonomous Amphibious Robot Navigation

Autonomous tracked amphibious robotic systems operating across water and land environments are essential for coastal inspection, disaster response, environmental monitoring, and complex terrain exp...

Integrating Proximal Policy Optimization with Physically Realistic Simulation for Robust Autonomous Underwater Vehicle Control

This study presents the design and implementation of a reinforcement learning (RL)-based framework for the control of an autonomous underwater vehicle (AUV) directly within Unreal Engine (UE). A hi...

Analysis of advanced modified tuna swarm optimization technique for path planning of underwater vehicle

Purpose Path planning with obstacle avoidance is crucial for navigating an autonomous underwater vehicle (AUV) in an unknown and obstacle-rich three-dimensional space. This paper aims to develop an...

Development of a New Intelligent Algorithm to Improve Autonomous Car Operation

Autonomous Driving Systems (ADS) are transforming modern transportation by enabling safer, more efficient vehicle operation. Among their core components, local path planning remains a significant c...

Frequently Asked Questions (FAQ)

Curated market intelligence mapped to this research.

What is the core focus of the research titled 'HDAO: A Hierarchical Curiosity-Driven Reinforcement Learning Approach for AUV Dynamic Obstacle Avoidance'?

This literature focuses on: Autonomous obstacle avoidance is a critical capability for Autonomous Underwater Vehicles (AUVs) to operate safely in dynamic and uncertain marine environments. Traditional AUV control methods rely on precise physical modeling and preset rules, ye...

Are there open-source GitHub repositories related to HDAO: A Hierarchical Curiosity-Driven Reinforcement Learning Approach for AUV Dynamic Obstacle Avoidance?

Yes, open-source projects like Tencent-Hunyuan/UniRL (UniRL is a Framework for Unified Multimodal Model Reinforcement Learning) are actively building upon these concepts.

What other academic literature is closely related to 'HDAO: A Hierarchical Curiosity-Driven Reinforcement Learning Approach for AUV Dynamic Obstacle Avoidance'?

Yes, highly correlated activity was mapped. An entry titled 'Instantaneous Planning, Control and Safety for Navigation in Unknown Underwater Spaces' discusses this: Navigating autonomous underwater vehicles (AUVs) in unknown environments is significantly challenging due to poor visibility, weak signal transmiss...

Cite this Market Intelligence Report

Reference our AI-mapped synergy between this research and the commercial market to instantly build authority.

"Commercial Applications of HDAO: A Hierarchical Curiosity-Driven Reinforcement Learning Approach for AUV Dynamic Obstacle Avoidance." ROIpad Intelligence Index, 2026. Available at: https://roipad.com/saas-metrics/research/oa_W7154378184/hdao-a-hierarchical-curiosity-driven-reinforcement-learning-approach-for-auv-dynamic-obstacle-avoidance

Commercial Realization

Startups and Open Source tools heavily associated with the concepts explored in this paper.

GitHub
Tencent-Hunyuan/UniRL
UniRL is a Framework for Unified Multimodal Model Reinforcement Lea...

Associated Media Narrative

Europeâs Nuclear Dependency on Russia Persists Despite Decoupling Efforts
Naturalnews.com • Jul 7, 2026
Synthesis is harder than analysis
Surfingcomplexity.blog • Jul 4, 2026
Life Is a Story That Begins in the Middle: Bayo Akomolafe on the Rewilding Power of Obstacles
Themarginalian.org • Jun 21, 2026