Item request has been placed!

Item request cannot be made.

Processing Request

Where-to-Learn: Analytical Policy Gradient Directed Exploration for On-Policy Robotic Reinforcement Learning

Item request has been placed!

Item request cannot be made.

Processing Request

Read More Add to Saved list

Author(s): Chang, Leixin; Yao, Xinchen; Liu, Ben; Yang, Liangjing; Chen, Hua
Subject Terms:
Robotics
Document Type:
Working Paper
Online Access:
http://arxiv.org/abs/2603.27317

Additional Information
- Publication Date:
  2026
- Abstract:
  On-policy reinforcement learning (RL) algorithms have demonstrated great potential in robotic control, where effective exploration is crucial for efficient and high-quality policy learning. However, how to encourage the agent to explore the better trajectories efficiently remains a challenge. Most existing methods incentivize exploration by maximizing the policy entropy or encouraging novel state visiting regardless of the potential state value. We propose a new form of directed exploration that uses analytical policy gradients from a differentiable dynamics model to inject task-aware, physics-guided guidance, thereby steering the agent towards high-reward regions for accelerated and more effective policy learning.
  8 pages, 10 figures
- Accession Number:
  10.1109/LRA.2026.3678143
- Accession Number:
  edsarx.2603.27317

Comments

No Comments.