Robust Underactuated Point-Feet Bipedal Locomotion Using Deep Reinforcement Learning and a Balance Recovery System
Publication Date
8-14-2025
Document Type
Article
Publication Title
ASME Letters in Dynamic Systems and Control
Volume
5
Issue
4
DOI
10.1115/1.4069223
Abstract
This study proposes a deep reinforcement learning control strategy using the twin delayed deep deterministic algorithm for the robust locomotion of a point-feet, underactuated bipedal robot. We introduce two key contributions: a specialized balance recovery system and a bioinspired reward function. The balance recovery system is explicitly trained to handle off-balance and fall-like conditions. Its effectiveness was validated through 50 randomized trials, where it achieved a 74% success rate in stabilizing the robot from a wide range of initial heights, velocities, and configurations. The bioinspired reward function encourages the robot’s hip to remain between its feet, which was shown to significantly improve the gait stability. This reward shaping reduced the normalized fluctuation in joint angle movements by a factor of 1.75, even under external disturbances. The final controller produced an average running speed of 2.4 m/s and demonstrated robustness to external disturbances of up to ±60 N · m, paving the way for more resilient and adaptive bipedal locomotion.
Keywords
balance recovery system, control applications, deep RL, intelligent systems, machine learning, motion controls, robotics, TD3 agent, underactuated bipedal
Department
Mechanical Engineering
Recommended Citation
Aref Amiri, Soroush Zare, and Mojtaba Sharifi. "Robust Underactuated Point-Feet Bipedal Locomotion Using Deep Reinforcement Learning and a Balance Recovery System" ASME Letters in Dynamic Systems and Control (2025). https://doi.org/10.1115/1.4069223