#5576. Robust biped locomotion using deep reinforcement learning on top of an analytical control approach

August 2026publication date
Proposal available till 21-05-2025
4 total number of authors per manuscript0 $

The title of the journal is available only for the authors who have already paid for
Journal’s subject area:
Mathematics (all);
Computer Science Applications;
Control and Systems Engineering;
Software;
Places in the authors’ list:
place 1place 2place 3place 4
FreeFreeFreeFree
2350 $1200 $1050 $900 $
Contract5576.1 Contract5576.2 Contract5576.3 Contract5576.4
1 place - free (for sale)
2 place - free (for sale)
3 place - free (for sale)
4 place - free (for sale)

Abstract:
This paper proposes a modular framework to generate robust biped locomotion using a tight coupling between an analytical walking approach and deep reinforcement learning. This framework is composed of six main modules which are hierarchically connected to reduce the overall complexity and increase its flexibility. The core of this framework is a specific dynamics model which abstracts a humanoids dynamics model into two masses for modeling upper and lower body. This dynamics model is used to design an adaptive reference trajectories planner and an optimal controller which are fully parametric. Furthermore, a learning framework is developed based on Genetic Algorithm (GA) and Proximal Policy Optimization (PPO) to find the optimum parameters and to learn how to improve the stability of the robot by moving the arms and changing its center of mass height. A set of simulations are performed to validate the performance of the framework using the official RoboCup 3D League simulation environment.
Keywords:
Deep Reinforcement Learning (DRL); Genetic Algorithm (GA); Humanoid robots; Linear–Quadratic–Gaussian (LQG); Modular walk engine; Proximal Policy Optimization (PPO)

Contacts :
0