Robot Masters Terrain with Animal-Like Gait Transitions

Summary: Researchers leveraged deep reinforcement learning (DRL) to enable a robot to adaptively switch gaits, mimicking animal movements like trotting and pronking, to traverse complex terrains effectively. Their study explores the concept of viability—or fall prevention—as a primary motivator for such gait transitions, challenging previous beliefs that energy efficiency is the key driver.

This novel approach not only enhances the robot’s ability to handle challenging terrains but also provides deeper insights into animal locomotion. The team’s findings suggest that prioritizing fall prevention may lead to more agile and efficient robotic and biological movement across uneven surfaces.

Key Facts:

Gait Adaptation for Viability: The EPFL robot used DRL to learn gait transitions primarily for viability, effectively adapting its movement strategies to avoid falls when navigating terrains with gaps.
Reevaluation of Energy Efficiency: Contrary to previous theories, the study found that energy efficiency improvements are a consequence, not a driver, of gait transitions in challenging environments.
Bio-Inspired Robotic Agility: The research demonstrated a bio-inspired learning architecture that allowed for spontaneous, learning-driven gait transitions, showcasing advanced robotic agility in navigating consecutive gaps on experimental terrains.

Source: EPFL

With the help of a form of machine learning called deep reinforcement learning (DRL), the EPFL robot notably learned to transition from trotting to pronking – a leaping, arch-backed gait used by animals like springbok and gazelles – to navigate a challenging terrain with gaps ranging from 14-30cm.

The study, led by the BioRobotics Laboratory in EPFL’s School of Engineering, offers new insights into why and how such gait transitions occur in animals.

“Previous research has introduced energy efficiency and musculoskeletal injury avoidance as the two main explanations for gait transitions. More recently, biologists have argued that stability on flat terrain could be more important.

This shows the robot. — The robot spontaneously switched its gait from trotting to pronking to cross a challenging terrain with gaps. Credit: BioRob EPFL

“But animal and robotic experiments have shown that these hypotheses are not always valid, especially on uneven ground,” says PhD student Milad Shafiee, first author on a paper published in Nature Communications.

Shafiee and co-authors Guillaume Bellegarda and BioRobotics Lab head Auke Ijspeert were therefore interested in a new hypothesis for why gait transitions occur: viability, or fall avoidance. To test this hypothesis, they used DRL to train a quadruped robot to cross various terrains.

On flat terrain, they found that different gaits showed different levels of robustness against random pushes, and that the robot switched from a walk to a trot to maintain viability, just as quadruped animals do when they accelerate.

And when confronted with successive gaps in the experimental surface, the robot spontaneously switched from trotting to pronking to avoid falls. Moreover, viability was the only factor that was improved by such gait transitions.

“We showed that on flat terrain and challenging discrete terrain, viability leads to the emergence of gait transitions, but that energy efficiency is not necessarily improved,” Shafiee explains.

“It seems that energy efficiency, which was previously thought to be a driver of such transitions, may be more of a consequence. When an animal is navigating challenging terrain, it’s likely that its first priority is not falling, followed by energy efficiency.”

A bio-inspired learning architecture

To model locomotion control in their robot, the researchers considered the three interacting elements that drive animal movement: the brain, the spinal cord, and sensory feedback from the body.

They used DRL to train a neural network to imitate the spinal cord’s transmission of brain signals to the body as the robot crossed an experimental terrain. Then, the team assigned different weights to three possible learning goals: energy efficiency, force reduction, and viability.

A series of computer simulations revealed that of these three goals, viability was the only one that prompted the robot to automatically – without instruction from the scientists – change its gait.

The team emphasizes that these observations represent the first learning-based locomotion framework in which gait transitions emerge spontaneously during the learning process, as well as the most dynamic crossing of such large consecutive gaps for a quadrupedal robot.

“Our bio-inspired learning architecture demonstrated state-of-the-art quadruped robot agility on the challenging terrain,” Shafiee says.

The researchers aim to expand on their work with additional experiments that place different types of robots in a wider variety of challenging environments.

In addition to further elucidating animal locomotion, they hope that ultimately, their work will enable the more widespread use of robots for biological research, reducing reliance on animal models and the associated ethics concerns.

About this robotics and AI research news

Author: Celia Luterbacher
Source: EPFL
Contact: Celia Luterbacher – EPFL
Image: The image is credited to BioRob EPFL

Original Research: Open access.
“Viability leads to the emergence of gait transitions in learning agile quadrupedal locomotion on challenging terrains” by Milad Shafiee et al. Nature Communications

Abstract

Viability leads to the emergence of gait transitions in learning agile quadrupedal locomotion on challenging terrains

Quadruped animals are capable of seamless transitions between different gaits. While energy efficiency appears to be one of the reasons for changing gaits, other determinant factors likely play a role too, including terrain properties.

In this article, we propose that viability, i.e., the avoidance of falls, represents an important criterion for gait transitions.

We investigate the emergence of gait transitions through the interaction between supraspinal drive (brain), the central pattern generator in the spinal cord, the body, and exteroceptive sensing by leveraging deep reinforcement learning and robotics tools.

Consistent with quadruped animal data, we show that the walk-trot gait transition for quadruped robots on flat terrain improves both viability and energy efficiency.

Furthermore, we investigate the effects of discrete terrain (i.e., crossing successive gaps) on imposing gait transitions, and find the emergence of trot-pronk transitions to avoid non-viable states.

Viability is the only improved factor after gait transitions on both flat and discrete gap terrains, suggesting that viability could be a primary and universal objective of gait transitions, while other criteria are secondary objectives and/or a consequence of viability.

Moreover, our experiments demonstrate state-of-the-art quadruped robot agility in challenging scenarios.