Saving the Limping

Masked Sensory-Temporal Attention for Sensor Generalization in Quadruped Locomotion

Dikai Liu^{1, 2}

Tianwei Zhang²

Jianxiong Yin¹

Simon See^{1, 2, 3}

¹NVIDIA AI Technology Centre (NVAITC)

²Nanyang Technological University

³Coventry University

Paper

Abstract

With the rising focus on quadrupeds, a generalized policy capable of handling different robot models and sensor inputs becomes highly beneficial. Although several methods have been proposed to address different morphologies, it remains a challenge for learning-based policies to manage various combinations of proprioceptive information. This paper presents Masked Sensory-Temporal Attention (MSTA), a novel transformer-based mechanism with masking for quadruped locomotion. It employs direct sensor-level attention to enhance the sensory-temporal understanding and handle different combinations of sensor data, serving as a foundation for incorporating unseen information. MSTA can effectively understand its states even with a large portion of missing information, and is flexible enough to be deployed on physical systems despite the long input sequence.