Learning Terrain-Aware Whole-Body Control for Perceptive Legged Loco-Manipulation
学习面向感知的腿式移动操作的地形感知全身控制
Sikai Guo, Yudong Zhong, Guoyang Zhao, Botao Dang, Zhihai Bi, Jun Ma
AI总结 提出TA-WBC框架,通过混合外感受编码器提取地形特征、基于脚接触平面的末端执行器采样方法以及双策略蒸馏模块,实现腿式机械臂在复杂地形上的全身移动操作控制。
详情
腿式机械臂结合了卓越的地形适应性和移动操作能力,使其在人类中心环境中极具应用前景。通过协调腿和臂的控制,全身控制器可以显著扩展腿式机械臂的操作工作空间。然而,许多现有的全身控制器主要依赖于本体感觉,并未整合有效地形拓扑感知所需的关键外部感受。这一限制可能阻碍它们适应不同环境条件并有效导航复杂地形。在本文中,我们介绍了TA-WBC,一种用于腿式机械臂的地形感知全身控制框架,其特点是一种新颖的基于强化学习的统一策略,专门针对各种地形中的全身移动操作任务。具体来说,我们采用混合外感受编码器提取地形特征,为机器人主动调整姿态和立足点提供必要基础。此外,为了促进稳定的跨地形移动操作,我们提出了一种基于脚接触平面的新颖末端执行器采样方法,将操作目标与基座波动解耦。此外,引入了双策略蒸馏模块,以在不发生灾难性遗忘的情况下整合广泛的全身运动与地形适应性。仿真和真实世界实验验证了我们提出的控制器的鲁棒性,该控制器实现了更大的可达空间、更小的跟踪误差和减少的意外绊倒。这一统一策略突显了腿式机械臂在复杂地形上执行移动操作任务的有前景的能力。
Legged manipulators integrate exceptional terrain adaptability along with mobile manipulation capabilities, which make them highly promising for deployment in human-centric environments. By coordinating the control of both legs and arms, a whole-body controller can significantly expand the operational workspace of legged manipulators. However, many existing whole-body controllers primarily depend on proprioception and do not incorporate the critical exteroception required for effective terrain topology perception. This limitation can hinder their ability to adapt to varying environmental conditions and navigate complex terrains effectively. In this paper, we introduce TA-WBC, a terrain-aware whole-body control framework for legged manipulators, which features a novel RL-based unified policy tailored to whole-body loco-manipulation tasks in various terrains. Specifically, we employ a hybrid exteroception encoder to extract terrain features, providing an essential basis for the robot to proactively adapt posture and footholds. Furthermore, to facilitate stable cross-terrain loco-manipulation, we propose a novel end-effector sampling method based on the foot contact plane, decoupling manipulation target from base fluctuations. Moreover, a dual-policy distillation module is introduced to integrate expansive whole-body motion with terrain adaptability without catastrophic forgetting. The simulation and real-world experiments validate the robustness of our proposed controller, which leads to a larger reachable space, less tracking error, and reduced unexpected stumbles. This unified policy highlights the promising capabilities of legged manipulators in performing loco-manipulation tasks across complex terrains.