RT-2: New model translates vision and language into action | Haber Detay
RT-2: New model translates vision and language into action
Category: DeepMind Blog | Date: 2025-06-25 11:23:15
Robotic Transformer 2 (RT-2) is a novel vision-language-action (VLA) model that learns from both web and robotics data, and translates this knowledge into generalised instructions for robotic control.