Hi Robot: Open-Ended Instruction Following with Hierarchical Vision-Language-Action Models

YouTube Seminar

A paper review seminar on Hi Robot, which proposes hierarchical Vision-Language-Action models for open-ended instruction following in robotic manipulation.




Enjoy Reading This Article?

Here are some more articles you might like to read next:

  • OpenVLA: An Open-Source Vision-Language-Action Model
  • End-to-End Semi-Supervised 3D Instance Segmentation