GeneralVLA design integrates ASM and Knowledge-Guided Trajectory Planning for generalizable zero-shot robotic manipulation and scalable demonstration-free data generation across tasks.
Selected real-world episodes (videos 7–10) showcase the model acting in unconstrained physical settings. Each clip is labeled beneath the footage for quick reference.
Controlled simulation episodes (clips 1–6) highlight the model’s behavior in the virtual testbed, with motion traces rendered as GIFs for quick comparison.
@article{ma2026generalvla,
title={GeneralVLA: Generalizable Vision-Language-Action Models with Knowledge-Guided Trajectory Planning},
author={Ma, Guoqing and Wang, Siheng and Zhang, Zeyu and Yu, Shan and Tang, Hao},
journal={arXiv preprint arXiv:2602.04315},
year={2026}
}