微软研究院博客
AsgardBench: A benchmark for visually grounded interactive planning
Imagine a robot tasked with cleanin…
微软研究院博客
GroundedPlanBench: Spatially grounded long-horizon task planning for robot manipulation
Vision-language models (VLMs) use i…