Collections of ICLR 2026 paper: "OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models"
Zekun Qi
qizekun
AI & ML interests
Embodied Intelligence, Large Language Model, 3D Computer Vision
Recent Activity
liked a model 15 days ago
ginwind/VLA-JEPA
liked a dataset 2 months ago
stepfun-ai/Step-3.5-Flash-SFT