Publications
2026
-
Zero-Shot Long-Horizon Dexterous Manipulation via Multi-View 3D-Grounded VLM ReasoningarXiv 2026 -
SimuScene: Simulation-Ready Compositional 3D Scene Reconstruction from a Single ImagearXiv 2026 -
Text-Guided 6D Object Pose Rearrangement via Closed-Loop VLM AgentsECCV 2026
2025
-
Learning to Generate Human-Human-Object Interactions from Textual DescriptionsNeurIPS 2025 -
Learning 3D Object Spatial Relationships from Pre-trained 2D Diffusion ModelsICCV 2025 -
DAViD: Modeling Dynamic Affordance of 3D Objects using Pre-trained Video Diffusion ModelsICCV 2025