Picture for Bin Zhao

Bin Zhao

MoMa-Kitchen: A 100K+ Benchmark for Affordance-Grounded Last-Mile Navigation in Mobile Manipulation

Add code
Mar 14, 2025
Viaarxiv icon

AgiBot World Colosseo: A Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems

Add code
Mar 09, 2025
Viaarxiv icon

OpenFly: A Versatile Toolchain and Large-scale Benchmark for Aerial Vision-Language Navigation

Add code
Feb 25, 2025
Viaarxiv icon

Exploring the Potential of Encoder-free Architectures in 3D LMMs

Add code
Feb 13, 2025
Figure 1 for Exploring the Potential of Encoder-free Architectures in 3D LMMs
Figure 2 for Exploring the Potential of Encoder-free Architectures in 3D LMMs
Figure 3 for Exploring the Potential of Encoder-free Architectures in 3D LMMs
Figure 4 for Exploring the Potential of Encoder-free Architectures in 3D LMMs
Viaarxiv icon

SpatialVLA: Exploring Spatial Representations for Visual-Language-Action Model

Add code
Jan 27, 2025
Figure 1 for SpatialVLA: Exploring Spatial Representations for Visual-Language-Action Model
Figure 2 for SpatialVLA: Exploring Spatial Representations for Visual-Language-Action Model
Figure 3 for SpatialVLA: Exploring Spatial Representations for Visual-Language-Action Model
Figure 4 for SpatialVLA: Exploring Spatial Representations for Visual-Language-Action Model
Viaarxiv icon

Open-Vocabulary Octree-Graph for 3D Scene Understanding

Add code
Nov 25, 2024
Figure 1 for Open-Vocabulary Octree-Graph for 3D Scene Understanding
Figure 2 for Open-Vocabulary Octree-Graph for 3D Scene Understanding
Figure 3 for Open-Vocabulary Octree-Graph for 3D Scene Understanding
Figure 4 for Open-Vocabulary Octree-Graph for 3D Scene Understanding
Viaarxiv icon

Night-to-Day Translation via Illumination Degradation Disentanglement

Add code
Nov 21, 2024
Viaarxiv icon

FreeGaussian: Guidance-free Controllable 3D Gaussian Splats with Flow Derivatives

Add code
Oct 29, 2024
Figure 1 for FreeGaussian: Guidance-free Controllable 3D Gaussian Splats with Flow Derivatives
Figure 2 for FreeGaussian: Guidance-free Controllable 3D Gaussian Splats with Flow Derivatives
Figure 3 for FreeGaussian: Guidance-free Controllable 3D Gaussian Splats with Flow Derivatives
Figure 4 for FreeGaussian: Guidance-free Controllable 3D Gaussian Splats with Flow Derivatives
Viaarxiv icon

Towards Flexible and Efficient Diffusion Low Light Enhancer

Add code
Oct 16, 2024
Figure 1 for Towards Flexible and Efficient Diffusion Low Light Enhancer
Figure 2 for Towards Flexible and Efficient Diffusion Low Light Enhancer
Figure 3 for Towards Flexible and Efficient Diffusion Low Light Enhancer
Figure 4 for Towards Flexible and Efficient Diffusion Low Light Enhancer
Viaarxiv icon

Aerial Vision-and-Language Navigation via Semantic-Topo-Metric Representation Guided LLM Reasoning

Add code
Oct 11, 2024
Figure 1 for Aerial Vision-and-Language Navigation via Semantic-Topo-Metric Representation Guided LLM Reasoning
Figure 2 for Aerial Vision-and-Language Navigation via Semantic-Topo-Metric Representation Guided LLM Reasoning
Figure 3 for Aerial Vision-and-Language Navigation via Semantic-Topo-Metric Representation Guided LLM Reasoning
Figure 4 for Aerial Vision-and-Language Navigation via Semantic-Topo-Metric Representation Guided LLM Reasoning
Viaarxiv icon