Botao Ye

Window Token Concatenation for Efficient Visual Large Language Models

Apr 05, 2025

Synthesizing Consistent Novel Views via 3D Epipolar Attention without Re-Training

Feb 25, 2025

GEM: A Generalizable Ego-Vision Multimodal World Model for Fine-Grained Ego-Motion, Object Dynamics, and Scene Composition Control

Dec 15, 2024

No Pose, No Problem: Surprisingly Simple 3D Gaussian Splats from Sparse Unposed Images

Oct 31, 2024

Joint Feature Learning and Relation Modeling for Tracking: A One-Stream Framework

Mar 24, 2022

Geometry-aware data augmentation for monocular 3D object detection

Apr 12, 2021