Picture for Bolei Zhou

Bolei Zhou

Learning from Active Human Involvement through Proxy Value Propagation

Add code
Feb 05, 2025
Viaarxiv icon

Embodied Scene Understanding for Vision Language Models via MetaVQA

Add code
Jan 15, 2025
Figure 1 for Embodied Scene Understanding for Vision Language Models via MetaVQA
Figure 2 for Embodied Scene Understanding for Vision Language Models via MetaVQA
Figure 3 for Embodied Scene Understanding for Vision Language Models via MetaVQA
Figure 4 for Embodied Scene Understanding for Vision Language Models via MetaVQA
Viaarxiv icon

Vid2Sim: Realistic and Interactive Simulation from Video for Urban Navigation

Add code
Jan 14, 2025
Viaarxiv icon

Joint Optimization for 4D Human-Scene Reconstruction in the Wild

Add code
Jan 04, 2025
Figure 1 for Joint Optimization for 4D Human-Scene Reconstruction in the Wild
Figure 2 for Joint Optimization for 4D Human-Scene Reconstruction in the Wild
Figure 3 for Joint Optimization for 4D Human-Scene Reconstruction in the Wild
Figure 4 for Joint Optimization for 4D Human-Scene Reconstruction in the Wild
Viaarxiv icon

Hyper: Hyperparameter Robust Efficient Exploration in Reinforcement Learning

Add code
Dec 04, 2024
Figure 1 for Hyper: Hyperparameter Robust Efficient Exploration in Reinforcement Learning
Figure 2 for Hyper: Hyperparameter Robust Efficient Exploration in Reinforcement Learning
Figure 3 for Hyper: Hyperparameter Robust Efficient Exploration in Reinforcement Learning
Figure 4 for Hyper: Hyperparameter Robust Efficient Exploration in Reinforcement Learning
Viaarxiv icon

V2XPnP: Vehicle-to-Everything Spatio-Temporal Fusion for Multi-Agent Perception and Prediction

Add code
Dec 02, 2024
Figure 1 for V2XPnP: Vehicle-to-Everything Spatio-Temporal Fusion for Multi-Agent Perception and Prediction
Figure 2 for V2XPnP: Vehicle-to-Everything Spatio-Temporal Fusion for Multi-Agent Perception and Prediction
Figure 3 for V2XPnP: Vehicle-to-Everything Spatio-Temporal Fusion for Multi-Agent Perception and Prediction
Figure 4 for V2XPnP: Vehicle-to-Everything Spatio-Temporal Fusion for Multi-Agent Perception and Prediction
Viaarxiv icon

Verbalized Representation Learning for Interpretable Few-Shot Generalization

Add code
Nov 27, 2024
Figure 1 for Verbalized Representation Learning for Interpretable Few-Shot Generalization
Figure 2 for Verbalized Representation Learning for Interpretable Few-Shot Generalization
Figure 3 for Verbalized Representation Learning for Interpretable Few-Shot Generalization
Figure 4 for Verbalized Representation Learning for Interpretable Few-Shot Generalization
Viaarxiv icon

Learning to Generate Diverse Pedestrian Movements from Web Videos with Noisy Labels

Add code
Oct 10, 2024
Figure 1 for Learning to Generate Diverse Pedestrian Movements from Web Videos with Noisy Labels
Figure 2 for Learning to Generate Diverse Pedestrian Movements from Web Videos with Noisy Labels
Figure 3 for Learning to Generate Diverse Pedestrian Movements from Web Videos with Noisy Labels
Figure 4 for Learning to Generate Diverse Pedestrian Movements from Web Videos with Noisy Labels
Viaarxiv icon

CooPre: Cooperative Pretraining for V2X Cooperative Perception

Add code
Aug 20, 2024
Figure 1 for CooPre: Cooperative Pretraining for V2X Cooperative Perception
Figure 2 for CooPre: Cooperative Pretraining for V2X Cooperative Perception
Figure 3 for CooPre: Cooperative Pretraining for V2X Cooperative Perception
Figure 4 for CooPre: Cooperative Pretraining for V2X Cooperative Perception
Viaarxiv icon

MetaUrban: A Simulation Platform for Embodied AI in Urban Spaces

Add code
Jul 11, 2024
Figure 1 for MetaUrban: A Simulation Platform for Embodied AI in Urban Spaces
Figure 2 for MetaUrban: A Simulation Platform for Embodied AI in Urban Spaces
Figure 3 for MetaUrban: A Simulation Platform for Embodied AI in Urban Spaces
Figure 4 for MetaUrban: A Simulation Platform for Embodied AI in Urban Spaces
Viaarxiv icon