Picture for Yilun Chen

Yilun Chen

Bench4Merge: A Comprehensive Benchmark for Merging in Realistic Dense Traffic with Micro-Interactive Vehicles

Add code
Oct 21, 2024
Viaarxiv icon

VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding

Add code
Oct 17, 2024
Viaarxiv icon

SLM-Mod: Small Language Models Surpass LLMs at Content Moderation

Add code
Oct 17, 2024
Viaarxiv icon

Dual-AEB: Synergizing Rule-Based and Multimodal Large Language Models for Effective Emergency Braking

Add code
Oct 11, 2024
Figure 1 for Dual-AEB: Synergizing Rule-Based and Multimodal Large Language Models for Effective Emergency Braking
Figure 2 for Dual-AEB: Synergizing Rule-Based and Multimodal Large Language Models for Effective Emergency Braking
Figure 3 for Dual-AEB: Synergizing Rule-Based and Multimodal Large Language Models for Effective Emergency Braking
Figure 4 for Dual-AEB: Synergizing Rule-Based and Multimodal Large Language Models for Effective Emergency Braking
Viaarxiv icon

GRUtopia: Dream General Robots in a City at Scale

Add code
Jul 15, 2024
Viaarxiv icon

OVExp: Open Vocabulary Exploration for Object-Oriented Navigation

Add code
Jul 12, 2024
Figure 1 for OVExp: Open Vocabulary Exploration for Object-Oriented Navigation
Figure 2 for OVExp: Open Vocabulary Exploration for Object-Oriented Navigation
Figure 3 for OVExp: Open Vocabulary Exploration for Object-Oriented Navigation
Figure 4 for OVExp: Open Vocabulary Exploration for Object-Oriented Navigation
Viaarxiv icon

MMScan: A Multi-Modal 3D Scene Dataset with Hierarchical Grounded Language Annotations

Add code
Jun 13, 2024
Viaarxiv icon

Generalization Beyond Data Imbalance: A Controlled Study on CLIP for Transferable Insights

Add code
May 31, 2024
Viaarxiv icon

Grounded 3D-LLM with Referent Tokens

Add code
May 16, 2024
Viaarxiv icon

3DGSR: Implicit Surface Reconstruction with 3D Gaussian Splatting

Add code
Mar 30, 2024
Viaarxiv icon