Picture for Yansong Qu

Yansong Qu

SynergyAmodal: Deocclude Anything with Text Control

Add code
Apr 28, 2025
Viaarxiv icon

Sky-Drive: A Distributed Multi-Agent Simulation Platform for Socially-Aware and Human-AI Collaborative Future Transportation

Add code
Apr 25, 2025
Viaarxiv icon

Training-Free Hierarchical Scene Understanding for Gaussian Splatting with Superpoint Graphs

Add code
Apr 17, 2025
Viaarxiv icon

WildSeg3D: Segment Any 3D Objects in the Wild from 2D Images

Add code
Mar 11, 2025
Viaarxiv icon

AnomalyPainter: Vision-Language-Diffusion Synergy for Zero-Shot Realistic and Diverse Industrial Anomaly Synthesis

Add code
Mar 11, 2025
Viaarxiv icon

CurricuVLM: Towards Safe Autonomous Driving via Personalized Safety-Critical Curriculum Learning with Vision-Language Models

Add code
Feb 21, 2025
Viaarxiv icon

Drag Your Gaussian: Effective Drag-Based Editing with Score Distillation for 3D Gaussian Splatting

Add code
Jan 30, 2025
Figure 1 for Drag Your Gaussian: Effective Drag-Based Editing with Score Distillation for 3D Gaussian Splatting
Figure 2 for Drag Your Gaussian: Effective Drag-Based Editing with Score Distillation for 3D Gaussian Splatting
Figure 3 for Drag Your Gaussian: Effective Drag-Based Editing with Score Distillation for 3D Gaussian Splatting
Figure 4 for Drag Your Gaussian: Effective Drag-Based Editing with Score Distillation for 3D Gaussian Splatting
Viaarxiv icon

VLM-RL: A Unified Vision Language Models and Reinforcement Learning Framework for Safe Autonomous Driving

Add code
Dec 20, 2024
Figure 1 for VLM-RL: A Unified Vision Language Models and Reinforcement Learning Framework for Safe Autonomous Driving
Figure 2 for VLM-RL: A Unified Vision Language Models and Reinforcement Learning Framework for Safe Autonomous Driving
Figure 3 for VLM-RL: A Unified Vision Language Models and Reinforcement Learning Framework for Safe Autonomous Driving
Figure 4 for VLM-RL: A Unified Vision Language Models and Reinforcement Learning Framework for Safe Autonomous Driving
Viaarxiv icon

Towards 3D Semantic Scene Completion for Autonomous Driving: A Meta-Learning Framework Empowered by Deformable Large-Kernel Attention and Mamba Model

Add code
Nov 06, 2024
Viaarxiv icon

Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text

Add code
Jun 25, 2024
Figure 1 for Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text
Figure 2 for Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text
Figure 3 for Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text
Figure 4 for Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text
Viaarxiv icon