Picture for Yansong Qu

Yansong Qu

WildSeg3D: Segment Any 3D Objects in the Wild from 2D Images

Add code
Mar 11, 2025
Viaarxiv icon

AnomalyPainter: Vision-Language-Diffusion Synergy for Zero-Shot Realistic and Diverse Industrial Anomaly Synthesis

Add code
Mar 11, 2025
Viaarxiv icon

CurricuVLM: Towards Safe Autonomous Driving via Personalized Safety-Critical Curriculum Learning with Vision-Language Models

Add code
Feb 21, 2025
Viaarxiv icon

Drag Your Gaussian: Effective Drag-Based Editing with Score Distillation for 3D Gaussian Splatting

Add code
Jan 30, 2025
Figure 1 for Drag Your Gaussian: Effective Drag-Based Editing with Score Distillation for 3D Gaussian Splatting
Figure 2 for Drag Your Gaussian: Effective Drag-Based Editing with Score Distillation for 3D Gaussian Splatting
Figure 3 for Drag Your Gaussian: Effective Drag-Based Editing with Score Distillation for 3D Gaussian Splatting
Figure 4 for Drag Your Gaussian: Effective Drag-Based Editing with Score Distillation for 3D Gaussian Splatting
Viaarxiv icon

VLM-RL: A Unified Vision Language Models and Reinforcement Learning Framework for Safe Autonomous Driving

Add code
Dec 20, 2024
Figure 1 for VLM-RL: A Unified Vision Language Models and Reinforcement Learning Framework for Safe Autonomous Driving
Figure 2 for VLM-RL: A Unified Vision Language Models and Reinforcement Learning Framework for Safe Autonomous Driving
Figure 3 for VLM-RL: A Unified Vision Language Models and Reinforcement Learning Framework for Safe Autonomous Driving
Figure 4 for VLM-RL: A Unified Vision Language Models and Reinforcement Learning Framework for Safe Autonomous Driving
Viaarxiv icon

Towards 3D Semantic Scene Completion for Autonomous Driving: A Meta-Learning Framework Empowered by Deformable Large-Kernel Attention and Mamba Model

Add code
Nov 06, 2024
Viaarxiv icon

Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text

Add code
Jun 25, 2024
Figure 1 for Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text
Figure 2 for Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text
Figure 3 for Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text
Figure 4 for Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text
Viaarxiv icon

GOI: Find 3D Gaussians of Interest with an Optimizable Open-vocabulary Semantic-space Hyperplane

Add code
May 27, 2024
Figure 1 for GOI: Find 3D Gaussians of Interest with an Optimizable Open-vocabulary Semantic-space Hyperplane
Figure 2 for GOI: Find 3D Gaussians of Interest with an Optimizable Open-vocabulary Semantic-space Hyperplane
Figure 3 for GOI: Find 3D Gaussians of Interest with an Optimizable Open-vocabulary Semantic-space Hyperplane
Figure 4 for GOI: Find 3D Gaussians of Interest with an Optimizable Open-vocabulary Semantic-space Hyperplane
Viaarxiv icon

Cross-Modality Perturbation Synergy Attack for Person Re-identification

Add code
Jan 19, 2024
Viaarxiv icon