Picture for Xilin Chen

Xilin Chen

Robotic Programmer: Video Instructed Policy Code Generation for Robotic Manipulation

Add code
Jan 08, 2025
Viaarxiv icon

M$^3$oralBench: A MultiModal Moral Benchmark for LVLMs

Add code
Dec 30, 2024
Viaarxiv icon

Multi-P$^2$A: A Multi-perspective Benchmark on Privacy Assessment for Large Vision-Language Models

Add code
Dec 27, 2024
Viaarxiv icon

UniPose: A Unified Multimodal Framework for Human Pose Comprehension, Generation and Editing

Add code
Nov 25, 2024
Viaarxiv icon

Semantic or Covariate? A Study on the Intractable Case of Out-of-Distribution Detection

Add code
Nov 18, 2024
Figure 1 for Semantic or Covariate? A Study on the Intractable Case of Out-of-Distribution Detection
Figure 2 for Semantic or Covariate? A Study on the Intractable Case of Out-of-Distribution Detection
Figure 3 for Semantic or Covariate? A Study on the Intractable Case of Out-of-Distribution Detection
Figure 4 for Semantic or Covariate? A Study on the Intractable Case of Out-of-Distribution Detection
Viaarxiv icon

UMFC: Unsupervised Multi-Domain Feature Calibration for Vision-Language Models

Add code
Nov 11, 2024
Viaarxiv icon

CtrLoRA: An Extensible and Efficient Framework for Controllable Image Generation

Add code
Oct 12, 2024
Viaarxiv icon

HERM: Benchmarking and Enhancing Multimodal LLMs for Human-Centric Understanding

Add code
Oct 09, 2024
Viaarxiv icon

Blocks as Probes: Dissecting Categorization Ability of Large Multimodal Models

Add code
Sep 03, 2024
Figure 1 for Blocks as Probes: Dissecting Categorization Ability of Large Multimodal Models
Figure 2 for Blocks as Probes: Dissecting Categorization Ability of Large Multimodal Models
Figure 3 for Blocks as Probes: Dissecting Categorization Ability of Large Multimodal Models
Figure 4 for Blocks as Probes: Dissecting Categorization Ability of Large Multimodal Models
Viaarxiv icon

T2IShield: Defending Against Backdoors on Text-to-Image Diffusion Models

Add code
Jul 05, 2024
Viaarxiv icon