Picture for Xilin Chen

Xilin Chen

UniPose: A Unified Multimodal Framework for Human Pose Comprehension, Generation and Editing

Add code
Nov 25, 2024
Viaarxiv icon

Semantic or Covariate? A Study on the Intractable Case of Out-of-Distribution Detection

Add code
Nov 18, 2024
Figure 1 for Semantic or Covariate? A Study on the Intractable Case of Out-of-Distribution Detection
Figure 2 for Semantic or Covariate? A Study on the Intractable Case of Out-of-Distribution Detection
Figure 3 for Semantic or Covariate? A Study on the Intractable Case of Out-of-Distribution Detection
Figure 4 for Semantic or Covariate? A Study on the Intractable Case of Out-of-Distribution Detection
Viaarxiv icon

UMFC: Unsupervised Multi-Domain Feature Calibration for Vision-Language Models

Add code
Nov 11, 2024
Viaarxiv icon

CtrLoRA: An Extensible and Efficient Framework for Controllable Image Generation

Add code
Oct 12, 2024
Viaarxiv icon

HERM: Benchmarking and Enhancing Multimodal LLMs for Human-Centric Understanding

Add code
Oct 09, 2024
Viaarxiv icon

Blocks as Probes: Dissecting Categorization Ability of Large Multimodal Models

Add code
Sep 03, 2024
Figure 1 for Blocks as Probes: Dissecting Categorization Ability of Large Multimodal Models
Figure 2 for Blocks as Probes: Dissecting Categorization Ability of Large Multimodal Models
Figure 3 for Blocks as Probes: Dissecting Categorization Ability of Large Multimodal Models
Figure 4 for Blocks as Probes: Dissecting Categorization Ability of Large Multimodal Models
Viaarxiv icon

T2IShield: Defending Against Backdoors on Text-to-Image Diffusion Models

Add code
Jul 05, 2024
Viaarxiv icon

Dysca: A Dynamic and Scalable Benchmark for Evaluating Perception Ability of LVLMs

Add code
Jun 27, 2024
Viaarxiv icon

Evaluating the Quality of Hallucination Benchmarks for Large Vision-Language Models

Add code
Jun 24, 2024
Viaarxiv icon

VLBiasBench: A Comprehensive Benchmark for Evaluating Bias in Large Vision-Language Model

Add code
Jun 20, 2024
Viaarxiv icon