Picture for Kai-Wei Chang

Kai-Wei Chang

Enhancing LLM Character-Level Manipulation via Divide and Conquer

Add code
Feb 12, 2025
Viaarxiv icon

Fact-or-Fair: A Checklist for Behavioral Testing of AI Models on Fairness-Related Queries

Add code
Feb 09, 2025
Viaarxiv icon

QLASS: Boosting Language Agent Inference via Q-Guided Stepwise Search

Add code
Feb 04, 2025
Viaarxiv icon

STIV: Scalable Text and Image Conditioned Video Generation

Add code
Dec 10, 2024
Viaarxiv icon

SafeWorld: Geo-Diverse Safety Alignment

Add code
Dec 09, 2024
Viaarxiv icon

VISCO: Benchmarking Fine-Grained Critique and Correction Towards Self-Improvement in Visual Reasoning

Add code
Dec 03, 2024
Viaarxiv icon

Verbalized Representation Learning for Interpretable Few-Shot Generalization

Add code
Nov 27, 2024
Figure 1 for Verbalized Representation Learning for Interpretable Few-Shot Generalization
Figure 2 for Verbalized Representation Learning for Interpretable Few-Shot Generalization
Figure 3 for Verbalized Representation Learning for Interpretable Few-Shot Generalization
Figure 4 for Verbalized Representation Learning for Interpretable Few-Shot Generalization
Viaarxiv icon

DRS: Deep Question Reformulation With Structured Output

Add code
Nov 27, 2024
Figure 1 for DRS: Deep Question Reformulation With Structured Output
Figure 2 for DRS: Deep Question Reformulation With Structured Output
Figure 3 for DRS: Deep Question Reformulation With Structured Output
Figure 4 for DRS: Deep Question Reformulation With Structured Output
Viaarxiv icon

Exploring Visual Vulnerabilities via Multi-Loss Adversarial Search for Jailbreaking Vision-Language Models

Add code
Nov 27, 2024
Viaarxiv icon

Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks

Add code
Nov 08, 2024
Figure 1 for Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks
Figure 2 for Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks
Figure 3 for Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks
Figure 4 for Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks
Viaarxiv icon