Picture for Zhixin Zhang

Zhixin Zhang

RACA: Representation-Aware Coverage Criteria for LLM Safety Testing

Add code
Feb 02, 2026
Viaarxiv icon

StackPlanner: A Centralized Hierarchical Multi-Agent System with Task-Experience Memory Management

Add code
Jan 09, 2026
Viaarxiv icon

UniAPO: Unified Multimodal Automated Prompt Optimization

Add code
Aug 25, 2025
Figure 1 for UniAPO: Unified Multimodal Automated Prompt Optimization
Figure 2 for UniAPO: Unified Multimodal Automated Prompt Optimization
Figure 3 for UniAPO: Unified Multimodal Automated Prompt Optimization
Figure 4 for UniAPO: Unified Multimodal Automated Prompt Optimization
Viaarxiv icon

Filter-And-Refine: A MLLM Based Cascade System for Industrial-Scale Video Content Moderation

Add code
Jul 23, 2025
Viaarxiv icon

Mitigating Fine-tuning Risks in LLMs via Safety-Aware Probing Optimization

Add code
May 22, 2025
Viaarxiv icon

Debt Collection Negotiations with Large Language Models: An Evaluation System and Optimizing Decision Making with Multi-Agent

Add code
Feb 25, 2025
Figure 1 for Debt Collection Negotiations with Large Language Models: An Evaluation System and Optimizing Decision Making with Multi-Agent
Figure 2 for Debt Collection Negotiations with Large Language Models: An Evaluation System and Optimizing Decision Making with Multi-Agent
Figure 3 for Debt Collection Negotiations with Large Language Models: An Evaluation System and Optimizing Decision Making with Multi-Agent
Figure 4 for Debt Collection Negotiations with Large Language Models: An Evaluation System and Optimizing Decision Making with Multi-Agent
Viaarxiv icon

Stackelberg Game Preference Optimization for Data-Efficient Alignment of Language Models

Add code
Feb 25, 2025
Viaarxiv icon

Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines

Add code
Oct 28, 2024
Figure 1 for Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines
Figure 2 for Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines
Figure 3 for Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines
Figure 4 for Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines
Viaarxiv icon

CPFD: Confidence-aware Privileged Feature Distillation for Short Video Classification

Add code
Oct 07, 2024
Figure 1 for CPFD: Confidence-aware Privileged Feature Distillation for Short Video Classification
Figure 2 for CPFD: Confidence-aware Privileged Feature Distillation for Short Video Classification
Figure 3 for CPFD: Confidence-aware Privileged Feature Distillation for Short Video Classification
Figure 4 for CPFD: Confidence-aware Privileged Feature Distillation for Short Video Classification
Viaarxiv icon

InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions

Add code
Feb 05, 2024
Viaarxiv icon