Picture for Hui Xue

Hui Xue

Diffusion Probe: Generated Image Result Prediction Using CNN Probes

Add code
Mar 05, 2026
Viaarxiv icon

SRasP: Self-Reorientation Adversarial Style Perturbation for Cross-Domain Few-Shot Learning

Add code
Mar 05, 2026
Viaarxiv icon

TC-Padé: Trajectory-Consistent Padé Approximation for Diffusion Acceleration

Add code
Mar 03, 2026
Viaarxiv icon

How Controllable Are Large Language Models? A Unified Evaluation across Behavioral Granularities

Add code
Mar 03, 2026
Viaarxiv icon

Why Steering Works: Toward a Unified View of Language Model Parameter Dynamics

Add code
Feb 02, 2026
Viaarxiv icon

A Causal Perspective for Enhancing Jailbreak Attack and Defense

Add code
Jan 31, 2026
Viaarxiv icon

AgentDoG: A Diagnostic Guardrail Framework for AI Agent Safety and Security

Add code
Jan 26, 2026
Viaarxiv icon

YuFeng-XGuard: A Reasoning-Centric, Interpretable, and Flexible Guardrail Model for Large Language Models

Add code
Jan 22, 2026
Viaarxiv icon

A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5

Add code
Jan 16, 2026
Viaarxiv icon

Adaptive Hyperbolic Kernels: Modulated Embedding in de Branges-Rovnyak Spaces

Add code
Nov 13, 2025
Viaarxiv icon