Picture for Hui Xue

Hui Xue

Why Steering Works: Toward a Unified View of Language Model Parameter Dynamics

Add code
Feb 02, 2026
Viaarxiv icon

AgentDoG: A Diagnostic Guardrail Framework for AI Agent Safety and Security

Add code
Jan 26, 2026
Viaarxiv icon

YuFeng-XGuard: A Reasoning-Centric, Interpretable, and Flexible Guardrail Model for Large Language Models

Add code
Jan 22, 2026
Viaarxiv icon

A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5

Add code
Jan 16, 2026
Viaarxiv icon

Adaptive Hyperbolic Kernels: Modulated Embedding in de Branges-Rovnyak Spaces

Add code
Nov 13, 2025
Viaarxiv icon

Generative Video Matting

Add code
Aug 11, 2025
Viaarxiv icon

Dynamic Mixture of Curriculum LoRA Experts for Continual Multimodal Instruction Tuning

Add code
Jun 13, 2025
Viaarxiv icon

Reviving DSP for Advanced Theorem Proving in the Era of Reasoning Models

Add code
Jun 13, 2025
Viaarxiv icon

Score-based Generative Modeling for Conditional Independence Testing

Add code
May 29, 2025
Viaarxiv icon

Towards the Resistance of Neural Network Watermarking to Fine-tuning

Add code
May 02, 2025
Figure 1 for Towards the Resistance of Neural Network Watermarking to Fine-tuning
Figure 2 for Towards the Resistance of Neural Network Watermarking to Fine-tuning
Figure 3 for Towards the Resistance of Neural Network Watermarking to Fine-tuning
Figure 4 for Towards the Resistance of Neural Network Watermarking to Fine-tuning
Viaarxiv icon