Picture for Xiao Yang

Xiao Yang

Think-Then-Generate: Reasoning-Aware Text-to-Image Diffusion with LLM Encoders

Add code
Jan 15, 2026
Viaarxiv icon

ReflexDiffusion: Reflection-Enhanced Trajectory Planning for High-lateral-acceleration Scenarios in Autonomous Driving

Add code
Jan 14, 2026
Viaarxiv icon

Focal Guidance: Unlocking Controllability from Semantic-Weak Layers in Video Diffusion Models

Add code
Jan 12, 2026
Viaarxiv icon

AgentHallu: Benchmarking Automated Hallucination Attribution of LLM-based Agents

Add code
Jan 11, 2026
Viaarxiv icon

RoboSafe: Safeguarding Embodied Agents via Executable Safety Logic

Add code
Dec 24, 2025
Viaarxiv icon

SemCovert: Secure and Covert Video Transmission via Deep Semantic-Level Hiding

Add code
Dec 23, 2025
Viaarxiv icon

EHRStruct: A Comprehensive Benchmark Framework for Evaluating Large Language Models on Structured Electronic Health Record Tasks

Add code
Nov 16, 2025
Viaarxiv icon

KG-DF: A Black-box Defense Framework against Jailbreak Attacks Based on Knowledge Graphs

Add code
Nov 09, 2025
Viaarxiv icon

Make It Long, Keep It Fast: End-to-End 10k-Sequence Modeling at Billion Scale on Douyin

Add code
Nov 08, 2025
Figure 1 for Make It Long, Keep It Fast: End-to-End 10k-Sequence Modeling at Billion Scale on Douyin
Figure 2 for Make It Long, Keep It Fast: End-to-End 10k-Sequence Modeling at Billion Scale on Douyin
Figure 3 for Make It Long, Keep It Fast: End-to-End 10k-Sequence Modeling at Billion Scale on Douyin
Figure 4 for Make It Long, Keep It Fast: End-to-End 10k-Sequence Modeling at Billion Scale on Douyin
Viaarxiv icon

CRAG-MM: Multi-modal Multi-turn Comprehensive RAG Benchmark

Add code
Oct 30, 2025
Figure 1 for CRAG-MM: Multi-modal Multi-turn Comprehensive RAG Benchmark
Figure 2 for CRAG-MM: Multi-modal Multi-turn Comprehensive RAG Benchmark
Figure 3 for CRAG-MM: Multi-modal Multi-turn Comprehensive RAG Benchmark
Figure 4 for CRAG-MM: Multi-modal Multi-turn Comprehensive RAG Benchmark
Viaarxiv icon