Picture for Yu Gu

Yu Gu

DAIEN-TTS: Disentangled Audio Infilling for Environment-Aware Text-to-Speech Synthesis

Add code
Sep 18, 2025
Viaarxiv icon

What-If Analysis of Large Language Models: Explore the Game World Using Proactive Thinking

Add code
Sep 05, 2025
Viaarxiv icon

AURAD: Anatomy-Pathology Unified Radiology Synthesis with Progressive Representations

Add code
Sep 05, 2025
Viaarxiv icon

SSG-Dit: A Spatial Signal Guided Framework for Controllable Video Generation

Add code
Aug 23, 2025
Viaarxiv icon

Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge

Add code
Jun 26, 2025
Figure 1 for Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge
Figure 2 for Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge
Figure 3 for Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge
Figure 4 for Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge
Viaarxiv icon

EULER: Enhancing the Reasoning Ability of Large Language Models through Error-Induced Learning

Add code
May 28, 2025
Viaarxiv icon

Learning to Route Queries Across Knowledge Bases for Step-wise Retrieval-Augmented Reasoning

Add code
May 28, 2025
Viaarxiv icon

ConsRec: Denoising Sequential Recommendation through User-Consistent Preference Modeling

Add code
May 28, 2025
Viaarxiv icon

Mitigating Audiovisual Mismatch in Visual-Guide Audio Captioning

Add code
May 28, 2025
Viaarxiv icon

X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains

Add code
May 06, 2025
Viaarxiv icon