Picture for Yu Gu

Yu Gu

Scaling medical imaging report generation with multimodal reinforcement learning

Add code
Jan 23, 2026
Viaarxiv icon

Beyond Hard Writes and Rigid Preservation: Soft Recursive Least-Squares for Lifelong LLM Editing

Add code
Jan 22, 2026
Viaarxiv icon

Teaching LLMs to Learn Tool Trialing and Execution through Environment Interaction

Add code
Jan 19, 2026
Viaarxiv icon

Long-Chain Reasoning Distillation via Adaptive Prefix Alignment

Add code
Jan 15, 2026
Viaarxiv icon

Revealing the Attention Floating Mechanism in Masked Diffusion Models

Add code
Jan 12, 2026
Viaarxiv icon

JoyVoice: Long-Context Conditioning for Anthropomorphic Multi-Speaker Conversational Synthesis

Add code
Dec 22, 2025
Viaarxiv icon

DAIEN-TTS: Disentangled Audio Infilling for Environment-Aware Text-to-Speech Synthesis

Add code
Sep 18, 2025
Figure 1 for DAIEN-TTS: Disentangled Audio Infilling for Environment-Aware Text-to-Speech Synthesis
Figure 2 for DAIEN-TTS: Disentangled Audio Infilling for Environment-Aware Text-to-Speech Synthesis
Figure 3 for DAIEN-TTS: Disentangled Audio Infilling for Environment-Aware Text-to-Speech Synthesis
Viaarxiv icon

AURAD: Anatomy-Pathology Unified Radiology Synthesis with Progressive Representations

Add code
Sep 05, 2025
Figure 1 for AURAD: Anatomy-Pathology Unified Radiology Synthesis with Progressive Representations
Figure 2 for AURAD: Anatomy-Pathology Unified Radiology Synthesis with Progressive Representations
Figure 3 for AURAD: Anatomy-Pathology Unified Radiology Synthesis with Progressive Representations
Figure 4 for AURAD: Anatomy-Pathology Unified Radiology Synthesis with Progressive Representations
Viaarxiv icon

What-If Analysis of Large Language Models: Explore the Game World Using Proactive Thinking

Add code
Sep 05, 2025
Viaarxiv icon

SSG-Dit: A Spatial Signal Guided Framework for Controllable Video Generation

Add code
Aug 23, 2025
Viaarxiv icon