Linchao Zhu

Mitigating Conversational Inertia in Multi-Turn Agents

Feb 03, 2026

GPD: Guided Progressive Distillation for Fast and High-Quality Video Generation

Feb 02, 2026

MTC-VAE: Multi-Level Temporal Compression with Content Awareness

Feb 01, 2026

Unified Generation and Self-Verification for Vision-Language Models via Advantage Decoupled Preference Optimization

Jan 04, 2026

MVP: Multiple View Prediction Improves GUI Grounding

Dec 09, 2025

Code Graph Model (CGM): A Graph-Integrated Large Language Model for Repository-Level Software Engineering Tasks

May 22, 2025

Learning from Reference Answers: Versatile Language Model Alignment without Binary Human Preference Data

Apr 14, 2025

From Trial to Triumph: Advancing Long Video Understanding via Visual Context Sample Scaling and Self-reward Alignment

Mar 26, 2025

VideoGrain: Modulating Space-Time Attention for Multi-grained Video Editing

Feb 24, 2025

Noise-Tolerant Hybrid Prototypical Learning with Noisy Web Data

Jan 05, 2025