Wenhu Chen

ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations

Apr 03, 2025

MoCha: Towards Movie-Grade Talking Character Synthesis

Mar 30, 2025

Towards Trustworthy GUI Agents: A Survey

Mar 30, 2025

Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers

Mar 14, 2025

VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search

Mar 13, 2025

YuE: Scaling Open Foundation Models for Long-Form Music Generation

Mar 11, 2025

TheoremExplainAgent: Towards Multimodal Explanations for LLM Theorem Understanding

Feb 26, 2025

ACECODER: Acing Coder RL via Automated Test-Case Synthesis

Feb 03, 2025

PixelWorld: Towards Perceiving Everything as Pixels

Jan 31, 2025

Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate

Jan 30, 2025