Picture for Dacheng Li

Dacheng Li

Climbing the Ladder of Reasoning: What LLMs Can-and Still Can't-Solve after SFT?

Add code
Apr 16, 2025
Figure 1 for Climbing the Ladder of Reasoning: What LLMs Can-and Still Can't-Solve after SFT?
Figure 2 for Climbing the Ladder of Reasoning: What LLMs Can-and Still Can't-Solve after SFT?
Figure 3 for Climbing the Ladder of Reasoning: What LLMs Can-and Still Can't-Solve after SFT?
Figure 4 for Climbing the Ladder of Reasoning: What LLMs Can-and Still Can't-Solve after SFT?
Viaarxiv icon

WorldModelBench: Judging Video Generation Models As World Models

Add code
Feb 28, 2025
Figure 1 for WorldModelBench: Judging Video Generation Models As World Models
Figure 2 for WorldModelBench: Judging Video Generation Models As World Models
Figure 3 for WorldModelBench: Judging Video Generation Models As World Models
Figure 4 for WorldModelBench: Judging Video Generation Models As World Models
Viaarxiv icon

S*: Test Time Scaling for Code Generation

Add code
Feb 20, 2025
Figure 1 for S*: Test Time Scaling for Code Generation
Figure 2 for S*: Test Time Scaling for Code Generation
Figure 3 for S*: Test Time Scaling for Code Generation
Figure 4 for S*: Test Time Scaling for Code Generation
Viaarxiv icon

The Danger of Overthinking: Examining the Reasoning-Action Dilemma in Agentic Tasks

Add code
Feb 12, 2025
Figure 1 for The Danger of Overthinking: Examining the Reasoning-Action Dilemma in Agentic Tasks
Figure 2 for The Danger of Overthinking: Examining the Reasoning-Action Dilemma in Agentic Tasks
Figure 3 for The Danger of Overthinking: Examining the Reasoning-Action Dilemma in Agentic Tasks
Figure 4 for The Danger of Overthinking: Examining the Reasoning-Action Dilemma in Agentic Tasks
Viaarxiv icon

LLMs Can Easily Learn to Reason from Demonstrations Structure, not content, is what matters!

Add code
Feb 11, 2025
Viaarxiv icon

Efficient-vDiT: Efficient Video Diffusion Transformers With Attention Tile

Add code
Feb 10, 2025
Figure 1 for Efficient-vDiT: Efficient Video Diffusion Transformers With Attention Tile
Figure 2 for Efficient-vDiT: Efficient Video Diffusion Transformers With Attention Tile
Figure 3 for Efficient-vDiT: Efficient Video Diffusion Transformers With Attention Tile
Figure 4 for Efficient-vDiT: Efficient Video Diffusion Transformers With Attention Tile
Viaarxiv icon

Sparse VideoGen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity

Add code
Feb 03, 2025
Figure 1 for Sparse VideoGen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity
Figure 2 for Sparse VideoGen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity
Figure 3 for Sparse VideoGen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity
Figure 4 for Sparse VideoGen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity
Viaarxiv icon

Locality-aware Fair Scheduling in LLM Serving

Add code
Jan 24, 2025
Figure 1 for Locality-aware Fair Scheduling in LLM Serving
Figure 2 for Locality-aware Fair Scheduling in LLM Serving
Figure 3 for Locality-aware Fair Scheduling in LLM Serving
Figure 4 for Locality-aware Fair Scheduling in LLM Serving
Viaarxiv icon

NVILA: Efficient Frontier Visual Language Models

Add code
Dec 05, 2024
Figure 1 for NVILA: Efficient Frontier Visual Language Models
Figure 2 for NVILA: Efficient Frontier Visual Language Models
Figure 3 for NVILA: Efficient Frontier Visual Language Models
Figure 4 for NVILA: Efficient Frontier Visual Language Models
Viaarxiv icon

VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation

Add code
Sep 06, 2024
Figure 1 for VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation
Figure 2 for VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation
Figure 3 for VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation
Figure 4 for VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation
Viaarxiv icon