Picture for Zili Wang

Zili Wang

Blending Supervised and Reinforcement Fine-Tuning with Prefix Sampling

Add code
Jul 02, 2025
Viaarxiv icon

Can Mixture-of-Experts Surpass Dense LLMs Under Strictly Equal Resources?

Add code
Jun 13, 2025
Viaarxiv icon

Farseer: A Refined Scaling Law in Large Language Models

Add code
Jun 12, 2025
Viaarxiv icon

Faster and Better LLMs via Latency-Aware Test-Time Scaling

Add code
May 26, 2025
Viaarxiv icon

Predictable Scale: Part I -- Optimal Hyperparameter Scaling Law in Large Language Model Pretraining

Add code
Mar 06, 2025
Viaarxiv icon

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Add code
Feb 20, 2025
Viaarxiv icon

Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs

Add code
Feb 18, 2025
Viaarxiv icon

Multi-matrix Factorization Attention

Add code
Dec 26, 2024
Figure 1 for Multi-matrix Factorization Attention
Figure 2 for Multi-matrix Factorization Attention
Figure 3 for Multi-matrix Factorization Attention
Figure 4 for Multi-matrix Factorization Attention
Viaarxiv icon

Continuous Speculative Decoding for Autoregressive Image Generation

Add code
Nov 18, 2024
Viaarxiv icon

OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models

Add code
Nov 07, 2024
Figure 1 for OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models
Figure 2 for OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models
Figure 3 for OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models
Figure 4 for OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models
Viaarxiv icon