Zili Wang

CodeSimpleQA: Scaling Factuality in Code Large Language Models

Dec 22, 2025

MVU-Eval: Towards Multi-Video Understanding Evaluation for Multimodal LLMs

Nov 13, 2025

Neurosymbolic Feature Extraction for Identifying Forced Labor in Supply Chains

Jul 09, 2025

Blending Supervised and Reinforcement Fine-Tuning with Prefix Sampling

Jul 02, 2025

Can Mixture-of-Experts Surpass Dense LLMs Under Strictly Equal Resources?

Jun 13, 2025

Farseer: A Refined Scaling Law in Large Language Models

Jun 12, 2025

Faster and Better LLMs via Latency-Aware Test-Time Scaling

May 26, 2025

Predictable Scale: Part I -- Optimal Hyperparameter Scaling Law in Large Language Model Pretraining

Mar 06, 2025

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Feb 20, 2025

Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs

Feb 18, 2025