Shaohan Huang

BitNet Text Embeddings

Jun 24, 2026

Group-Graph Policy Optimization for Long-Horizon Agentic Reinforcement Learning

Jun 22, 2026

Mind the Gap: Can Frontier LLMs Pass a Standardized Office Proficiency Exam?

Jun 09, 2026

SPPO: Sequence-Level PPO for Long-Horizon Reasoning Tasks

Apr 10, 2026

Universal YOCO for Efficient Depth Scaling

Apr 01, 2026

Online Experiential Learning for Language Models

Mar 17, 2026

Scaling Data Difficulty: Improving Coding Models via Reinforcement Learning on Fresh and Challenging Problems

Mar 08, 2026

Breaking Training Bottlenecks: Effective and Stable Reinforcement Learning for Coding Models

Mar 08, 2026

Sparse-BitNet: 1.58-bit LLMs are Naturally Friendly to Semi-Structured Sparsity

Mar 05, 2026

SlideSparse: Fast and Flexible (2N-2):2N Structured Sparsity

Mar 05, 2026