Hanting Chen

and Other Contributors

Top 10 Open Challenges Steering the Future of Diffusion Language Model and Its Variants

Jan 20, 2026
Via arXiv

Deferred Commitment Decoding for Diffusion Language Models with Confidence-Aware Sliding Windows

Jan 05, 2026
Via arXiv

MoRAgent: Parameter Efficient Agent Tuning with Mixture-of-Roles

Dec 25, 2025
Via arXiv

Towards Efficient Agents: A Co-Design of Inference Architecture and System

Dec 20, 2025
Via arXiv

Step by Step Network

Nov 18, 2025
Via arXiv

Revealing the Power of Post-Training for Small Language Models via Knowledge Distillation

Sep 30, 2025
Via arXiv

Rethinking 1-bit Optimization Leveraging Pre-trained Large Language Models

Aug 09, 2025
Via arXiv

Pangu DeepDiver: Adaptive Search Intensity Scaling via Open-Web Reinforcement Learning

May 30, 2025
Via arXiv

Pangu Embedded: An Efficient Dual-system LLM Reasoner with Metacognition

May 29, 2025
Via arXiv

Pangu Pro MoE: Mixture of Grouped Experts for Efficient Sparsity

May 28, 2025
Via arXiv