Picture for Mao Zheng

Mao Zheng

PodBench: A Comprehensive Benchmark for Instruction-Aware Audio-Oriented Podcast Script Generation

Add code
Jan 21, 2026
Viaarxiv icon

CodeDelegator: Mitigating Context Pollution via Role Separation in Code-as-Action Agents

Add code
Jan 21, 2026
Viaarxiv icon

HY-MT1.5 Technical Report

Add code
Dec 30, 2025
Viaarxiv icon

Hunyuan-MT Technical Report

Add code
Sep 05, 2025
Viaarxiv icon

Walk Before You Run! Concise LLM Reasoning via Reinforcement Learning

Add code
May 27, 2025
Figure 1 for Walk Before You Run! Concise LLM Reasoning via Reinforcement Learning
Figure 2 for Walk Before You Run! Concise LLM Reasoning via Reinforcement Learning
Figure 3 for Walk Before You Run! Concise LLM Reasoning via Reinforcement Learning
Figure 4 for Walk Before You Run! Concise LLM Reasoning via Reinforcement Learning
Viaarxiv icon

TAT-R1: Terminology-Aware Translation with Reinforcement Learning and Word Alignment

Add code
May 27, 2025
Viaarxiv icon

SSR-Zero: Simple Self-Rewarding Reinforcement Learning for Machine Translation

Add code
May 22, 2025
Viaarxiv icon

Hunyuan-TurboS: Advancing Large Language Models through Mamba-Transformer Synergy and Adaptive Chain-of-Thought

Add code
May 21, 2025
Viaarxiv icon

FastCuRL: Curriculum Reinforcement Learning with Progressive Context Extension for Efficient Training R1-like Reasoning Models

Add code
Mar 21, 2025
Viaarxiv icon

GRP: Goal-Reversed Prompting for Zero-Shot Evaluation with LLMs

Add code
Mar 08, 2025
Viaarxiv icon