Picture for Mingyang Song

Mingyang Song

HoPE: Hyperbolic Rotary Positional Encoding for Stable Long-Range Dependency Modeling in Large Language Models

Add code
Sep 05, 2025
Viaarxiv icon

Hunyuan-MT Technical Report

Add code
Sep 05, 2025
Viaarxiv icon

Spline Deformation Field

Add code
Jul 10, 2025
Viaarxiv icon

Walk Before You Run! Concise LLM Reasoning via Reinforcement Learning

Add code
May 27, 2025
Viaarxiv icon

TAT-R1: Terminology-Aware Translation with Reinforcement Learning and Word Alignment

Add code
May 27, 2025
Viaarxiv icon

SSR-Zero: Simple Self-Rewarding Reinforcement Learning for Machine Translation

Add code
May 22, 2025
Viaarxiv icon

OpenThinkIMG: Learning to Think with Images via Visual Tool Reinforcement Learning

Add code
May 13, 2025
Figure 1 for OpenThinkIMG: Learning to Think with Images via Visual Tool Reinforcement Learning
Figure 2 for OpenThinkIMG: Learning to Think with Images via Visual Tool Reinforcement Learning
Figure 3 for OpenThinkIMG: Learning to Think with Images via Visual Tool Reinforcement Learning
Figure 4 for OpenThinkIMG: Learning to Think with Images via Visual Tool Reinforcement Learning
Viaarxiv icon

FastCuRL: Curriculum Reinforcement Learning with Progressive Context Extension for Efficient Training R1-like Reasoning Models

Add code
Mar 21, 2025
Viaarxiv icon

From Head to Tail: Towards Balanced Representation in Large Vision-Language Models through Adaptive Data Calibration

Add code
Mar 17, 2025
Viaarxiv icon

GRP: Goal-Reversed Prompting for Zero-Shot Evaluation with LLMs

Add code
Mar 08, 2025
Viaarxiv icon