Junfeng Ran

TinyR1-32B-Preview: Boosting Accuracy with Branch-Merge Distillation

Mar 06, 2025
Via arXiv

Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision

Feb 28, 2025
Via arXiv

LongAttn: Selecting Long-context Training Data via Token-level Attention

Feb 24, 2025
Via arXiv