Picture for Xiaowei Lv

Xiaowei Lv

DecisionLLM: Large Language Models for Long Sequence Decision Exploration

Add code
Jan 15, 2026
Viaarxiv icon

360-LLaMA-Factory: Plug & Play Sequence Parallelism for Long Post-Training

Add code
May 28, 2025
Viaarxiv icon

Light-R1: Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond

Add code
Mar 13, 2025
Viaarxiv icon