Picture for Chao Xue

Chao Xue

Parameter Importance is Not Static: Evolving Parameter Isolation for Supervised Fine-Tuning

Add code
Apr 15, 2026
Viaarxiv icon

Why Supervised Fine-Tuning Fails to Learn: A Systematic Study of Incomplete Learning in Large Language Models

Add code
Apr 11, 2026
Viaarxiv icon

Reason Only When Needed: Efficient Generative Reward Modeling via Model-Internal Uncertainty

Add code
Apr 11, 2026
Viaarxiv icon

JoyAI-LLM Flash: Advancing Mid-Scale LLMs with Token Efficiency

Add code
Apr 03, 2026
Viaarxiv icon

Fibration Policy Optimization

Add code
Mar 09, 2026
Viaarxiv icon

Reinforcement Learning Enhanced Multi-hop Reasoning for Temporal Knowledge Question Answering

Add code
Jan 03, 2026
Viaarxiv icon

Efficient Reasoning via Thought-Training and Thought-Free Inference

Add code
Nov 05, 2025
Figure 1 for Efficient Reasoning via Thought-Training and Thought-Free Inference
Figure 2 for Efficient Reasoning via Thought-Training and Thought-Free Inference
Figure 3 for Efficient Reasoning via Thought-Training and Thought-Free Inference
Figure 4 for Efficient Reasoning via Thought-Training and Thought-Free Inference
Viaarxiv icon

MotionFlux: Efficient Text-Guided Motion Generation through Rectified Flow Matching and Preference Alignment

Add code
Aug 27, 2025
Figure 1 for MotionFlux: Efficient Text-Guided Motion Generation through Rectified Flow Matching and Preference Alignment
Figure 2 for MotionFlux: Efficient Text-Guided Motion Generation through Rectified Flow Matching and Preference Alignment
Figure 3 for MotionFlux: Efficient Text-Guided Motion Generation through Rectified Flow Matching and Preference Alignment
Figure 4 for MotionFlux: Efficient Text-Guided Motion Generation through Rectified Flow Matching and Preference Alignment
Viaarxiv icon

ChartMaster: Advancing Chart-to-Code Generation with Real-World Charts and Chart Similarity Reinforcement Learning

Add code
Aug 25, 2025
Figure 1 for ChartMaster: Advancing Chart-to-Code Generation with Real-World Charts and Chart Similarity Reinforcement Learning
Figure 2 for ChartMaster: Advancing Chart-to-Code Generation with Real-World Charts and Chart Similarity Reinforcement Learning
Figure 3 for ChartMaster: Advancing Chart-to-Code Generation with Real-World Charts and Chart Similarity Reinforcement Learning
Figure 4 for ChartMaster: Advancing Chart-to-Code Generation with Real-World Charts and Chart Similarity Reinforcement Learning
Viaarxiv icon

Learning Temporal Abstractions via Variational Homomorphisms in Option-Induced Abstract MDPs

Add code
Jul 24, 2025
Figure 1 for Learning Temporal Abstractions via Variational Homomorphisms in Option-Induced Abstract MDPs
Figure 2 for Learning Temporal Abstractions via Variational Homomorphisms in Option-Induced Abstract MDPs
Figure 3 for Learning Temporal Abstractions via Variational Homomorphisms in Option-Induced Abstract MDPs
Figure 4 for Learning Temporal Abstractions via Variational Homomorphisms in Option-Induced Abstract MDPs
Viaarxiv icon