Picture for Sijia Luo

Sijia Luo

Sparse-RL: Breaking the Memory Wall in LLM Reinforcement Learning via Stable Sparse Rollouts

Add code
Jan 15, 2026
Viaarxiv icon

CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis

Add code
Jan 03, 2025
Figure 1 for CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis
Figure 2 for CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis
Figure 3 for CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis
Figure 4 for CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis
Viaarxiv icon

Dynamic Scaling of Unit Tests for Code Reward Modeling

Add code
Jan 02, 2025
Figure 1 for Dynamic Scaling of Unit Tests for Code Reward Modeling
Figure 2 for Dynamic Scaling of Unit Tests for Code Reward Modeling
Figure 3 for Dynamic Scaling of Unit Tests for Code Reward Modeling
Figure 4 for Dynamic Scaling of Unit Tests for Code Reward Modeling
Viaarxiv icon

SpreadsheetBench: Towards Challenging Real World Spreadsheet Manipulation

Add code
Jun 21, 2024
Figure 1 for SpreadsheetBench: Towards Challenging Real World Spreadsheet Manipulation
Figure 2 for SpreadsheetBench: Towards Challenging Real World Spreadsheet Manipulation
Figure 3 for SpreadsheetBench: Towards Challenging Real World Spreadsheet Manipulation
Figure 4 for SpreadsheetBench: Towards Challenging Real World Spreadsheet Manipulation
Viaarxiv icon