Picture for Shouren Wang

Shouren Wang

When Domains Interact: Asymmetric and Order-Sensitive Cross-Domain Effects in Reinforcement Learning for Reasoning

Add code
Feb 01, 2026
Viaarxiv icon

Mid-Think: Training-Free Intermediate-Budget Reasoning via Token-Level Triggers

Add code
Jan 11, 2026
Viaarxiv icon

Demystifying Hybrid Thinking: Can LLMs Truly Switch Between Think and No-Think?

Add code
Oct 14, 2025
Figure 1 for Demystifying Hybrid Thinking: Can LLMs Truly Switch Between Think and No-Think?
Figure 2 for Demystifying Hybrid Thinking: Can LLMs Truly Switch Between Think and No-Think?
Figure 3 for Demystifying Hybrid Thinking: Can LLMs Truly Switch Between Think and No-Think?
Figure 4 for Demystifying Hybrid Thinking: Can LLMs Truly Switch Between Think and No-Think?
Viaarxiv icon

Enhancing Player Enjoyment with a Two-Tier DRL and LLM-Based Agent System for Fighting Games

Add code
Apr 10, 2025
Viaarxiv icon