Picture for Jianfeng Gao

Jianfeng Gao

EJ

Dyna-Mind: Learning to Simulate from Experience for Better AI Agents

Add code
Oct 10, 2025
Viaarxiv icon

FlowRL: Matching Reward Distributions for LLM Reasoning

Add code
Sep 18, 2025
Viaarxiv icon

SAS: Simulated Attention Score

Add code
Jul 10, 2025
Viaarxiv icon

Decoder-Hybrid-Decoder Architecture for Efficient Reasoning with Long Generation

Add code
Jul 09, 2025
Figure 1 for Decoder-Hybrid-Decoder Architecture for Efficient Reasoning with Long Generation
Figure 2 for Decoder-Hybrid-Decoder Architecture for Efficient Reasoning with Long Generation
Figure 3 for Decoder-Hybrid-Decoder Architecture for Efficient Reasoning with Long Generation
Figure 4 for Decoder-Hybrid-Decoder Architecture for Efficient Reasoning with Long Generation
Viaarxiv icon

Training Language Models to Generate Quality Code with Program Analysis Feedback

Add code
May 28, 2025
Viaarxiv icon

Text Generation Beyond Discrete Token Sampling

Add code
May 20, 2025
Viaarxiv icon

EfficientLLM: Efficiency in Large Language Models

Add code
May 20, 2025
Viaarxiv icon

SITE: towards Spatial Intelligence Thorough Evaluation

Add code
May 08, 2025
Viaarxiv icon

Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math

Add code
Apr 30, 2025
Viaarxiv icon

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Add code
Apr 29, 2025
Viaarxiv icon