Picture for Gabriel Synnaeve

Gabriel Synnaeve

Jack

Optimizing Language Models for Inference Time Objectives using Reinforcement Learning

Add code
Mar 25, 2025
Viaarxiv icon

BigO(Bench) -- Can LLMs Generate Code with Controlled Time and Space Complexity?

Add code
Mar 20, 2025
Viaarxiv icon

The KoLMogorov Test: Compression by Code Generation

Add code
Mar 18, 2025
Viaarxiv icon

Soft Policy Optimization: Online Off-Policy RL for Sequence Models

Add code
Mar 07, 2025
Figure 1 for Soft Policy Optimization: Online Off-Policy RL for Sequence Models
Figure 2 for Soft Policy Optimization: Online Off-Policy RL for Sequence Models
Viaarxiv icon

SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution

Add code
Feb 25, 2025
Viaarxiv icon

Don't Transform the Code, Code the Transforms: Towards Precise Code Rewriting using LLMs

Add code
Oct 11, 2024
Figure 1 for Don't Transform the Code, Code the Transforms: Towards Precise Code Rewriting using LLMs
Figure 2 for Don't Transform the Code, Code the Transforms: Towards Precise Code Rewriting using LLMs
Figure 3 for Don't Transform the Code, Code the Transforms: Towards Precise Code Rewriting using LLMs
Figure 4 for Don't Transform the Code, Code the Transforms: Towards Precise Code Rewriting using LLMs
Viaarxiv icon

What Makes Large Language Models Reason in (Multi-Turn) Code Generation?

Add code
Oct 10, 2024
Figure 1 for What Makes Large Language Models Reason in (Multi-Turn) Code Generation?
Figure 2 for What Makes Large Language Models Reason in (Multi-Turn) Code Generation?
Figure 3 for What Makes Large Language Models Reason in (Multi-Turn) Code Generation?
Figure 4 for What Makes Large Language Models Reason in (Multi-Turn) Code Generation?
Viaarxiv icon

SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains?

Add code
Oct 04, 2024
Figure 1 for SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains?
Figure 2 for SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains?
Figure 3 for SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains?
Figure 4 for SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains?
Viaarxiv icon

RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning

Add code
Oct 02, 2024
Figure 1 for RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning
Figure 2 for RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning
Figure 3 for RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning
Figure 4 for RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning
Viaarxiv icon

The Llama 3 Herd of Models

Add code
Jul 31, 2024
Viaarxiv icon