Pierre-Yves Oudeyer

Inria FLOWERS team, Talence, France

Automatic Generation of Question Hints for Mathematics Problems using Large Language Models in Educational Technology

Nov 05, 2024

Reinforcement Learning for Aligning Large Language Models Agents with Interactive Environments: Quantifying and Mitigating Prompt Overfitting

Oct 29, 2024

SAC-GLAM: Improving Online RL for LLM agents with Soft Actor-Critic and Hindsight Relabeling

Oct 16, 2024

Assessing Contamination in Large Language Models: Introducing the LogProber method

Aug 26, 2024

Collective Innovation in Groups of Large Language Models

Jul 07, 2024

When LLMs Play the Telephone Game: Cumulative Changes and Attractors in Iterated Cultural Transmissions

Jul 05, 2024

Inferring the Phylogeny of Large Language Models and Predicting their Performances in Benchmarks

Apr 06, 2024

Cultural evolution in populations of Large Language Models

Mar 13, 2024

Stick to your Role! Stability of Personal Values Expressed in Large Language Models

Feb 19, 2024

Discovering Sensorimotor Agency in Cellular Automata using Diversity Search

Feb 14, 2024