Picture for Guillaume Desjardins

Guillaume Desjardins

Imitating Language via Scalable Inverse Reinforcement Learning

Add code
Sep 02, 2024
Figure 1 for Imitating Language via Scalable Inverse Reinforcement Learning
Figure 2 for Imitating Language via Scalable Inverse Reinforcement Learning
Figure 3 for Imitating Language via Scalable Inverse Reinforcement Learning
Figure 4 for Imitating Language via Scalable Inverse Reinforcement Learning
Viaarxiv icon

RecurrentGemma: Moving Past Transformers for Efficient Open Language Models

Add code
Apr 11, 2024
Figure 1 for RecurrentGemma: Moving Past Transformers for Efficient Open Language Models
Figure 2 for RecurrentGemma: Moving Past Transformers for Efficient Open Language Models
Figure 3 for RecurrentGemma: Moving Past Transformers for Efficient Open Language Models
Figure 4 for RecurrentGemma: Moving Past Transformers for Efficient Open Language Models
Viaarxiv icon

Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models

Add code
Feb 29, 2024
Viaarxiv icon

Gemini: A Family of Highly Capable Multimodal Models

Add code
Dec 19, 2023
Viaarxiv icon

APART: Diverse Skill Discovery using All Pairs with Ascending Reward and DropouT

Add code
Aug 24, 2023
Viaarxiv icon

Rapid training of deep neural networks without skip connections or normalization layers using Deep Kernel Shaping

Add code
Oct 05, 2021
Viaarxiv icon

Reward is enough for convex MDPs

Add code
Jun 01, 2021
Figure 1 for Reward is enough for convex MDPs
Figure 2 for Reward is enough for convex MDPs
Viaarxiv icon

Behavior Priors for Efficient Reinforcement Learning

Add code
Oct 27, 2020
Figure 1 for Behavior Priors for Efficient Reinforcement Learning
Figure 2 for Behavior Priors for Efficient Reinforcement Learning
Figure 3 for Behavior Priors for Efficient Reinforcement Learning
Figure 4 for Behavior Priors for Efficient Reinforcement Learning
Viaarxiv icon

Importance Weighted Policy Learning and Adaption

Add code
Sep 10, 2020
Figure 1 for Importance Weighted Policy Learning and Adaption
Figure 2 for Importance Weighted Policy Learning and Adaption
Figure 3 for Importance Weighted Policy Learning and Adaption
Figure 4 for Importance Weighted Policy Learning and Adaption
Viaarxiv icon

Information asymmetry in KL-regularized RL

Add code
May 03, 2019
Figure 1 for Information asymmetry in KL-regularized RL
Figure 2 for Information asymmetry in KL-regularized RL
Figure 3 for Information asymmetry in KL-regularized RL
Figure 4 for Information asymmetry in KL-regularized RL
Viaarxiv icon