Picture for Dibya Ghosh

Dibya Ghosh

What Do Learning Dynamics Reveal About Generalization in LLM Reasoning?

Add code
Nov 12, 2024
Figure 1 for What Do Learning Dynamics Reveal About Generalization in LLM Reasoning?
Figure 2 for What Do Learning Dynamics Reveal About Generalization in LLM Reasoning?
Figure 3 for What Do Learning Dynamics Reveal About Generalization in LLM Reasoning?
Figure 4 for What Do Learning Dynamics Reveal About Generalization in LLM Reasoning?
Viaarxiv icon

Octo: An Open-Source Generalist Robot Policy

Add code
May 20, 2024
Viaarxiv icon

Accelerating Exploration with Unlabeled Prior Data

Add code
Nov 21, 2023
Viaarxiv icon

Robotic Offline RL from Internet Videos via Value-Function Pre-Training

Add code
Sep 22, 2023
Viaarxiv icon

HIQL: Offline Goal-Conditioned RL with Latent States as Actions

Add code
Jul 22, 2023
Viaarxiv icon

Reinforcement Learning from Passive Data via Latent Intentions

Add code
Apr 10, 2023
Viaarxiv icon

Distributionally Adaptive Meta Reinforcement Learning

Add code
Oct 06, 2022
Figure 1 for Distributionally Adaptive Meta Reinforcement Learning
Figure 2 for Distributionally Adaptive Meta Reinforcement Learning
Figure 3 for Distributionally Adaptive Meta Reinforcement Learning
Figure 4 for Distributionally Adaptive Meta Reinforcement Learning
Viaarxiv icon

Offline RL Policies Should be Trained to be Adaptive

Add code
Jul 05, 2022
Figure 1 for Offline RL Policies Should be Trained to be Adaptive
Figure 2 for Offline RL Policies Should be Trained to be Adaptive
Figure 3 for Offline RL Policies Should be Trained to be Adaptive
Figure 4 for Offline RL Policies Should be Trained to be Adaptive
Viaarxiv icon

Why Generalization in RL is Difficult: Epistemic POMDPs and Implicit Partial Observability

Add code
Jul 13, 2021
Figure 1 for Why Generalization in RL is Difficult: Epistemic POMDPs and Implicit Partial Observability
Figure 2 for Why Generalization in RL is Difficult: Epistemic POMDPs and Implicit Partial Observability
Figure 3 for Why Generalization in RL is Difficult: Epistemic POMDPs and Implicit Partial Observability
Figure 4 for Why Generalization in RL is Difficult: Epistemic POMDPs and Implicit Partial Observability
Viaarxiv icon

Implicit Under-Parameterization Inhibits Data-Efficient Deep Reinforcement Learning

Add code
Oct 27, 2020
Figure 1 for Implicit Under-Parameterization Inhibits Data-Efficient Deep Reinforcement Learning
Figure 2 for Implicit Under-Parameterization Inhibits Data-Efficient Deep Reinforcement Learning
Figure 3 for Implicit Under-Parameterization Inhibits Data-Efficient Deep Reinforcement Learning
Figure 4 for Implicit Under-Parameterization Inhibits Data-Efficient Deep Reinforcement Learning
Viaarxiv icon