Picture for John Langford

John Langford

Editors

Learning to Achieve Goals with Belief State Transformers

Add code
Oct 30, 2024
Figure 1 for Learning to Achieve Goals with Belief State Transformers
Figure 2 for Learning to Achieve Goals with Belief State Transformers
Figure 3 for Learning to Achieve Goals with Belief State Transformers
Figure 4 for Learning to Achieve Goals with Belief State Transformers
Viaarxiv icon

EnsemW2S: Can an Ensemble of LLMs be Leveraged to Obtain a Stronger LLM?

Add code
Oct 06, 2024
Viaarxiv icon

Easy2Hard-Bench: Standardized Difficulty Labels for Profiling LLM Performance and Generalization

Add code
Sep 27, 2024
Figure 1 for Easy2Hard-Bench: Standardized Difficulty Labels for Profiling LLM Performance and Generalization
Figure 2 for Easy2Hard-Bench: Standardized Difficulty Labels for Profiling LLM Performance and Generalization
Figure 3 for Easy2Hard-Bench: Standardized Difficulty Labels for Profiling LLM Performance and Generalization
Figure 4 for Easy2Hard-Bench: Standardized Difficulty Labels for Profiling LLM Performance and Generalization
Viaarxiv icon

Towards Principled Representation Learning from Videos for Reinforcement Learning

Add code
Mar 20, 2024
Figure 1 for Towards Principled Representation Learning from Videos for Reinforcement Learning
Figure 2 for Towards Principled Representation Learning from Videos for Reinforcement Learning
Figure 3 for Towards Principled Representation Learning from Videos for Reinforcement Learning
Figure 4 for Towards Principled Representation Learning from Videos for Reinforcement Learning
Viaarxiv icon

Position Paper: Agent AI Towards a Holistic Intelligence

Add code
Feb 28, 2024
Viaarxiv icon

Premier-TACO is a Few-Shot Policy Learner: Pretraining Multitask Representation via Temporal Action-Driven Contrastive Loss

Add code
Feb 13, 2024
Figure 1 for Premier-TACO is a Few-Shot Policy Learner: Pretraining Multitask Representation via Temporal Action-Driven Contrastive Loss
Figure 2 for Premier-TACO is a Few-Shot Policy Learner: Pretraining Multitask Representation via Temporal Action-Driven Contrastive Loss
Figure 3 for Premier-TACO is a Few-Shot Policy Learner: Pretraining Multitask Representation via Temporal Action-Driven Contrastive Loss
Figure 4 for Premier-TACO is a Few-Shot Policy Learner: Pretraining Multitask Representation via Temporal Action-Driven Contrastive Loss
Viaarxiv icon

PcLast: Discovering Plannable Continuous Latent States

Add code
Nov 06, 2023
Viaarxiv icon

Streaming Active Learning with Deep Neural Networks

Add code
Mar 05, 2023
Viaarxiv icon

Towards Data-Driven Offline Simulations for Online Reinforcement Learning

Add code
Nov 14, 2022
Viaarxiv icon

Agent-Controller Representations: Principled Offline RL with Rich Exogenous Information

Add code
Oct 31, 2022
Viaarxiv icon