Picture for Albert Thomas

Albert Thomas

LTCI

Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level

Add code
Nov 05, 2024
Figure 1 for Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level
Figure 2 for Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level
Figure 3 for Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level
Figure 4 for Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level
Viaarxiv icon

Zero-shot Model-based Reinforcement Learning using Large Language Models

Add code
Oct 15, 2024
Viaarxiv icon

Differentially Private Model-Based Offline Reinforcement Learning

Add code
Feb 08, 2024
Viaarxiv icon

A Multi-step Loss Function for Robust Learning of the Dynamics in Model-based Reinforcement Learning

Add code
Feb 05, 2024
Viaarxiv icon

Deep autoregressive density nets vs neural ensembles for model-based offline reinforcement learning

Add code
Feb 05, 2024
Figure 1 for Deep autoregressive density nets vs neural ensembles for model-based offline reinforcement learning
Figure 2 for Deep autoregressive density nets vs neural ensembles for model-based offline reinforcement learning
Figure 3 for Deep autoregressive density nets vs neural ensembles for model-based offline reinforcement learning
Figure 4 for Deep autoregressive density nets vs neural ensembles for model-based offline reinforcement learning
Viaarxiv icon

Multi-timestep models for Model-based Reinforcement Learning

Add code
Oct 11, 2023
Viaarxiv icon

Guided Safe Shooting: model based reinforcement learning with safety constraints

Add code
Jun 20, 2022
Figure 1 for Guided Safe Shooting: model based reinforcement learning with safety constraints
Figure 2 for Guided Safe Shooting: model based reinforcement learning with safety constraints
Figure 3 for Guided Safe Shooting: model based reinforcement learning with safety constraints
Figure 4 for Guided Safe Shooting: model based reinforcement learning with safety constraints
Viaarxiv icon

An $α$-No-Regret Algorithm For Graphical Bilinear Bandits

Add code
Jun 01, 2022
Figure 1 for An $α$-No-Regret Algorithm For Graphical Bilinear Bandits
Figure 2 for An $α$-No-Regret Algorithm For Graphical Bilinear Bandits
Figure 3 for An $α$-No-Regret Algorithm For Graphical Bilinear Bandits
Viaarxiv icon

Model-based micro-data reinforcement learning: what are the crucial model properties and which model to choose?

Add code
Jul 24, 2021
Figure 1 for Model-based micro-data reinforcement learning: what are the crucial model properties and which model to choose?
Figure 2 for Model-based micro-data reinforcement learning: what are the crucial model properties and which model to choose?
Figure 3 for Model-based micro-data reinforcement learning: what are the crucial model properties and which model to choose?
Figure 4 for Model-based micro-data reinforcement learning: what are the crucial model properties and which model to choose?
Viaarxiv icon

Refined bounds for randomized experimental design

Add code
Dec 22, 2020
Viaarxiv icon