Picture for Aske Plaat

Aske Plaat

A Unified Framework for Zero-Shot Reinforcement Learning

Add code
Oct 23, 2025
Viaarxiv icon

Analysis of Bluffing by DQN and CFR in Leduc Hold'em Poker

Add code
Sep 04, 2025
Figure 1 for Analysis of Bluffing by DQN and CFR in Leduc Hold'em Poker
Figure 2 for Analysis of Bluffing by DQN and CFR in Leduc Hold'em Poker
Figure 3 for Analysis of Bluffing by DQN and CFR in Leduc Hold'em Poker
Figure 4 for Analysis of Bluffing by DQN and CFR in Leduc Hold'em Poker
Viaarxiv icon

Pluri-perspectivism in Human-robot Co-creativity with Older Adults

Add code
Jul 10, 2025
Figure 1 for Pluri-perspectivism in Human-robot Co-creativity with Older Adults
Figure 2 for Pluri-perspectivism in Human-robot Co-creativity with Older Adults
Figure 3 for Pluri-perspectivism in Human-robot Co-creativity with Older Adults
Figure 4 for Pluri-perspectivism in Human-robot Co-creativity with Older Adults
Viaarxiv icon

Chargax: A JAX Accelerated EV Charging Simulator

Add code
Jul 02, 2025
Viaarxiv icon

Baba is LLM: Reasoning in a Game with Dynamic Rules

Add code
Jun 23, 2025
Viaarxiv icon

Towards a Deeper Understanding of Reasoning Capabilities in Large Language Models

Add code
May 15, 2025
Figure 1 for Towards a Deeper Understanding of Reasoning Capabilities in Large Language Models
Figure 2 for Towards a Deeper Understanding of Reasoning Capabilities in Large Language Models
Figure 3 for Towards a Deeper Understanding of Reasoning Capabilities in Large Language Models
Figure 4 for Towards a Deeper Understanding of Reasoning Capabilities in Large Language Models
Viaarxiv icon

Agentic Large Language Models, a survey

Add code
Mar 29, 2025
Figure 1 for Agentic Large Language Models, a survey
Figure 2 for Agentic Large Language Models, a survey
Figure 3 for Agentic Large Language Models, a survey
Figure 4 for Agentic Large Language Models, a survey
Viaarxiv icon

ACTIVA: Amortized Causal Effect Estimation without Graphs via Transformer-based Variational Autoencoder

Add code
Mar 03, 2025
Figure 1 for ACTIVA: Amortized Causal Effect Estimation without Graphs via Transformer-based Variational Autoencoder
Figure 2 for ACTIVA: Amortized Causal Effect Estimation without Graphs via Transformer-based Variational Autoencoder
Figure 3 for ACTIVA: Amortized Causal Effect Estimation without Graphs via Transformer-based Variational Autoencoder
Figure 4 for ACTIVA: Amortized Causal Effect Estimation without Graphs via Transformer-based Variational Autoencoder
Viaarxiv icon

EconoJax: A Fast & Scalable Economic Simulation in Jax

Add code
Oct 29, 2024
Figure 1 for EconoJax: A Fast & Scalable Economic Simulation in Jax
Figure 2 for EconoJax: A Fast & Scalable Economic Simulation in Jax
Figure 3 for EconoJax: A Fast & Scalable Economic Simulation in Jax
Figure 4 for EconoJax: A Fast & Scalable Economic Simulation in Jax
Viaarxiv icon

World Models Increase Autonomy in Reinforcement Learning

Add code
Aug 20, 2024
Viaarxiv icon