Picture for Martha White

Martha White

Deep Policy Gradient Methods Without Batch Updates, Target Networks, or Replay Buffers

Add code
Nov 22, 2024
Viaarxiv icon

Real-Time Recurrent Learning using Trace Units in Reinforcement Learning

Add code
Sep 02, 2024
Viaarxiv icon

q-exponential family for policy optimization

Add code
Aug 14, 2024
Viaarxiv icon

The Cross-environment Hyperparameter Setting Benchmark for Reinforcement Learning

Add code
Jul 26, 2024
Viaarxiv icon

Investigating the Interplay of Prioritized Replay and Generalization

Add code
Jul 12, 2024
Viaarxiv icon

Position: Benchmarking is Limited in Reinforcement Learning Research

Add code
Jun 23, 2024
Viaarxiv icon

Demystifying the Recency Heuristic in Temporal-Difference Learning

Add code
Jun 18, 2024
Figure 1 for Demystifying the Recency Heuristic in Temporal-Difference Learning
Figure 2 for Demystifying the Recency Heuristic in Temporal-Difference Learning
Figure 3 for Demystifying the Recency Heuristic in Temporal-Difference Learning
Figure 4 for Demystifying the Recency Heuristic in Temporal-Difference Learning
Viaarxiv icon

A New View on Planning in Online Reinforcement Learning

Add code
Jun 03, 2024
Viaarxiv icon

Tuning for the Unknown: Revisiting Evaluation Strategies for Lifelong RL

Add code
Apr 02, 2024
Viaarxiv icon

Investigating the Histogram Loss in Regression

Add code
Feb 20, 2024
Viaarxiv icon