Nevena Lazic

Frontier LLMs Still Struggle with Simple Reasoning Tasks

Jul 09, 2025

Achieving Human Level Competitive Robot Table Tennis

Aug 07, 2024

Robotic Table Tennis: A Case Study into a High Speed Learning System

Sep 06, 2023

Towards practical reinforcement learning for tokamak magnetic control

Jul 21, 2023

Sample Efficient Deep Reinforcement Learning via Local Planning

Jan 29, 2023

A New Look at Dynamic Regret for Non-Stationary Stochastic Bandits

Jan 17, 2022

Improved Regret Bound and Experience Replay in Regularized Policy Iteration

Feb 25, 2021

Neural Rate Control for Video Encoding using Imitation Learning

Dec 09, 2020

A maximum-entropy approach to off-policy evaluation in average-reward MDPs

Jun 17, 2020

Robotic Table Tennis with Model-Free Reinforcement Learning

Mar 31, 2020