Picture for Cosmin Paduraru

Cosmin Paduraru

Gemini: A Family of Highly Capable Multimodal Models

Add code
Dec 19, 2023
Viaarxiv icon

Towards practical reinforcement learning for tokamak magnetic control

Add code
Jul 21, 2023
Figure 1 for Towards practical reinforcement learning for tokamak magnetic control
Figure 2 for Towards practical reinforcement learning for tokamak magnetic control
Figure 3 for Towards practical reinforcement learning for tokamak magnetic control
Figure 4 for Towards practical reinforcement learning for tokamak magnetic control
Viaarxiv icon

Optimizing Memory Mapping Using Deep Reinforcement Learning

Add code
May 11, 2023
Viaarxiv icon

Transformers Meet Directed Graphs

Add code
Jan 31, 2023
Viaarxiv icon

Controlling Commercial Cooling Systems Using Reinforcement Learning

Add code
Nov 11, 2022
Viaarxiv icon

Optimizing Industrial HVAC Systems with Hierarchical Reinforcement Learning

Add code
Sep 16, 2022
Figure 1 for Optimizing Industrial HVAC Systems with Hierarchical Reinforcement Learning
Figure 2 for Optimizing Industrial HVAC Systems with Hierarchical Reinforcement Learning
Figure 3 for Optimizing Industrial HVAC Systems with Hierarchical Reinforcement Learning
Figure 4 for Optimizing Industrial HVAC Systems with Hierarchical Reinforcement Learning
Viaarxiv icon

Semi-analytical Industrial Cooling System Model for Reinforcement Learning

Add code
Jul 26, 2022
Figure 1 for Semi-analytical Industrial Cooling System Model for Reinforcement Learning
Figure 2 for Semi-analytical Industrial Cooling System Model for Reinforcement Learning
Figure 3 for Semi-analytical Industrial Cooling System Model for Reinforcement Learning
Figure 4 for Semi-analytical Industrial Cooling System Model for Reinforcement Learning
Viaarxiv icon

COptiDICE: Offline Constrained Reinforcement Learning via Stationary Distribution Correction Estimation

Add code
Apr 19, 2022
Figure 1 for COptiDICE: Offline Constrained Reinforcement Learning via Stationary Distribution Correction Estimation
Figure 2 for COptiDICE: Offline Constrained Reinforcement Learning via Stationary Distribution Correction Estimation
Figure 3 for COptiDICE: Offline Constrained Reinforcement Learning via Stationary Distribution Correction Estimation
Figure 4 for COptiDICE: Offline Constrained Reinforcement Learning via Stationary Distribution Correction Estimation
Viaarxiv icon

Active Offline Policy Selection

Add code
Jun 18, 2021
Figure 1 for Active Offline Policy Selection
Figure 2 for Active Offline Policy Selection
Figure 3 for Active Offline Policy Selection
Figure 4 for Active Offline Policy Selection
Viaarxiv icon

Autoregressive Dynamics Models for Offline Policy Evaluation and Optimization

Add code
Apr 28, 2021
Figure 1 for Autoregressive Dynamics Models for Offline Policy Evaluation and Optimization
Figure 2 for Autoregressive Dynamics Models for Offline Policy Evaluation and Optimization
Figure 3 for Autoregressive Dynamics Models for Offline Policy Evaluation and Optimization
Figure 4 for Autoregressive Dynamics Models for Offline Policy Evaluation and Optimization
Viaarxiv icon