Picture for Dhruv Madeka

Dhruv Madeka

Speculative Knowledge Distillation: Bridging the Teacher-Student Gap Through Interleaved Sampling

Add code
Oct 15, 2024
Figure 1 for Speculative Knowledge Distillation: Bridging the Teacher-Student Gap Through Interleaved Sampling
Figure 2 for Speculative Knowledge Distillation: Bridging the Teacher-Student Gap Through Interleaved Sampling
Figure 3 for Speculative Knowledge Distillation: Bridging the Teacher-Student Gap Through Interleaved Sampling
Figure 4 for Speculative Knowledge Distillation: Bridging the Teacher-Student Gap Through Interleaved Sampling
Viaarxiv icon

A Study on the Calibration of In-context Learning

Add code
Dec 11, 2023
Viaarxiv icon

Learning an Inventory Control Policy with General Inventory Arrival Dynamics

Add code
Oct 26, 2023
Viaarxiv icon

Contextual Bandits for Evaluating and Improving Inventory Control Policies

Add code
Oct 24, 2023
Figure 1 for Contextual Bandits for Evaluating and Improving Inventory Control Policies
Viaarxiv icon

Scaling Laws for Imitation Learning in NetHack

Add code
Jul 18, 2023
Viaarxiv icon

Linear Reinforcement Learning with Ball Structure Action Space

Add code
Nov 14, 2022
Viaarxiv icon

Deep Inventory Management

Add code
Oct 06, 2022
Figure 1 for Deep Inventory Management
Figure 2 for Deep Inventory Management
Figure 3 for Deep Inventory Management
Figure 4 for Deep Inventory Management
Viaarxiv icon

MQRetNN: Multi-Horizon Time Series Forecasting with Retrieval Augmentation

Add code
Jul 21, 2022
Figure 1 for MQRetNN: Multi-Horizon Time Series Forecasting with Retrieval Augmentation
Figure 2 for MQRetNN: Multi-Horizon Time Series Forecasting with Retrieval Augmentation
Figure 3 for MQRetNN: Multi-Horizon Time Series Forecasting with Retrieval Augmentation
Figure 4 for MQRetNN: Multi-Horizon Time Series Forecasting with Retrieval Augmentation
Viaarxiv icon

A Few Expert Queries Suffices for Sample-Efficient RL with Resets and Linear Value Approximation

Add code
Jul 18, 2022
Figure 1 for A Few Expert Queries Suffices for Sample-Efficient RL with Resets and Linear Value Approximation
Viaarxiv icon

Assessment of Treatment Effect Estimators for Heavy-Tailed Data

Add code
Dec 19, 2021
Figure 1 for Assessment of Treatment Effect Estimators for Heavy-Tailed Data
Figure 2 for Assessment of Treatment Effect Estimators for Heavy-Tailed Data
Figure 3 for Assessment of Treatment Effect Estimators for Heavy-Tailed Data
Figure 4 for Assessment of Treatment Effect Estimators for Heavy-Tailed Data
Viaarxiv icon