Picture for Hardik Meisheri

Hardik Meisheri

Multi-Agent Learning of Efficient Fulfilment and Routing Strategies in E-Commerce

Add code
Nov 20, 2023
Viaarxiv icon

DCT: Dual Channel Training of Action Embeddings for Reinforcement Learning with Large Discrete Action Spaces

Add code
Jun 28, 2023
Viaarxiv icon

Using Contrastive Samples for Identifying and Leveraging Possible Causal Relationships in Reinforcement Learning

Add code
Oct 28, 2022
Viaarxiv icon

A Learning Based Framework for Handling Uncertain Lead Times in Multi-Product Inventory Management

Add code
Mar 09, 2022
Figure 1 for A Learning Based Framework for Handling Uncertain Lead Times in Multi-Product Inventory Management
Figure 2 for A Learning Based Framework for Handling Uncertain Lead Times in Multi-Product Inventory Management
Figure 3 for A Learning Based Framework for Handling Uncertain Lead Times in Multi-Product Inventory Management
Figure 4 for A Learning Based Framework for Handling Uncertain Lead Times in Multi-Product Inventory Management
Viaarxiv icon

Follow your Nose: Using General Value Functions for Directed Exploration in Reinforcement Learning

Add code
Mar 02, 2022
Figure 1 for Follow your Nose: Using General Value Functions for Directed Exploration in Reinforcement Learning
Figure 2 for Follow your Nose: Using General Value Functions for Directed Exploration in Reinforcement Learning
Figure 3 for Follow your Nose: Using General Value Functions for Directed Exploration in Reinforcement Learning
Figure 4 for Follow your Nose: Using General Value Functions for Directed Exploration in Reinforcement Learning
Viaarxiv icon

Learning to Minimize Cost-to-Serve for Multi-Node Multi-Product Order Fulfilment in Electronic Commerce

Add code
Dec 16, 2021
Figure 1 for Learning to Minimize Cost-to-Serve for Multi-Node Multi-Product Order Fulfilment in Electronic Commerce
Figure 2 for Learning to Minimize Cost-to-Serve for Multi-Node Multi-Product Order Fulfilment in Electronic Commerce
Figure 3 for Learning to Minimize Cost-to-Serve for Multi-Node Multi-Product Order Fulfilment in Electronic Commerce
Figure 4 for Learning to Minimize Cost-to-Serve for Multi-Node Multi-Product Order Fulfilment in Electronic Commerce
Viaarxiv icon

School of hard knocks: Curriculum analysis for Pommerman with a fixed computational budget

Add code
Feb 24, 2021
Figure 1 for School of hard knocks: Curriculum analysis for Pommerman with a fixed computational budget
Figure 2 for School of hard knocks: Curriculum analysis for Pommerman with a fixed computational budget
Figure 3 for School of hard knocks: Curriculum analysis for Pommerman with a fixed computational budget
Figure 4 for School of hard knocks: Curriculum analysis for Pommerman with a fixed computational budget
Viaarxiv icon

Sample Efficient Training in Multi-Agent Adversarial Games with Limited Teammate Communication

Add code
Nov 01, 2020
Figure 1 for Sample Efficient Training in Multi-Agent Adversarial Games with Limited Teammate Communication
Figure 2 for Sample Efficient Training in Multi-Agent Adversarial Games with Limited Teammate Communication
Figure 3 for Sample Efficient Training in Multi-Agent Adversarial Games with Limited Teammate Communication
Figure 4 for Sample Efficient Training in Multi-Agent Adversarial Games with Limited Teammate Communication
Viaarxiv icon

Reinforcement Learning for Multi-Product Multi-Node Inventory Management in Supply Chains

Add code
Jun 07, 2020
Figure 1 for Reinforcement Learning for Multi-Product Multi-Node Inventory Management in Supply Chains
Figure 2 for Reinforcement Learning for Multi-Product Multi-Node Inventory Management in Supply Chains
Figure 3 for Reinforcement Learning for Multi-Product Multi-Node Inventory Management in Supply Chains
Figure 4 for Reinforcement Learning for Multi-Product Multi-Node Inventory Management in Supply Chains
Viaarxiv icon

Accelerating Training in Pommerman with Imitation and Reinforcement Learning

Add code
Nov 13, 2019
Figure 1 for Accelerating Training in Pommerman with Imitation and Reinforcement Learning
Figure 2 for Accelerating Training in Pommerman with Imitation and Reinforcement Learning
Figure 3 for Accelerating Training in Pommerman with Imitation and Reinforcement Learning
Figure 4 for Accelerating Training in Pommerman with Imitation and Reinforcement Learning
Viaarxiv icon