Mohammadi Zaki

Isometric Neural Machine Translation using Phoneme Count Ratio Reward-based Reinforcement Learning

Mar 20, 2024
Via arXiv

Actor-Critic based Improper Reinforcement Learning

Jul 19, 2022
Via arXiv

Improper Learning with Gradient-based Policy Optimization

Feb 21, 2021
Via arXiv

Explicit Best Arm Identification in Linear Bandits Using No-Regret Learners

Jun 13, 2020
Via arXiv

Towards Optimal and Efficient Best Arm Identification in Linear Bandits

Nov 07, 2019
Via arXiv

Low-rank Bandits with Latent Mixtures

Sep 06, 2016
Via arXiv