Picture for Anton Dereventsov

Anton Dereventsov

Data-Centric Approach to Constrained Machine Learning: A Case Study on Conway's Game of Life

Add code
Aug 23, 2024
Viaarxiv icon

An Empirical Categorization of Prompting Techniques for Large Language Models: A Practitioner's Guide

Add code
Feb 18, 2024
Viaarxiv icon

Increasing Entropy to Boost Policy Gradient Performance on Personalization Tasks

Add code
Oct 09, 2023
Viaarxiv icon

Zero-Shot Recommendations with Pre-Trained Large Language Models for Multimodal Nudging

Add code
Sep 02, 2023
Figure 1 for Zero-Shot Recommendations with Pre-Trained Large Language Models for Multimodal Nudging
Figure 2 for Zero-Shot Recommendations with Pre-Trained Large Language Models for Multimodal Nudging
Figure 3 for Zero-Shot Recommendations with Pre-Trained Large Language Models for Multimodal Nudging
Figure 4 for Zero-Shot Recommendations with Pre-Trained Large Language Models for Multimodal Nudging
Viaarxiv icon

Modeling Non-deterministic Human Behaviors in Discrete Food Choices

Add code
Jan 23, 2023
Viaarxiv icon

Examining Policy Entropy of Reinforcement Learning Agents for Personalization Tasks

Add code
Nov 21, 2022
Viaarxiv icon

Simulated Contextual Bandits for Personalization Tasks from Recommendation Datasets

Add code
Oct 12, 2022
Figure 1 for Simulated Contextual Bandits for Personalization Tasks from Recommendation Datasets
Figure 2 for Simulated Contextual Bandits for Personalization Tasks from Recommendation Datasets
Figure 3 for Simulated Contextual Bandits for Personalization Tasks from Recommendation Datasets
Figure 4 for Simulated Contextual Bandits for Personalization Tasks from Recommendation Datasets
Viaarxiv icon

On the Unreasonable Efficiency of State Space Clustering in Personalization Tasks

Add code
Dec 24, 2021
Figure 1 for On the Unreasonable Efficiency of State Space Clustering in Personalization Tasks
Figure 2 for On the Unreasonable Efficiency of State Space Clustering in Personalization Tasks
Figure 3 for On the Unreasonable Efficiency of State Space Clustering in Personalization Tasks
Figure 4 for On the Unreasonable Efficiency of State Space Clustering in Personalization Tasks
Viaarxiv icon

Offline Policy Comparison under Limited Historical Agent-Environment Interactions

Add code
Jun 07, 2021
Figure 1 for Offline Policy Comparison under Limited Historical Agent-Environment Interactions
Figure 2 for Offline Policy Comparison under Limited Historical Agent-Environment Interactions
Figure 3 for Offline Policy Comparison under Limited Historical Agent-Environment Interactions
Figure 4 for Offline Policy Comparison under Limited Historical Agent-Environment Interactions
Viaarxiv icon

An adaptive stochastic gradient-free approach for high-dimensional blackbox optimization

Add code
Jun 18, 2020
Figure 1 for An adaptive stochastic gradient-free approach for high-dimensional blackbox optimization
Figure 2 for An adaptive stochastic gradient-free approach for high-dimensional blackbox optimization
Figure 3 for An adaptive stochastic gradient-free approach for high-dimensional blackbox optimization
Figure 4 for An adaptive stochastic gradient-free approach for high-dimensional blackbox optimization
Viaarxiv icon