Picture for Yifei Ma

Yifei Ma

Murali

Aligning Vision Models with Human Aesthetics in Retrieval: Benchmarks and Algorithms

Add code
Jun 13, 2024
Viaarxiv icon

Optimal Design for Human Feedback

Add code
Apr 22, 2024
Viaarxiv icon

Experimental Design for Active Transductive Inference in Large Language Models

Add code
Apr 12, 2024
Viaarxiv icon

Logic-Scaffolding: Personalized Aspect-Instructed Recommendation Explanation Generation using LLMs

Add code
Dec 22, 2023
Viaarxiv icon

Fixed-Budget Best-Arm Identification with Heterogeneous Reward Variances

Add code
Jun 13, 2023
Viaarxiv icon

Context Uncertainty in Contextual Bandits with Applications to Recommender Systems

Add code
Feb 16, 2022
Figure 1 for Context Uncertainty in Contextual Bandits with Applications to Recommender Systems
Figure 2 for Context Uncertainty in Contextual Bandits with Applications to Recommender Systems
Figure 3 for Context Uncertainty in Contextual Bandits with Applications to Recommender Systems
Figure 4 for Context Uncertainty in Contextual Bandits with Applications to Recommender Systems
Viaarxiv icon

Zero-Shot Recommender Systems

Add code
May 18, 2021
Figure 1 for Zero-Shot Recommender Systems
Figure 2 for Zero-Shot Recommender Systems
Figure 3 for Zero-Shot Recommender Systems
Figure 4 for Zero-Shot Recommender Systems
Viaarxiv icon

Optimal Off-Policy Evaluation for Reinforcement Learning with Marginalized Importance Sampling

Add code
Jun 08, 2019
Figure 1 for Optimal Off-Policy Evaluation for Reinforcement Learning with Marginalized Importance Sampling
Figure 2 for Optimal Off-Policy Evaluation for Reinforcement Learning with Marginalized Importance Sampling
Figure 3 for Optimal Off-Policy Evaluation for Reinforcement Learning with Marginalized Importance Sampling
Figure 4 for Optimal Off-Policy Evaluation for Reinforcement Learning with Marginalized Importance Sampling
Viaarxiv icon

Dynamic Mini-batch SGD for Elastic Distributed Training: Learning in the Limbo of Resources

Add code
May 02, 2019
Figure 1 for Dynamic Mini-batch SGD for Elastic Distributed Training: Learning in the Limbo of Resources
Figure 2 for Dynamic Mini-batch SGD for Elastic Distributed Training: Learning in the Limbo of Resources
Figure 3 for Dynamic Mini-batch SGD for Elastic Distributed Training: Learning in the Limbo of Resources
Figure 4 for Dynamic Mini-batch SGD for Elastic Distributed Training: Learning in the Limbo of Resources
Viaarxiv icon

Imitation-Regularized Offline Learning

Add code
Jan 15, 2019
Figure 1 for Imitation-Regularized Offline Learning
Figure 2 for Imitation-Regularized Offline Learning
Figure 3 for Imitation-Regularized Offline Learning
Figure 4 for Imitation-Regularized Offline Learning
Viaarxiv icon