Picture for Yifei Ma

Yifei Ma

Murali

Aligning Vision Models with Human Aesthetics in Retrieval: Benchmarks and Algorithms

Add code
Jun 13, 2024
Figure 1 for Aligning Vision Models with Human Aesthetics in Retrieval: Benchmarks and Algorithms
Figure 2 for Aligning Vision Models with Human Aesthetics in Retrieval: Benchmarks and Algorithms
Figure 3 for Aligning Vision Models with Human Aesthetics in Retrieval: Benchmarks and Algorithms
Figure 4 for Aligning Vision Models with Human Aesthetics in Retrieval: Benchmarks and Algorithms
Viaarxiv icon

Optimal Design for Human Feedback

Add code
Apr 22, 2024
Figure 1 for Optimal Design for Human Feedback
Figure 2 for Optimal Design for Human Feedback
Figure 3 for Optimal Design for Human Feedback
Viaarxiv icon

Experimental Design for Active Transductive Inference in Large Language Models

Add code
Apr 12, 2024
Viaarxiv icon

Logic-Scaffolding: Personalized Aspect-Instructed Recommendation Explanation Generation using LLMs

Add code
Dec 22, 2023
Viaarxiv icon

Fixed-Budget Best-Arm Identification with Heterogeneous Reward Variances

Add code
Jun 13, 2023
Viaarxiv icon

Context Uncertainty in Contextual Bandits with Applications to Recommender Systems

Add code
Feb 16, 2022
Figure 1 for Context Uncertainty in Contextual Bandits with Applications to Recommender Systems
Figure 2 for Context Uncertainty in Contextual Bandits with Applications to Recommender Systems
Figure 3 for Context Uncertainty in Contextual Bandits with Applications to Recommender Systems
Figure 4 for Context Uncertainty in Contextual Bandits with Applications to Recommender Systems
Viaarxiv icon

Zero-Shot Recommender Systems

Add code
May 18, 2021
Figure 1 for Zero-Shot Recommender Systems
Figure 2 for Zero-Shot Recommender Systems
Figure 3 for Zero-Shot Recommender Systems
Figure 4 for Zero-Shot Recommender Systems
Viaarxiv icon

Optimal Off-Policy Evaluation for Reinforcement Learning with Marginalized Importance Sampling

Add code
Jun 08, 2019
Figure 1 for Optimal Off-Policy Evaluation for Reinforcement Learning with Marginalized Importance Sampling
Figure 2 for Optimal Off-Policy Evaluation for Reinforcement Learning with Marginalized Importance Sampling
Figure 3 for Optimal Off-Policy Evaluation for Reinforcement Learning with Marginalized Importance Sampling
Figure 4 for Optimal Off-Policy Evaluation for Reinforcement Learning with Marginalized Importance Sampling
Viaarxiv icon

Dynamic Mini-batch SGD for Elastic Distributed Training: Learning in the Limbo of Resources

Add code
May 02, 2019
Figure 1 for Dynamic Mini-batch SGD for Elastic Distributed Training: Learning in the Limbo of Resources
Figure 2 for Dynamic Mini-batch SGD for Elastic Distributed Training: Learning in the Limbo of Resources
Figure 3 for Dynamic Mini-batch SGD for Elastic Distributed Training: Learning in the Limbo of Resources
Figure 4 for Dynamic Mini-batch SGD for Elastic Distributed Training: Learning in the Limbo of Resources
Viaarxiv icon

Imitation-Regularized Offline Learning

Add code
Jan 15, 2019
Figure 1 for Imitation-Regularized Offline Learning
Figure 2 for Imitation-Regularized Offline Learning
Figure 3 for Imitation-Regularized Offline Learning
Figure 4 for Imitation-Regularized Offline Learning
Viaarxiv icon