Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jan Malte Lichtenberg

Ranking Across Different Content Types: The Robust Beauty of Multinomial Blending

Aug 17, 2024

Jan Malte Lichtenberg, Giuseppe Di Benedetto, Matteo Ruffini

Figure 1 for Ranking Across Different Content Types: The Robust Beauty of Multinomial Blending

Figure 2 for Ranking Across Different Content Types: The Robust Beauty of Multinomial Blending

Figure 3 for Ranking Across Different Content Types: The Robust Beauty of Multinomial Blending

Abstract:An increasing number of media streaming services have expanded their offerings to include entities of multiple content types. For instance, audio streaming services that started by offering music only, now also offer podcasts, merchandise items, and videos. Ranking items across different content types into a single slate poses a significant challenge for traditional learning-to-rank (LTR) algorithms due to differing user engagement patterns for different content types. We explore a simple method for cross-content-type ranking, called multinomial blending (MB), which can be used in conjunction with most existing LTR algorithms. We compare MB to existing baselines not only in terms of ranking quality but also from other industry-relevant perspectives such as interpretability, ease-of-use, and stability in dynamic environments with changing user behavior and ranking model retraining. Finally, we report the results of an A/B test from an Amazon Music ranking use-case.

* To appear in 18th ACM Conference on Recommender Systems (RecSys24), Bari, Italy. ACM, New York, NY, USA, 3 pages

Via

Access Paper or Ask Questions

Large Language Models as Recommender Systems: A Study of Popularity Bias

Jun 03, 2024

Jan Malte Lichtenberg, Alexander Buchholz, Pola Schwöbel

Abstract:The issue of popularity bias -- where popular items are disproportionately recommended, overshadowing less popular but potentially relevant items -- remains a significant challenge in recommender systems. Recent advancements have seen the integration of general-purpose Large Language Models (LLMs) into the architecture of such systems. This integration raises concerns that it might exacerbate popularity bias, given that the LLM's training data is likely dominated by popular items. However, it simultaneously presents a novel opportunity to address the bias via prompt tuning. Our study explores this dichotomy, examining whether LLMs contribute to or can alleviate popularity bias in recommender systems. We introduce a principled way to measure popularity bias by discussing existing metrics and proposing a novel metric that fulfills a series of desiderata. Based on our new metric, we compare a simple LLM-based recommender to traditional recommender systems on a movie recommendation task. We find that the LLM recommender exhibits less popularity bias, even without any explicit mitigation.

* Accepted at Gen-IR@SIGIR24 workshop

Via

Access Paper or Ask Questions

Double Clipping: Less-Biased Variance Reduction in Off-Policy Evaluation

Sep 03, 2023

Jan Malte Lichtenberg, Alexander Buchholz, Giuseppe Di Benedetto, Matteo Ruffini, Ben London

Abstract:"Clipping" (a.k.a. importance weight truncation) is a widely used variance-reduction technique for counterfactual off-policy estimators. Like other variance-reduction techniques, clipping reduces variance at the cost of increased bias. However, unlike other techniques, the bias introduced by clipping is always a downward bias (assuming non-negative rewards), yielding a lower bound on the true expected reward. In this work we propose a simple extension, called $\textit{double clipping}$, which aims to compensate this downward bias and thus reduce the overall bias, while maintaining the variance reduction properties of the original estimator.

* Presented at CONSEQUENCES '23 workshop at RecSys 2023 conference in Singapore

Via

Access Paper or Ask Questions

Low-variance estimation in the Plackett-Luce model via quasi-Monte Carlo sampling

May 12, 2022

Alexander Buchholz, Jan Malte Lichtenberg, Giuseppe Di Benedetto, Yannik Stein, Vito Bellini, Matteo Ruffini

Figure 1 for Low-variance estimation in the Plackett-Luce model via quasi-Monte Carlo sampling

Figure 2 for Low-variance estimation in the Plackett-Luce model via quasi-Monte Carlo sampling

Figure 3 for Low-variance estimation in the Plackett-Luce model via quasi-Monte Carlo sampling

Figure 4 for Low-variance estimation in the Plackett-Luce model via quasi-Monte Carlo sampling

Abstract:The Plackett-Luce (PL) model is ubiquitous in learning-to-rank (LTR) because it provides a useful and intuitive probabilistic model for sampling ranked lists. Counterfactual offline evaluation and optimization of ranking metrics are pivotal for using LTR methods in production. When adopting the PL model as a ranking policy, both tasks require the computation of expectations with respect to the model. These are usually approximated via Monte-Carlo (MC) sampling, since the combinatorial scaling in the number of items to be ranked makes their analytical computation intractable. Despite recent advances in improving the computational efficiency of the sampling process via the Gumbel top-k trick, the MC estimates can suffer from high variance. We develop a novel approach to producing more sample-efficient estimators of expectations in the PL model by combining the Gumbel top-k trick with quasi-Monte Carlo (QMC) sampling, a well-established technique for variance reduction. We illustrate our findings both theoretically and empirically using real-world recommendation data from Amazon Music and the Yahoo learning-to-rank challenge.

Via

Access Paper or Ask Questions

Iterative Policy-Space Expansion in Reinforcement Learning

Dec 05, 2019

Jan Malte Lichtenberg, Özgür Şimşek

Figure 1 for Iterative Policy-Space Expansion in Reinforcement Learning

Figure 2 for Iterative Policy-Space Expansion in Reinforcement Learning

Abstract:Humans and animals solve a difficult problem much more easily when they are presented with a sequence of problems that starts simple and slowly increases in difficulty. We explore this idea in the context of reinforcement learning. Rather than providing the agent with an externally provided curriculum of progressively more difficult tasks, the agent solves a single task utilizing a decreasingly constrained policy space. The algorithm we propose first learns to categorize features into positive and negative before gradually learning a more refined policy. Experimental results in Tetris demonstrate superior learning rate of our approach when compared to existing algorithms.

* Workshop on Biological and Artificial Reinforcement Learning at the 33rd Conference on Neural Information Processing Systems (NeurIPS 2019), Vancouver, Canada

Via

Access Paper or Ask Questions