Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:X-Reflect: Cross-Reflection Prompting for Multimodal Recommendation

Aug 27, 2024

Hanjia Lyu, Ryan Rossi, Xiang Chen, Md Mehrab Tanjim, Stefano Petrangeli, Somdeb Sarkhel, Jiebo Luo

Figure 1 for X-Reflect: Cross-Reflection Prompting for Multimodal Recommendation

Figure 2 for X-Reflect: Cross-Reflection Prompting for Multimodal Recommendation

Figure 3 for X-Reflect: Cross-Reflection Prompting for Multimodal Recommendation

Figure 4 for X-Reflect: Cross-Reflection Prompting for Multimodal Recommendation

Share this with someone who'll enjoy it:

Abstract:Large Language Models (LLMs) and Large Multimodal Models (LMMs) have been shown to enhance the effectiveness of enriching item descriptions, thereby improving the accuracy of recommendation systems. However, most existing approaches either rely on text-only prompting or employ basic multimodal strategies that do not fully exploit the complementary information available from both textual and visual modalities. This paper introduces a novel framework, Cross-Reflection Prompting, termed X-Reflect, designed to address these limitations by prompting LMMs to explicitly identify and reconcile supportive and conflicting information between text and images. By capturing nuanced insights from both modalities, this approach generates more comprehensive and contextually richer item representations. Extensive experiments conducted on two widely used benchmarks demonstrate that our method outperforms existing prompting baselines in downstream recommendation accuracy. Additionally, we evaluate the generalizability of our framework across different LMM backbones and the robustness of the prompting strategies, offering insights for optimization. This work underscores the importance of integrating multimodal information and presents a novel solution for improving item understanding in multimodal recommendation systems.

View paper on

Share this with someone who'll enjoy it:

Title:X-Reflect: Cross-Reflection Prompting for Multimodal Recommendation

Paper and Code