Anton Masalovich

End-to-end multi-modal product matching in fashion e-commerce

Mar 18, 2024

Sándor Tóth, Stephen Wilson, Alexia Tsoukara, Enric Moreu, Anton Masalovich, Lars Roemheld

Abstract: Product matching, the task of identifying different representations of the same product for better discoverability, curation, and pricing, is a key capability for online marketplace and e-commerce companies. We present a robust multi-modal product matching system in an industry setting, where large datasets, data distribution shifts, and unseen domains pose challenges. We compare different approaches and conclude that a relatively straightforward projection of pretrained image and text encoders, trained through contrastive learning, yields state-of-the-art results while balancing cost and performance. Our solution outperforms single-modality matching systems and large pretrained models such as CLIP. Furthermore, we show how a human-in-the-loop process can be combined with model-based predictions to achieve near-perfect precision in a production system.
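
The abstract's core idea, projecting pretrained image and text features into a shared space and training the projection contrastively, can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation: the sum-based fusion, the InfoNCE-style loss, the temperature value, and all function names (`project`, `fuse`, `info_nce`) are assumptions made for illustration, and the encoder outputs are stand-in vectors.

```python
import math

def l2_normalize(v):
    # Scale a vector to unit length; fall back to 1.0 to avoid dividing by zero.
    norm = math.sqrt(sum(x * x for x in v)) or 1.0
    return [x / norm for x in v]

def project(features, weights):
    # Linear projection: `weights` is a list of output rows, each len(features).
    return [sum(w * x for w, x in zip(row, features)) for row in weights]

def fuse(image_feat, text_feat, w_image, w_text):
    # Assumption: project each modality, sum, and normalize into one embedding.
    img = project(image_feat, w_image)
    txt = project(text_feat, w_text)
    return l2_normalize([a + b for a, b in zip(img, txt)])

def info_nce(anchors, positives, temperature=0.1):
    # Contrastive loss: positives[i] matches anchors[i]; every other row in
    # the batch serves as a negative. Uses a numerically stable log-sum-exp.
    total = 0.0
    for i, a in enumerate(anchors):
        logits = [sum(x * y for x, y in zip(a, p)) / temperature
                  for p in positives]
        m = max(logits)
        log_denominator = m + math.log(sum(math.exp(l - m) for l in logits))
        total += log_denominator - logits[i]
    return total / len(anchors)
```

In a trained system the projection weights would be updated to reduce this loss, so that two listings of the same product land close together while different products are pushed apart; matching then reduces to a nearest-neighbor search over the fused embeddings.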

* 9 pages, submitted to SIGKDD
