Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jiajing Xu

Privacy Preserving Conversion Modeling in Data Clean Room

May 20, 2025

Kungang Li, Xiangyi Chen, Ling Leng, Jiajing Xu, Jiankai Sun, Behnam Rezaei

Abstract:In the realm of online advertising, accurately predicting the conversion rate (CVR) is crucial for enhancing advertising efficiency and user satisfaction. This paper addresses the challenge of CVR prediction while adhering to user privacy preferences and advertiser requirements. Traditional methods face obstacles such as the reluctance of advertisers to share sensitive conversion data and the limitations of model training in secure environments like data clean rooms. We propose a novel model training framework that enables collaborative model training without sharing sample-level gradients with the advertising platform. Our approach introduces several innovative components: (1) utilizing batch-level aggregated gradients instead of sample-level gradients to minimize privacy risks; (2) applying adapter-based parameter-efficient fine-tuning and gradient compression to reduce communication costs; and (3) employing de-biasing techniques to train the model under label differential privacy, thereby maintaining accuracy despite privacy-enhanced label perturbations. Our experimental results, conducted on industrial datasets, demonstrate that our method achieves competitive ROCAUC performance while significantly decreasing communication overhead and complying with both advertiser privacy requirements and user privacy choices. This framework establishes a new standard for privacy-preserving, high-performance CVR prediction in the digital advertising landscape.

* Published in Proceedings of the 18th ACM Conference on Recommender Systems. 2024 (RecSys '24)

Via

Access Paper or Ask Questions

OmniSage: Large Scale, Multi-Entity Heterogeneous Graph Representation Learning

May 01, 2025

Anirudhan Badrinath, Alex Yang, Kousik Rajesh, Prabhat Agarwal, Jaewon Yang, Haoyu Chen, Jiajing Xu, Charles Rosenberg

Abstract:Representation learning, a task of learning latent vectors to represent entities, is a key task in improving search and recommender systems in web applications. Various representation learning methods have been developed, including graph-based approaches for relationships among entities, sequence-based methods for capturing the temporal evolution of user activities, and content-based models for leveraging text and visual content. However, the development of a unifying framework that integrates these diverse techniques to support multiple applications remains a significant challenge. This paper presents OmniSage, a large-scale representation framework that learns universal representations for a variety of applications at Pinterest. OmniSage integrates graph neural networks with content-based models and user sequence models by employing multiple contrastive learning tasks to effectively process graph data, user sequence data, and content signals. To support the training and inference of OmniSage, we developed an efficient infrastructure capable of supporting Pinterest graphs with billions of nodes. The universal representations generated by OmniSage have significantly enhanced user experiences on Pinterest, leading to an approximate 2.5% increase in sitewide repins (saves) across five applications. This paper highlights the impact of unifying representation learning methods, and we will open source the OmniSage code by the time of publication.

Via

Access Paper or Ask Questions

PinRec: Outcome-Conditioned, Multi-Token Generative Retrieval for Industry-Scale Recommendation Systems

Apr 09, 2025

Anirudhan Badrinath, Prabhat Agarwal, Laksh Bhasin, Jaewon Yang, Jiajing Xu, Charles Rosenberg

Abstract:Generative retrieval methods utilize generative sequential modeling techniques, such as transformers, to generate candidate items for recommender systems. These methods have demonstrated promising results in academic benchmarks, surpassing traditional retrieval models like two-tower architectures. However, current generative retrieval methods lack the scalability required for industrial recommender systems, and they are insufficiently flexible to satisfy the multiple metric requirements of modern systems. This paper introduces PinRec, a novel generative retrieval model developed for applications at Pinterest. PinRec utilizes outcome-conditioned generation, enabling modelers to specify how to balance various outcome metrics, such as the number of saves and clicks, to effectively align with business goals and user exploration. Additionally, PinRec incorporates multi-token generation to enhance output diversity while optimizing generation. Our experiments demonstrate that PinRec can successfully balance performance, diversity, and efficiency, delivering a significant positive impact to users using generative models. This paper marks a significant milestone in generative retrieval, as it presents, to our knowledge, the first rigorous study on implementing generative retrieval at the scale of Pinterest.

* Submitted to KDD ADS 2025

Via

Access Paper or Ask Questions

Hybrid Preference Optimization: Augmenting Direct Preference Optimization with Auxiliary Objectives

May 29, 2024

Anirudhan Badrinath, Prabhat Agarwal, Jiajing Xu

Figure 1 for Hybrid Preference Optimization: Augmenting Direct Preference Optimization with Auxiliary Objectives

Figure 2 for Hybrid Preference Optimization: Augmenting Direct Preference Optimization with Auxiliary Objectives

Figure 3 for Hybrid Preference Optimization: Augmenting Direct Preference Optimization with Auxiliary Objectives

Figure 4 for Hybrid Preference Optimization: Augmenting Direct Preference Optimization with Auxiliary Objectives

Abstract:For aligning large language models (LLMs), prior work has leveraged reinforcement learning via human feedback (RLHF) or variations of direct preference optimization (DPO). While DPO offers a simpler framework based on maximum likelihood estimation, it compromises on the ability to tune language models to easily maximize non-differentiable and non-binary objectives according to the LLM designer's preferences (e.g., using simpler language or minimizing specific kinds of harmful content). These may neither align with user preferences nor even be able to be captured tractably by binary preference data. To leverage the simplicity and performance of DPO with the generalizability of RL, we propose a hybrid approach between DPO and RLHF. With a simple augmentation to the implicit reward decomposition of DPO, we allow for tuning LLMs to maximize a set of arbitrary auxiliary rewards using offline RL. The proposed method, Hybrid Preference Optimization (HPO), shows the ability to effectively generalize to both user preferences and auxiliary designer objectives, while preserving alignment performance across a range of challenging benchmarks and model sizes.

Via

Access Paper or Ask Questions

OmniSearchSage: Multi-Task Multi-Entity Embeddings for Pinterest Search

Apr 25, 2024

Prabhat Agarwal, Minhazul Islam Sk, Nikil Pancha, Kurchi Subhra Hazra, Jiajing Xu, Chuck Rosenberg

Figure 1 for OmniSearchSage: Multi-Task Multi-Entity Embeddings for Pinterest Search

Figure 2 for OmniSearchSage: Multi-Task Multi-Entity Embeddings for Pinterest Search

Figure 3 for OmniSearchSage: Multi-Task Multi-Entity Embeddings for Pinterest Search

Figure 4 for OmniSearchSage: Multi-Task Multi-Entity Embeddings for Pinterest Search

Abstract:In this paper, we present OmniSearchSage, a versatile and scalable system for understanding search queries, pins, and products for Pinterest search. We jointly learn a unified query embedding coupled with pin and product embeddings, leading to an improvement of $>8\%$ relevance, $>7\%$ engagement, and $>5\%$ ads CTR in Pinterest's production search system. The main contributors to these gains are improved content understanding, better multi-task learning, and real-time serving. We enrich our entity representations using diverse text derived from image captions from a generative LLM, historical engagement, and user-curated boards. Our multitask learning setup produces a single search query embedding in the same space as pin and product embeddings and compatible with pre-existing pin and product embeddings. We show the value of each feature through ablation studies, and show the effectiveness of a unified model compared to standalone counterparts. Finally, we share how these embeddings have been deployed across the Pinterest search stack, from retrieval to ranking, scaling to serve $300k$ requests per second at low latency. Our implementation of this work is available at https://github.com/pinterest/atg-research/tree/main/omnisearchsage.

* 8 pages, 5 figures, to be published as an oral paper in TheWebConf Industry Track 2024

Via

Access Paper or Ask Questions

Rethinking Personalized Ranking at Pinterest: An End-to-End Approach

Sep 18, 2022

Jiajing Xu, Andrew Zhai, Charles Rosenberg

Figure 1 for Rethinking Personalized Ranking at Pinterest: An End-to-End Approach

Figure 2 for Rethinking Personalized Ranking at Pinterest: An End-to-End Approach

Figure 3 for Rethinking Personalized Ranking at Pinterest: An End-to-End Approach

Figure 4 for Rethinking Personalized Ranking at Pinterest: An End-to-End Approach

Abstract:In this work, we present our journey to revolutionize the personalized recommendation engine through end-to-end learning from raw user actions. We encode user's long-term interest in Pinner- Former, a user embedding optimized for long-term future actions via a new dense all-action loss, and capture user's short-term intention by directly learning from the real-time action sequences. We conducted both offline and online experiments to validate the performance of the new model architecture, and also address the challenge of serving such a complex model using mixed CPU/GPU setup in production. The proposed system has been deployed in production at Pinterest and has delivered significant online gains across organic and Ads applications.

Via

Access Paper or Ask Questions

Evolution of a Web-Scale Near Duplicate Image Detection System

Sep 18, 2022

Andrey Gusev, Jiajing Xu

Figure 1 for Evolution of a Web-Scale Near Duplicate Image Detection System

Figure 2 for Evolution of a Web-Scale Near Duplicate Image Detection System

Figure 3 for Evolution of a Web-Scale Near Duplicate Image Detection System

Figure 4 for Evolution of a Web-Scale Near Duplicate Image Detection System

Abstract:Detecting near duplicate images is fundamental to the content ecosystem of photo sharing web applications. However, such a task is challenging when involving a web-scale image corpus containing billions of images. In this paper, we present an efficient system for detecting near duplicate images across 8 billion images. Our system consists of three stages: candidate generation, candidate selection, and clustering. We also demonstrate that this system can be used to greatly improve the quality of recommendations and search results across a number of real-world applications. In addition, we include the evolution of the system over the course of six years, bringing out experiences and lessons on how new systems are designed to accommodate organic content growth as well as the latest technology. Finally, we are releasing a human-labeled dataset of ~53,000 pairs of images introduced in this paper.

Via

Access Paper or Ask Questions

Visual Search at Pinterest

Mar 08, 2017

Yushi Jing, David Liu, Dmitry Kislyuk, Andrew Zhai, Jiajing Xu, Jeff Donahue, Sarah Tavel

Abstract:We demonstrate that, with the availability of distributed computation platforms such as Amazon Web Services and open-source tools, it is possible for a small engineering team to build, launch and maintain a cost-effective, large-scale visual search system with widely available tools. We also demonstrate, through a comprehensive set of live experiments at Pinterest, that content recommendation powered by visual search improve user engagement. By sharing our implementation details and the experiences learned from launching a commercial visual search engines from scratch, we hope visual search are more widely incorporated into today's commercial applications.

* in Proceedings of the 21th ACM SIGKDD International Conference on Knowledge and Discovery and Data Mining, 2015

Via

Access Paper or Ask Questions

Training and Evaluating Multimodal Word Embeddings with Large-scale Web Annotated Images

Nov 24, 2016

Junhua Mao, Jiajing Xu, Yushi Jing, Alan Yuille

Figure 1 for Training and Evaluating Multimodal Word Embeddings with Large-scale Web Annotated Images

Figure 2 for Training and Evaluating Multimodal Word Embeddings with Large-scale Web Annotated Images

Figure 3 for Training and Evaluating Multimodal Word Embeddings with Large-scale Web Annotated Images

Figure 4 for Training and Evaluating Multimodal Word Embeddings with Large-scale Web Annotated Images

Abstract:In this paper, we focus on training and evaluating effective word embeddings with both text and visual information. More specifically, we introduce a large-scale dataset with 300 million sentences describing over 40 million images crawled and downloaded from publicly available Pins (i.e. an image with sentence descriptions uploaded by users) on Pinterest. This dataset is more than 200 times larger than MS COCO, the standard large-scale image dataset with sentence descriptions. In addition, we construct an evaluation dataset to directly assess the effectiveness of word embeddings in terms of finding semantically similar or related words and phrases. The word/phrase pairs in this evaluation dataset are collected from the click data with millions of users in an image search system, thus contain rich semantic relationships. Based on these datasets, we propose and compare several Recurrent Neural Networks (RNNs) based multimodal (text and image) models. Experiments show that our model benefits from incorporating the visual information into the word embeddings, and a weight sharing strategy is crucial for learning such multimodal embeddings. The project page is: http://www.stat.ucla.edu/~junhua.mao/multimodal_embedding.html

* Appears in NIPS 2016. The datasets introduced in this work will be gradually released on the project page

Via

Access Paper or Ask Questions

A Hybrid Method for Distance Metric Learning

Jun 29, 2012

Yi-Hao Kao, Benjamin Van Roy, Daniel Rubin, Jiajing Xu, Jessica Faruque, Sandy Napel

Figure 1 for A Hybrid Method for Distance Metric Learning

Figure 2 for A Hybrid Method for Distance Metric Learning

Abstract:We consider the problem of learning a measure of distance among vectors in a feature space and propose a hybrid method that simultaneously learns from similarity ratings assigned to pairs of vectors and class labels assigned to individual vectors. Our method is based on a generative model in which class labels can provide information that is not encoded in feature vectors but yet relates to perceived similarity between objects. Experiments with synthetic data as well as a real medical image retrieval problem demonstrate that leveraging class labels through use of our method improves retrieval performance significantly.

Via

Access Paper or Ask Questions