Sriram Vasudevan

Weak Supervision for Improved Precision in Search Systems

Mar 10, 2025

Sriram Vasudevan

Abstract: Labeled datasets are essential for modern search engines, which increasingly rely on supervised learning methods like Learning to Rank and massive amounts of data to power deep learning models. However, creating these datasets is both time-consuming and costly, leading to the common use of user click and activity logs as proxies for relevance. In this paper, we present a weak supervision approach to infer the quality of query-document pairs and apply it within a Learning to Rank framework to enhance the precision of a large-scale search system.

* Accepted to the AAAI 2025 Workshop on Computational Jobs Marketplace

Learning to Retrieve for Job Matching

Feb 21, 2024

Jianqiang Shen, Yuchin Juan, Shaobo Zhang, Ping Liu, Wen Pu, Sriram Vasudevan, Qingquan Song, Fedor Borisyuk, Kay Qianqi Shen, Haichao Wei (+14 more)

Abstract: Web-scale search systems typically tackle the scalability challenge with a two-step paradigm: retrieval and ranking. The retrieval step, also known as candidate selection, often involves extracting standardized entities, creating an inverted index, and performing term matching for retrieval. Such traditional methods require manual and time-consuming development of query models. In this paper, we discuss applying learning-to-retrieve technology to enhance LinkedIn's job search and recommendation systems. In the realm of promoted jobs, the key objective is to improve the quality of applicants, thereby delivering value to recruiter customers. To achieve this, we leverage confirmed hire data to construct a graph that evaluates a seeker's qualification for a job, and utilize learned links for retrieval. Our learned model is easy to explain, debug, and adjust. On the other hand, the focus for organic jobs is to optimize seeker engagement. We accomplished this by training embeddings for personalized retrieval, fortified by a set of rules derived from the categorization of member feedback. In addition to a solution based on a conventional inverted index, we developed an on-GPU solution capable of supporting both KNN and term matching efficiently.

LiFT: A Scalable Framework for Measuring Fairness in ML Applications

Aug 14, 2020

Sriram Vasudevan, Krishnaram Kenthapadi

Abstract: Many internet applications are powered by machine learned models, which are usually trained on labeled datasets obtained through either implicit/explicit user feedback signals or human judgments. Since societal biases may be present in the generation of such datasets, it is possible for the trained models to be biased, thereby resulting in potential discrimination and harms for disadvantaged groups. Motivated by the need for understanding and addressing algorithmic bias in web-scale ML systems and the limitations of existing fairness toolkits, we present the LinkedIn Fairness Toolkit (LiFT), a framework for scalable computation of fairness metrics as part of large ML systems. We highlight the key requirements in deployed settings, and present the design of our fairness measurement system. We discuss the challenges encountered in incorporating fairness tools in practice and the lessons learned during deployment at LinkedIn. Finally, we provide open problems based on practical experience.

* Accepted for publication in CIKM 2020
