Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Haggai Roitman

X-Cross: Dynamic Integration of Language Models for Cross-Domain Sequential Recommendation

Apr 29, 2025

Guy Hadad, Haggai Roitman, Yotam Eshel, Bracha Shapira, Lior Rokach

Abstract:As new products are emerging daily, recommendation systems are required to quickly adapt to possible new domains without needing extensive retraining. This work presents ``X-Cross'' -- a novel cross-domain sequential-recommendation model that recommends products in new domains by integrating several domain-specific language models; each model is fine-tuned with low-rank adapters (LoRA). Given a recommendation prompt, operating layer by layer, X-Cross dynamically refines the representation of each source language model by integrating knowledge from all other models. These refined representations are propagated from one layer to the next, leveraging the activations from each domain adapter to ensure domain-specific nuances are preserved while enabling adaptability across domains. Using Amazon datasets for sequential recommendation, X-Cross achieves performance comparable to a model that is fine-tuned with LoRA, while using only 25% of the additional parameters. In cross-domain tasks, such as adapting from Toys domain to Tools, Electronics or Sports, X-Cross demonstrates robust performance, while requiring about 50%-75% less fine-tuning data than LoRA to make fine-tuning effective. Furthermore, X-Cross achieves significant improvement in accuracy over alternative cross-domain baselines. Overall, X-Cross enables scalable and adaptive cross-domain recommendations, reducing computational overhead and providing an efficient solution for data-constrained environments.

* Accepted for publication in SIGIR '25

Via

Access Paper or Ask Questions

Unsupervised Search Algorithm Configuration using Query Performance Prediction

Oct 03, 2022

Haggai Roitman

Figure 1 for Unsupervised Search Algorithm Configuration using Query Performance Prediction

Figure 2 for Unsupervised Search Algorithm Configuration using Query Performance Prediction

Figure 3 for Unsupervised Search Algorithm Configuration using Query Performance Prediction

Abstract:Search engine configuration can be quite difficult for inexpert developers. Instead, an auto-configuration approach can be used to speed up development time. Yet, such an automatic process usually requires relevance labels to train a supervised model. In this work, we suggest a simple solution based on query performance prediction that requires no relevance labels but only a sample of queries in a given domain. Using two example usecases we demonstrate the merits of our solution.

Via

Access Paper or Ask Questions

Learning to Diversify for Product Question Generation

Jul 06, 2022

Haggai Roitman, Uriel Singer, Yotam Eshel, Alexander Nus, Eliyahu Kiperwasser

Figure 1 for Learning to Diversify for Product Question Generation

Figure 2 for Learning to Diversify for Product Question Generation

Figure 3 for Learning to Diversify for Product Question Generation

Figure 4 for Learning to Diversify for Product Question Generation

Abstract:We address the product question generation task. For a given product description, our goal is to generate questions that reflect potential user information needs that are either missing or not well covered in the description. Moreover, we wish to cover diverse user information needs that may span a multitude of product types. To this end, we first show how the T5 pre-trained Transformer encoder-decoder model can be fine-tuned for the task. Yet, while the T5 generated questions have a reasonable quality compared to the state-of-the-art method for the task (KPCNet), many of such questions are still too general, resulting in a sub-optimal global question diversity. As an alternative, we propose a novel learning-to-diversify (LTD) fine-tuning approach that allows to enrich the language learned by the underlying Transformer model. Our empirical evaluation shows that, using our approach significantly improves the global diversity of the underlying Transformer model, while preserves, as much as possible, its generation relevance.

Via

Access Paper or Ask Questions

tBDFS: Temporal Graph Neural Network Leveraging DFS

Jun 12, 2022

Uriel Singer, Haggai Roitman, Ido Guy, Kira Radinsky

Figure 1 for tBDFS: Temporal Graph Neural Network Leveraging DFS

Figure 2 for tBDFS: Temporal Graph Neural Network Leveraging DFS

Figure 3 for tBDFS: Temporal Graph Neural Network Leveraging DFS

Figure 4 for tBDFS: Temporal Graph Neural Network Leveraging DFS

Abstract:Temporal graph neural networks (temporal GNNs) have been widely researched, reaching state-of-the-art results on multiple prediction tasks. A common approach employed by most previous works is to apply a layer that aggregates information from the historical neighbors of a node. Taking a different research direction, in this work, we propose tBDFS -- a novel temporal GNN architecture. tBDFS applies a layer that efficiently aggregates information from temporal paths to a given (target) node in the graph. For each given node, the aggregation is applied in two stages: (1) A single representation is learned for each temporal path ending in that node, and (2) all path representations are aggregated into a final node representation. Overall, our goal is not to add new information to a node, but rather observe the same exact information in a new perspective. This allows our model to directly observe patterns that are path-oriented rather than neighborhood-oriented. This can be thought as a Depth-First Search (DFS) traversal over the temporal graph, compared to the popular Breath-First Search (BFS) traversal that is applied in previous works. We evaluate tBDFS over multiple link prediction tasks and show its favorable performance compared to state-of-the-art baselines. To the best of our knowledge, we are the first to apply a temporal-DFS neural network.

* 9 pages, 2 figures, 2 tables

Via

Access Paper or Ask Questions

Sequential Modeling with Multiple Attributes for Watchlist Recommendation in E-Commerce

Oct 24, 2021

Uriel Singer, Haggai Roitman, Yotam Eshel, Alexander Nus, Ido Guy, Or Levi, Idan Hasson, Eliyahu Kiperwasser

Figure 1 for Sequential Modeling with Multiple Attributes for Watchlist Recommendation in E-Commerce

Figure 2 for Sequential Modeling with Multiple Attributes for Watchlist Recommendation in E-Commerce

Figure 3 for Sequential Modeling with Multiple Attributes for Watchlist Recommendation in E-Commerce

Figure 4 for Sequential Modeling with Multiple Attributes for Watchlist Recommendation in E-Commerce

Abstract:In e-commerce, the watchlist enables users to track items over time and has emerged as a primary feature, playing an important role in users' shopping journey. Watchlist items typically have multiple attributes whose values may change over time (e.g., price, quantity). Since many users accumulate dozens of items on their watchlist, and since shopping intents change over time, recommending the top watchlist items in a given context can be valuable. In this work, we study the watchlist functionality in e-commerce and introduce a novel watchlist recommendation task. Our goal is to prioritize which watchlist items the user should pay attention to next by predicting the next items the user will click. We cast this task as a specialized sequential recommendation task and discuss its characteristics. Our proposed recommendation model, Trans2D, is built on top of the Transformer architecture, where we further suggest a novel extended attention mechanism (Attention2D) that allows to learn complex item-item, attribute-attribute and item-attribute patterns from sequential-data with multiple item attributes. Using a large-scale watchlist dataset from eBay, we evaluate our proposed model, where we demonstrate its superiority compared to multiple state-of-the-art baselines, many of which are adapted for this task.

* International Conference on Web Search and Data Mining (WSDM), 2022

Via

Access Paper or Ask Questions

PreSizE: Predicting Size in E-Commerce using Transformers

May 04, 2021

Yotam Eshel, Or Levi, Haggai Roitman, Alexander Nus

Figure 1 for PreSizE: Predicting Size in E-Commerce using Transformers

Figure 2 for PreSizE: Predicting Size in E-Commerce using Transformers

Figure 3 for PreSizE: Predicting Size in E-Commerce using Transformers

Figure 4 for PreSizE: Predicting Size in E-Commerce using Transformers

Abstract:Recent advances in the e-commerce fashion industry have led to an exploration of novel ways to enhance buyer experience via improved personalization. Predicting a proper size for an item to recommend is an important personalization challenge, and is being studied in this work. Earlier works in this field either focused on modeling explicit buyer fitment feedback or modeling of only a single aspect of the problem (e.g., specific category, brand, etc.). More recent works proposed richer models, either content-based or sequence-based, better accounting for content-based aspects of the problem or better modeling the buyer's online journey. However, both these approaches fail in certain scenarios: either when encountering unseen items (sequence-based models) or when encountering new users (content-based models). To address the aforementioned gaps, we propose PreSizE - a novel deep learning framework which utilizes Transformers for accurate size prediction. PreSizE models the effect of both content-based attributes, such as brand and category, and the buyer's purchase history on her size preferences. Using an extensive set of experiments on a large-scale e-commerce dataset, we demonstrate that PreSizE is capable of achieving superior prediction performance compared to previous state-of-the-art baselines. By encoding item attributes, PreSizE better handles cold-start cases with unseen items, and cases where buyers have little past purchase data. As a proof of concept, we demonstrate that size predictions made by PreSizE can be effectively integrated into an existing production recommender system yielding very effective features and significantly improving recommendations.

* Accepted for publication in SIGIR'2021

Via

Access Paper or Ask Questions

Conversational Document Prediction to Assist Customer Care Agents

Oct 05, 2020

Jatin Ganhotra, Haggai Roitman, Doron Cohen, Nathaniel Mills, Chulaka Gunasekara, Yosi Mass, Sachindra Joshi, Luis Lastras, David Konopnicki

Figure 1 for Conversational Document Prediction to Assist Customer Care Agents

Figure 2 for Conversational Document Prediction to Assist Customer Care Agents

Figure 3 for Conversational Document Prediction to Assist Customer Care Agents

Figure 4 for Conversational Document Prediction to Assist Customer Care Agents

Abstract:A frequent pattern in customer care conversations is the agents responding with appropriate webpage URLs that address users' needs. We study the task of predicting the documents that customer care agents can use to facilitate users' needs. We also introduce a new public dataset which supports the aforementioned problem. Using this dataset and two others, we investigate state-of-the art deep learning (DL) and information retrieval (IR) models for the task. Additionally, we analyze the practicality of such systems in terms of inference time complexity. Our show that an hybrid IR+DL approach provides the best of both worlds.

* EMNLP 2020. The released Twitter dataset is available at: https://github.com/IBM/twitter-customer-care-document-prediction

Via

Access Paper or Ask Questions

A Study of Human Summaries of Scientific Articles

Feb 10, 2020

Odellia Boni, Guy Feigenblat, Doron Cohen, Haggai Roitman, David Konopnicki

Figure 1 for A Study of Human Summaries of Scientific Articles

Figure 2 for A Study of Human Summaries of Scientific Articles

Abstract:Researchers and students face an explosion of newly published papers which may be relevant to their work. This led to a trend of sharing human summaries of scientific papers. We analyze the summaries shared in one of these platforms Shortscience.org. The goal is to characterize human summaries of scientific papers, and use some of the insights obtained to improve and adapt existing automatic summarization systems to the domain of scientific papers.

Via

Access Paper or Ask Questions

A Summarization System for Scientific Documents

Aug 29, 2019

Shai Erera, Michal Shmueli-Scheuer, Guy Feigenblat, Ora Peled Nakash, Odellia Boni, Haggai Roitman, Doron Cohen, Bar Weiner, Yosi Mass, Or Rivlin(+8 more)

Figure 1 for A Summarization System for Scientific Documents

Figure 2 for A Summarization System for Scientific Documents

Figure 3 for A Summarization System for Scientific Documents

Abstract:We present a novel system providing summaries for Computer Science publications. Through a qualitative user study, we identified the most valuable scenarios for discovery, exploration and understanding of scientific documents. Based on these findings, we built a system that retrieves and summarizes scientific documents for a given information need, either in form of a free-text query or by choosing categorized values such as scientific tasks, datasets and more. Our system ingested 270,000 papers, and its summarization module aims to generate concise yet detailed summaries. We validated our approach with human experts.

* Accepted to EMNLP 2019

Via

Access Paper or Ask Questions

A Study of BERT for Non-Factoid Question-Answering under Passage Length Constraints

Aug 19, 2019

Yosi Mass, Haggai Roitman, Shai Erera, Or Rivlin, Bar Weiner, David Konopnicki

Figure 1 for A Study of BERT for Non-Factoid Question-Answering under Passage Length Constraints

Figure 2 for A Study of BERT for Non-Factoid Question-Answering under Passage Length Constraints

Figure 3 for A Study of BERT for Non-Factoid Question-Answering under Passage Length Constraints

Figure 4 for A Study of BERT for Non-Factoid Question-Answering under Passage Length Constraints

Abstract:We study the use of BERT for non-factoid question-answering, focusing on the passage re-ranking task under varying passage lengths. To this end, we explore the fine-tuning of BERT in different learning-to-rank setups, comprising both point-wise and pair-wise methods, resulting in substantial improvements over the state-of-the-art. We then analyze the effectiveness of BERT for different passage lengths and suggest how to cope with large passages.

Via

Access Paper or Ask Questions