Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Paul N. Bennett

PODTILE: Facilitating Podcast Episode Browsing with Auto-generated Chapters

Oct 21, 2024

Azin Ghazimatin, Ekaterina Garmash, Gustavo Penha, Kristen Sheets, Martin Achenbach, Oguz Semerci, Remi Galvez, Marcus Tannenberg, Sahitya Mantravadi, Divya Narayanan(+7 more)

Figure 1 for PODTILE: Facilitating Podcast Episode Browsing with Auto-generated Chapters

Figure 2 for PODTILE: Facilitating Podcast Episode Browsing with Auto-generated Chapters

Figure 3 for PODTILE: Facilitating Podcast Episode Browsing with Auto-generated Chapters

Figure 4 for PODTILE: Facilitating Podcast Episode Browsing with Auto-generated Chapters

Abstract:Listeners of long-form talk-audio content, such as podcast episodes, often find it challenging to understand the overall structure and locate relevant sections. A practical solution is to divide episodes into chapters--semantically coherent segments labeled with titles and timestamps. Since most episodes on our platform at Spotify currently lack creator-provided chapters, automating the creation of chapters is essential. Scaling the chapterization of podcast episodes presents unique challenges. First, episodes tend to be less structured than written texts, featuring spontaneous discussions with nuanced transitions. Second, the transcripts are usually lengthy, averaging about 16,000 tokens, which necessitates efficient processing that can preserve context. To address these challenges, we introduce PODTILE, a fine-tuned encoder-decoder transformer to segment conversational data. The model simultaneously generates chapter transitions and titles for the input transcript. To preserve context, each input text is augmented with global context, including the episode's title, description, and previous chapter titles. In our intrinsic evaluation, PODTILE achieved an 11% improvement in ROUGE score over the strongest baseline. Additionally, we provide insights into the practical benefits of auto-generated chapters for listeners navigating episode content. Our findings indicate that auto-generated chapters serve as a useful tool for engaging with less popular podcasts. Finally, we present empirical evidence that using chapter titles can enhance effectiveness of sparse retrieval in search tasks.

* 9 pages, 4 figures, CIKM industry track 2024

Via

Access Paper or Ask Questions

SummaC: Re-Visiting NLI-based Models for Inconsistency Detection in Summarization

Nov 18, 2021

Philippe Laban, Tobias Schnabel, Paul N. Bennett, Marti A. Hearst

Figure 1 for SummaC: Re-Visiting NLI-based Models for Inconsistency Detection in Summarization

Figure 2 for SummaC: Re-Visiting NLI-based Models for Inconsistency Detection in Summarization

Figure 3 for SummaC: Re-Visiting NLI-based Models for Inconsistency Detection in Summarization

Figure 4 for SummaC: Re-Visiting NLI-based Models for Inconsistency Detection in Summarization

Abstract:In the summarization domain, a key requirement for summaries is to be factually consistent with the input document. Previous work has found that natural language inference (NLI) models do not perform competitively when applied to inconsistency detection. In this work, we revisit the use of NLI for inconsistency detection, finding that past work suffered from a mismatch in input granularity between NLI datasets (sentence-level), and inconsistency detection (document level). We provide a highly effective and light-weight method called SummaCConv that enables NLI models to be successfully used for this task by segmenting documents into sentence units and aggregating scores between pairs of sentences. On our newly introduced benchmark called SummaC (Summary Consistency) consisting of six large inconsistency detection datasets, SummaCConv obtains state-of-the-art results with a balanced accuracy of 74.4%, a 5% point improvement compared to prior work. We make the models and datasets available: https://github.com/tingofurro/summac

* TACL pre-MIT Press publication version; 11 pages, 2 figures, 5 tables

Via

Access Paper or Ask Questions

Zero-Shot Dense Retrieval with Momentum Adversarial Domain Invariant Representations

Oct 14, 2021

Ji Xin, Chenyan Xiong, Ashwin Srinivasan, Ankita Sharma, Damien Jose, Paul N. Bennett

Figure 1 for Zero-Shot Dense Retrieval with Momentum Adversarial Domain Invariant Representations

Figure 2 for Zero-Shot Dense Retrieval with Momentum Adversarial Domain Invariant Representations

Figure 3 for Zero-Shot Dense Retrieval with Momentum Adversarial Domain Invariant Representations

Figure 4 for Zero-Shot Dense Retrieval with Momentum Adversarial Domain Invariant Representations

Abstract:Dense retrieval (DR) methods conduct text retrieval by first encoding texts in the embedding space and then matching them by nearest neighbor search. This requires strong locality properties from the representation space, i.e, the close allocations of each small group of relevant texts, which are hard to generalize to domains without sufficient training data. In this paper, we aim to improve the generalization ability of DR models from source training domains with rich supervision signals to target domains without any relevant labels, in the zero-shot setting. To achieve that, we propose Momentum adversarial Domain Invariant Representation learning (MoDIR), which introduces a momentum method in the DR training process to train a domain classifier distinguishing source versus target, and then adversarially updates the DR encoder to learn domain invariant representations. Our experiments show that MoDIR robustly outperforms its baselines on 10+ ranking datasets from the BEIR benchmark in the zero-shot setup, with more than 10% relative gains on datasets with enough sensitivity for DR models' evaluation. Source code of this paper will be released.

Via

Access Paper or Ask Questions

Domain-Specific Pretraining for Vertical Search: Case Study on Biomedical Literature

Jun 25, 2021

Yu Wang, Jinchao Li, Tristan Naumann, Chenyan Xiong, Hao Cheng, Robert Tinn, Cliff Wong, Naoto Usuyama, Richard Rogahn, Zhihong Shen(+5 more)

Figure 1 for Domain-Specific Pretraining for Vertical Search: Case Study on Biomedical Literature

Figure 2 for Domain-Specific Pretraining for Vertical Search: Case Study on Biomedical Literature

Figure 3 for Domain-Specific Pretraining for Vertical Search: Case Study on Biomedical Literature

Figure 4 for Domain-Specific Pretraining for Vertical Search: Case Study on Biomedical Literature

Abstract:Information overload is a prevalent challenge in many high-value domains. A prominent case in point is the explosion of the biomedical literature on COVID-19, which swelled to hundreds of thousands of papers in a matter of months. In general, biomedical literature expands by two papers every minute, totalling over a million new papers every year. Search in the biomedical realm, and many other vertical domains is challenging due to the scarcity of direct supervision from click logs. Self-supervised learning has emerged as a promising direction to overcome the annotation bottleneck. We propose a general approach for vertical search based on domain-specific pretraining and present a case study for the biomedical domain. Despite being substantially simpler and not using any relevance labels for training or development, our method performs comparably or better than the best systems in the official TREC-COVID evaluation, a COVID-related biomedical search competition. Using distributed computing in modern cloud infrastructure, our system can scale to tens of millions of articles on PubMed and has been deployed as Microsoft Biomedical Search, a new search experience for biomedical literature: https://aka.ms/biomedsearch.

Via

Access Paper or Ask Questions

Generic Intent Representation in Web Search

Jul 24, 2019

Hongfei Zhang, Xia Song, Chenyan Xiong, Corby Rosset, Paul N. Bennett, Nick Craswell, Saurabh Tiwary

Figure 1 for Generic Intent Representation in Web Search

Figure 2 for Generic Intent Representation in Web Search

Figure 3 for Generic Intent Representation in Web Search

Figure 4 for Generic Intent Representation in Web Search

Abstract:This paper presents GEneric iNtent Encoder (GEN Encoder) which learns a distributed representation space for user intent in search. Leveraging large scale user clicks from Bing search logs as weak supervision of user intent, GEN Encoder learns to map queries with shared clicks into similar embeddings end-to-end and then finetunes on multiple paraphrase tasks. Experimental results on an intrinsic evaluation task - query intent similarity modeling - demonstrate GEN Encoder's robust and significant advantages over previous representation methods. Ablation studies reveal the crucial role of learning from implicit user feedback in representing user intent and the contributions of multi-task learning in representation generality. We also demonstrate that GEN Encoder alleviates the sparsity of tail search traffic and cuts down half of the unseen queries by using an efficient approximate nearest neighbor search to effectively identify previous queries with the same search intent. Finally, we demonstrate distances between GEN encodings reflect certain information seeking behaviors in search sessions.

* SIGIR 2019: Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval

Via

Access Paper or Ask Questions

Using Shortlists to Support Decision Making and Improve Recommender System Performance

Feb 08, 2016

Tobias Schnabel, Paul N. Bennett, Susan T. Dumais, Thorsten Joachims

Figure 1 for Using Shortlists to Support Decision Making and Improve Recommender System Performance

Figure 2 for Using Shortlists to Support Decision Making and Improve Recommender System Performance

Figure 3 for Using Shortlists to Support Decision Making and Improve Recommender System Performance

Figure 4 for Using Shortlists to Support Decision Making and Improve Recommender System Performance

Abstract:In this paper, we study shortlists as an interface component for recommender systems with the dual goal of supporting the user's decision process, as well as improving implicit feedback elicitation for increased recommendation quality. A shortlist is a temporary list of candidates that the user is currently considering, e.g., a list of a few movies the user is currently considering for viewing. From a cognitive perspective, shortlists serve as digital short-term memory where users can off-load the items under consideration -- thereby decreasing their cognitive load. From a machine learning perspective, adding items to the shortlist generates a new implicit feedback signal as a by-product of exploration and decision making which can improve recommendation quality. Shortlisting therefore provides additional data for training recommendation systems without the increases in cognitive load that requesting explicit feedback would incur. We perform an user study with a movie recommendation setup to compare interfaces that offer shortlist support with those that do not. From the user studies we conclude: (i) users make better decisions with a shortlist; (ii) users prefer an interface with shortlist support; and (iii) the additional implicit feedback from sessions with a shortlist improves the quality of recommendations by nearly a factor of two.

* 11 pages in WWW 2016

Via

Access Paper or Ask Questions