Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Kenneth Zheng

BASS: Block-wise Adaptation for Speech Summarization

Jul 17, 2023

Roshan Sharma, Kenneth Zheng, Siddhant Arora, Shinji Watanabe, Rita Singh, Bhiksha Raj

Figure 1 for BASS: Block-wise Adaptation for Speech Summarization

Figure 2 for BASS: Block-wise Adaptation for Speech Summarization

Figure 3 for BASS: Block-wise Adaptation for Speech Summarization

Figure 4 for BASS: Block-wise Adaptation for Speech Summarization

Abstract:End-to-end speech summarization has been shown to improve performance over cascade baselines. However, such models are difficult to train on very large inputs (dozens of minutes or hours) owing to compute restrictions and are hence trained with truncated model inputs. Truncation leads to poorer models, and a solution to this problem rests in block-wise modeling, i.e., processing a portion of the input frames at a time. In this paper, we develop a method that allows one to train summarization models on very long sequences in an incremental manner. Speech summarization is realized as a streaming process, where hypothesis summaries are updated every block based on new acoustic information. We devise and test strategies to pass semantic context across the blocks. Experiments on the How2 dataset demonstrate that the proposed block-wise training method improves by 3 points absolute on ROUGE-L over a truncated input baseline.

* Accepted at Interspeech 2023

Via

Access Paper or Ask Questions

Testing the Ability of Language Models to Interpret Figurative Language

Apr 26, 2022

Emmy Liu, Chen Cui, Kenneth Zheng, Graham Neubig

Figure 1 for Testing the Ability of Language Models to Interpret Figurative Language

Figure 2 for Testing the Ability of Language Models to Interpret Figurative Language

Figure 3 for Testing the Ability of Language Models to Interpret Figurative Language

Figure 4 for Testing the Ability of Language Models to Interpret Figurative Language

Abstract:Figurative and metaphorical language are commonplace in discourse, and figurative expressions play an important role in communication and cognition. However, figurative language has been a relatively under-studied area in NLP, and it remains an open question to what extent modern language models can interpret nonliteral phrases. To address this question, we introduce Fig-QA, a Winograd-style nonliteral language understanding task consisting of correctly interpreting paired figurative phrases with divergent meanings. We evaluate the performance of several state-of-the-art language models on this task, and find that although language models achieve performance significantly over chance, they still fall short of human performance, particularly in zero- or few-shot settings. This suggests that further work is needed to improve the nonliteral reasoning capabilities of language models.

* NAACL 2022

Via

Access Paper or Ask Questions