Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Align-SLM: Textless Spoken Language Models with Reinforcement Learning from AI Feedback

Nov 04, 2024

Guan-Ting Lin, Prashanth Gurunath Shivakumar, Aditya Gourav, Yile Gu, Ankur Gandhe, Hung-yi Lee, Ivan Bulyko

Figure 1 for Align-SLM: Textless Spoken Language Models with Reinforcement Learning from AI Feedback

Figure 2 for Align-SLM: Textless Spoken Language Models with Reinforcement Learning from AI Feedback

Figure 3 for Align-SLM: Textless Spoken Language Models with Reinforcement Learning from AI Feedback

Figure 4 for Align-SLM: Textless Spoken Language Models with Reinforcement Learning from AI Feedback

Share this with someone who'll enjoy it:

Abstract:While textless Spoken Language Models (SLMs) have shown potential in end-to-end speech-to-speech modeling, they still lag behind text-based Large Language Models (LLMs) in terms of semantic coherence and relevance. This work introduces the Align-SLM framework, which leverages preference optimization inspired by Reinforcement Learning with AI Feedback (RLAIF) to enhance the semantic understanding of SLMs. Our approach generates multiple speech continuations from a given prompt and uses semantic metrics to create preference data for Direct Preference Optimization (DPO). We evaluate the framework using ZeroSpeech 2021 benchmarks for lexical and syntactic modeling, the spoken version of the StoryCloze dataset for semantic coherence, and other speech generation metrics, including the GPT4-o score and human evaluation. Experimental results show that our method achieves state-of-the-art performance for SLMs on most benchmarks, highlighting the importance of preference optimization to improve the semantics of SLMs.

View paper on

Share this with someone who'll enjoy it:

Title:Align-SLM: Textless Spoken Language Models with Reinforcement Learning from AI Feedback

Paper and Code