Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:OrchestraLLM: Efficient Orchestration of Language Models for Dialogue State Tracking

Nov 16, 2023

Chia-Hsuan Lee, Hao Cheng, Mari Ostendorf

Figure 1 for OrchestraLLM: Efficient Orchestration of Language Models for Dialogue State Tracking

Figure 2 for OrchestraLLM: Efficient Orchestration of Language Models for Dialogue State Tracking

Figure 3 for OrchestraLLM: Efficient Orchestration of Language Models for Dialogue State Tracking

Figure 4 for OrchestraLLM: Efficient Orchestration of Language Models for Dialogue State Tracking

Share this with someone who'll enjoy it:

Abstract:Large language models (LLMs) have revolutionized the landscape of Natural Language Processing systems, but are computationally expensive. To reduce the cost without sacrificing performance, previous studies have explored various approaches to harness the potential of Small Language Models (SLMs) as cost-effective alternatives to their larger counterparts. Driven by findings that SLMs and LLMs exhibit complementary strengths in a structured knowledge extraction task, this work presents a novel SLM/LLM routing framework designed to improve computational efficiency and enhance task performance. First, exemplar pools are created to represent the types of contexts where each LM provides a more reliable answer, leveraging a sentence embedding fine-tuned so that context similarity is close to dialogue state similarity. Then, during inference, the k-nearest exemplars to the testing instance are retrieved, and the instance is routed according to majority vote. In dialogue state tracking tasks, the proposed routing framework enhances performance substantially compared to relying solely on LLMs, while reducing the computational costs by over 50%.

View paper on

Share this with someone who'll enjoy it:

Title:OrchestraLLM: Efficient Orchestration of Language Models for Dialogue State Tracking

Paper and Code