Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:WikiChat: A Few-Shot LLM-Based Chatbot Grounded with Wikipedia

May 23, 2023

Sina J. Semnani, Violet Z. Yao, Heidi C. Zhang, Monica S. Lam

Figure 1 for WikiChat: A Few-Shot LLM-Based Chatbot Grounded with Wikipedia

Figure 2 for WikiChat: A Few-Shot LLM-Based Chatbot Grounded with Wikipedia

Figure 3 for WikiChat: A Few-Shot LLM-Based Chatbot Grounded with Wikipedia

Figure 4 for WikiChat: A Few-Shot LLM-Based Chatbot Grounded with Wikipedia

Share this with someone who'll enjoy it:

Abstract:Despite recent advances in Large Language Models (LLMs), users still cannot trust the information provided in their responses. LLMs cannot speak accurately about events that occurred after their training, which are often topics of great interest to users, and, as we show in this paper, they are highly prone to hallucination when talking about less popular (tail) topics. This paper presents WikiChat, a few-shot LLM-based chatbot that is grounded with live information from Wikipedia. Through many iterations of experimentation, we have crafte a pipeline based on information retrieval that (1) uses LLMs to suggest interesting and relevant facts that are individually verified against Wikipedia, (2) retrieves additional up-to-date information, and (3) composes coherent and engaging time-aware responses. We propose a novel hybrid human-and-LLM evaluation methodology to analyze the factuality and conversationality of LLM-based chatbots. We focus on evaluating important but previously neglected issues such as conversing about recent and tail topics. We evaluate WikiChat against strong fine-tuned and LLM-based baselines across a diverse set of conversation topics. We find that WikiChat outperforms all baselines in terms of the factual accuracy of its claims, by up to 12.1%, 28.3% and 32.7% on head, recent and tail topics, while matching GPT-3.5 in terms of providing natural, relevant, non-repetitive and informational responses.

View paper on

Share this with someone who'll enjoy it:

Title:WikiChat: A Few-Shot LLM-Based Chatbot Grounded with Wikipedia

Paper and Code