Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Harry Li

Mitigating LLM Hallucinations with Knowledge Graphs: A Case Study

Apr 16, 2025

Harry Li, Gabriel Appleby, Kenneth Alperin, Steven R Gomez, Ashley Suh

Abstract:High-stakes domains like cyber operations need responsible and trustworthy AI methods. While large language models (LLMs) are becoming increasingly popular in these domains, they still suffer from hallucinations. This research paper provides learning outcomes from a case study with LinkQ, an open-source natural language interface that was developed to combat hallucinations by forcing an LLM to query a knowledge graph (KG) for ground-truth data during question-answering (QA). We conduct a quantitative evaluation of LinkQ using a well-known KGQA dataset, showing that the system outperforms GPT-4 but still struggles with certain question categories - suggesting that alternative query construction strategies will need to be investigated in future LLM querying systems. We discuss a qualitative study of LinkQ with two domain experts using a real-world cybersecurity KG, outlining these experts' feedback, suggestions, perceived limitations, and future opportunities for systems like LinkQ.

* Presented at the Human-centered Explainable AI Workshop (HCXAI) @ CHI 2025

Via

Access Paper or Ask Questions

Don't Just Translate, Agitate: Using Large Language Models as Devil's Advocates for AI Explanations

Apr 16, 2025

Ashley Suh, Kenneth Alperin, Harry Li, Steven R Gomez

Abstract:This position paper highlights a growing trend in Explainable AI (XAI) research where Large Language Models (LLMs) are used to translate outputs from explainability techniques, like feature-attribution weights, into a natural language explanation. While this approach may improve accessibility or readability for users, recent findings suggest that translating into human-like explanations does not necessarily enhance user understanding and may instead lead to overreliance on AI systems. When LLMs summarize XAI outputs without surfacing model limitations, uncertainties, or inconsistencies, they risk reinforcing the illusion of interpretability rather than fostering meaningful transparency. We argue that - instead of merely translating XAI outputs - LLMs should serve as constructive agitators, or devil's advocates, whose role is to actively interrogate AI explanations by presenting alternative interpretations, potential biases, training data limitations, and cases where the model's reasoning may break down. In this role, LLMs can facilitate users in engaging critically with AI systems and generated explanations, with the potential to reduce overreliance caused by misinterpreted or specious explanations.

* Presented at the Human-centered Explainable AI Workshop (HCXAI) @ CHI 2025

Via

Access Paper or Ask Questions

LinkQ: An LLM-Assisted Visual Interface for Knowledge Graph Question-Answering

Jun 07, 2024

Harry Li, Gabriel Appleby, Ashley Suh

Abstract:We present LinkQ, a system that leverages a large language model (LLM) to facilitate knowledge graph (KG) query construction through natural language question-answering. Traditional approaches often require detailed knowledge of complex graph querying languages, limiting the ability for users -- even experts -- to acquire valuable insights from KG data. LinkQ simplifies this process by first interpreting a user's question, then converting it into a well-formed KG query. By using the LLM to construct a query instead of directly answering the user's question, LinkQ guards against the LLM hallucinating or generating false, erroneous information. By integrating an LLM into LinkQ, users are able to conduct both exploratory and confirmatory data analysis, with the LLM helping to iteratively refine open-ended questions into precise ones. To demonstrate the efficacy of LinkQ, we conducted a qualitative study with five KG practitioners and distill their feedback. Our results indicate that practitioners find LinkQ effective for KG question-answering, and desire future LLM-assisted systems for the exploratory analysis of graph databases.

Via

Access Paper or Ask Questions

ScatterUQ: Interactive Uncertainty Visualizations for Multiclass Deep Learning Problems

Aug 08, 2023

Harry Li, Steven Jorgensen, John Holodnak, Allan Wollaber

Abstract:Recently, uncertainty-aware deep learning methods for multiclass labeling problems have been developed that provide calibrated class prediction probabilities and out-of-distribution (OOD) indicators, letting machine learning (ML) consumers and engineers gauge a model's confidence in its predictions. However, this extra neural network prediction information is challenging to scalably convey visually for arbitrary data sources under multiple uncertainty contexts. To address these challenges, we present ScatterUQ, an interactive system that provides targeted visualizations to allow users to better understand model performance in context-driven uncertainty settings. ScatterUQ leverages recent advances in distance-aware neural networks, together with dimensionality reduction techniques, to construct robust, 2-D scatter plots explaining why a model predicts a test example to be (1) in-distribution and of a particular class, (2) in-distribution but unsure of the class, and (3) out-of-distribution. ML consumers and engineers can visually compare the salient features of test samples with training examples through the use of a ``hover callback'' to understand model uncertainty performance and decide follow up courses of action. We demonstrate the effectiveness of ScatterUQ to explain model uncertainty for a multiclass image classification on a distance-aware neural network trained on Fashion-MNIST and tested on Fashion-MNIST (in distribution) and MNIST digits (out of distribution), as well as a deep learning model for a cyber dataset. We quantitatively evaluate dimensionality reduction techniques to optimize our contextually driven UQ visualizations. Our results indicate that the ScatterUQ system should scale to arbitrary, multiclass datasets. Our code is available at https://github.com/mit-ll-responsible-ai/equine-webapp

* 5 pages, 4 figures, accepted to IEEE VIS 2023

Via

Access Paper or Ask Questions

Characterizing the Users, Challenges, and Visualization Needs of Knowledge Graphs in Practice

Apr 06, 2023

Harry Li, Gabriel Appleby, Camelia Daniela Brumar, Remco Chang, Ashley Suh

Figure 1 for Characterizing the Users, Challenges, and Visualization Needs of Knowledge Graphs in Practice

Figure 2 for Characterizing the Users, Challenges, and Visualization Needs of Knowledge Graphs in Practice

Figure 3 for Characterizing the Users, Challenges, and Visualization Needs of Knowledge Graphs in Practice

Figure 4 for Characterizing the Users, Challenges, and Visualization Needs of Knowledge Graphs in Practice

Abstract:This study presents insights from interviews with nineteen Knowledge Graph (KG) practitioners who work in both enterprise and academic settings on a wide variety of use cases. Through this study, we identify critical challenges experienced by KG practitioners when creating, exploring, and analyzing KGs that could be alleviated through visualization design. Our findings reveal three major personas among KG practitioners - KG Builders, Analysts, and Consumers - each of whom have their own distinct expertise and needs. We discover that KG Builders would benefit from schema enforcers, while KG Analysts need customizable query builders that provide interim query results. For KG Consumers, we identify a lack of efficacy for node-link diagrams, and the need for tailored domain-specific visualizations to promote KG adoption and comprehension. Lastly, we find that implementing KGs effectively in practice requires both technical and social solutions that are not addressed with current tools, technologies, and collaborative workflows. From the analysis of our interviews, we distill several visualization research directions to improve KG usability, including knowledge cards that balance digestibility and discoverability, timeline views to track temporal changes, interfaces that support organic discovery, and semantic explanations for AI and machine learning predictions.

Via

Access Paper or Ask Questions

Harry Li

Mitigating LLM Hallucinations with Knowledge Graphs: A Case Study

Don't Just Translate, Agitate: Using Large Language Models as Devil's Advocates for AI Explanations

More Questions than Answers? Lessons from Integrating Explainable AI into a Cyber-AI Tool

LinkQ: An LLM-Assisted Visual Interface for Knowledge Graph Question-Answering

ScatterUQ: Interactive Uncertainty Visualizations for Multiclass Deep Learning Problems

Characterizing the Users, Challenges, and Visualization Needs of Knowledge Graphs in Practice