Picture for Been Kim

Been Kim

How new data permeates LLM knowledge and how to dilute it

Add code
Apr 13, 2025
Viaarxiv icon

QuestBench: Can LLMs ask the right question to acquire information in reasoning tasks?

Add code
Mar 28, 2025
Viaarxiv icon

We Can't Understand AI Using our Existing Vocabulary

Add code
Feb 11, 2025
Viaarxiv icon

Proactive Agents for Multi-Turn Text-to-Image Generation Under Uncertainty

Add code
Dec 09, 2024
Viaarxiv icon

Getting aligned on representational alignment

Add code
Nov 02, 2023
Viaarxiv icon

Bridging the Human-AI Knowledge Gap: Concept Discovery and Transfer in AlphaZero

Add code
Oct 25, 2023
Viaarxiv icon

State2Explanation: Concept-Based Explanations to Benefit Agent Learning and User Understanding

Add code
Sep 21, 2023
Viaarxiv icon

Don't trust your eyes: on the reliability of feature visualizations

Add code
Jun 21, 2023
Viaarxiv icon

Gaussian Process Probes (GPP) for Uncertainty-Aware Probing

Add code
May 29, 2023
Figure 1 for Gaussian Process Probes (GPP) for Uncertainty-Aware Probing
Figure 2 for Gaussian Process Probes (GPP) for Uncertainty-Aware Probing
Figure 3 for Gaussian Process Probes (GPP) for Uncertainty-Aware Probing
Figure 4 for Gaussian Process Probes (GPP) for Uncertainty-Aware Probing
Viaarxiv icon

Model evaluation for extreme risks

Add code
May 24, 2023
Figure 1 for Model evaluation for extreme risks
Figure 2 for Model evaluation for extreme risks
Figure 3 for Model evaluation for extreme risks
Figure 4 for Model evaluation for extreme risks
Viaarxiv icon