Jennifer Hu

Privileged Self-Access Matters for Introspection in AI

Aug 20, 2025
Via arXiv

Inside you are many wolves: Using cognitive models to interpret value trade-offs in LLMs

Jun 25, 2025
Via arXiv

A suite of LMs comprehend puzzle statements as well as humans

May 13, 2025
Via arXiv

Linking forward-pass dynamics in Transformers and real-time human processing

Apr 18, 2025
Via arXiv

Language Models Fail to Introspect About Their Knowledge of Language

Mar 10, 2025
Via arXiv

Re-evaluating Theory of Mind evaluation in large language models

Feb 28, 2025
Via arXiv

Shades of Zero: Distinguishing Impossibility from Inconceivability

Feb 27, 2025
Via arXiv

One fish, two fish, but not the whole sea: Alignment reduces language models' conceptual diversity

Nov 07, 2024
Via arXiv

Elements of World Knowledge (EWOK): A cognition-inspired framework for evaluating basic world knowledge in language models

May 15, 2024
Via arXiv

Auxiliary task demands mask the capabilities of smaller language models

Apr 03, 2024
Via arXiv