Picture for Najoung Kim

Najoung Kim

Is artificial intelligence still intelligence? LLMs generalize to novel adjective-noun pairs, but don't mimic the full human distribution

Add code
Oct 23, 2024
Viaarxiv icon

Generating novel experimental hypotheses from language models: A case study on cross-dative generalization

Add code
Aug 09, 2024
Viaarxiv icon

Beyond Thumbs Up/Down: Untangling Challenges of Fine-Grained Feedback for Text-to-Image Generation

Add code
Jun 24, 2024
Viaarxiv icon

Code Pretraining Improves Entity Tracking Abilities of Language Models

Add code
May 31, 2024
Viaarxiv icon

Syn-QA2: Evaluating False Assumptions in Long-tail Questions with Synthetic QA Datasets

Add code
Mar 18, 2024
Viaarxiv icon

Personas as a Way to Model Truthfulness in Language Models

Add code
Oct 30, 2023
Viaarxiv icon

SLOG: A Structural Generalization Benchmark for Semantic Parsing

Add code
Oct 23, 2023
Viaarxiv icon

Reasoning or Reciting? Exploring the Capabilities and Limitations of Language Models Through Counterfactual Tasks

Add code
Aug 01, 2023
Viaarxiv icon

Inverse Scaling: When Bigger Isn't Better

Add code
Jun 15, 2023
Viaarxiv icon

BoardgameQA: A Dataset for Natural Language Reasoning with Contradictory Information

Add code
Jun 13, 2023
Viaarxiv icon