Picture for Jesse Thomason

Jesse Thomason

University of Southern California

Evaluating Creativity and Deception in Large Language Models: A Simulation Framework for Multi-Agent Balderdash

Add code
Nov 15, 2024
Figure 1 for Evaluating Creativity and Deception in Large Language Models: A Simulation Framework for Multi-Agent Balderdash
Figure 2 for Evaluating Creativity and Deception in Large Language Models: A Simulation Framework for Multi-Agent Balderdash
Figure 3 for Evaluating Creativity and Deception in Large Language Models: A Simulation Framework for Multi-Agent Balderdash
Figure 4 for Evaluating Creativity and Deception in Large Language Models: A Simulation Framework for Multi-Agent Balderdash
Viaarxiv icon

The American Sign Language Knowledge Graph: Infusing ASL Models with Linguistic Knowledge

Add code
Nov 06, 2024
Viaarxiv icon

Generating Contextually-Relevant Navigation Instructions for Blind and Low Vision People

Add code
Jul 11, 2024
Viaarxiv icon

Contrast Sets for Evaluating Language-Guided Robot Policies

Add code
Jun 19, 2024
Viaarxiv icon

When Parts are Greater Than Sums: Individual LLM Components Can Outperform Full Models

Add code
Jun 19, 2024
Viaarxiv icon

Language Models can Infer Action Semantics for Classical Planners from Environment Feedback

Add code
Jun 04, 2024
Viaarxiv icon

TwoStep: Multi-agent Task Planning using Classical Planners and Large Language Models

Add code
Mar 25, 2024
Viaarxiv icon

ViSaRL: Visual Reinforcement Learning Guided by Human Saliency

Add code
Mar 16, 2024
Figure 1 for ViSaRL: Visual Reinforcement Learning Guided by Human Saliency
Figure 2 for ViSaRL: Visual Reinforcement Learning Guided by Human Saliency
Figure 3 for ViSaRL: Visual Reinforcement Learning Guided by Human Saliency
Figure 4 for ViSaRL: Visual Reinforcement Learning Guided by Human Saliency
Viaarxiv icon

Selective "Selective Prediction": Reducing Unnecessary Abstention in Vision-Language Reasoning

Add code
Feb 23, 2024
Viaarxiv icon

WinoViz: Probing Visual Properties of Objects Under Different States

Add code
Feb 21, 2024
Viaarxiv icon