Jesse Thomason

University of Southern California

Evaluating Creativity and Deception in Large Language Models: A Simulation Framework for Multi-Agent Balderdash

Nov 15, 2024
Via arXiv

The American Sign Language Knowledge Graph: Infusing ASL Models with Linguistic Knowledge

Nov 06, 2024
Via arXiv

Generating Contextually-Relevant Navigation Instructions for Blind and Low Vision People

Jul 11, 2024
Via arXiv

When Parts are Greater Than Sums: Individual LLM Components Can Outperform Full Models

Jun 19, 2024
Via arXiv

Contrast Sets for Evaluating Language-Guided Robot Policies

Jun 19, 2024
Via arXiv

Language Models can Infer Action Semantics for Classical Planners from Environment Feedback

Jun 04, 2024
Via arXiv

TwoStep: Multi-agent Task Planning using Classical Planners and Large Language Models

Mar 25, 2024
Via arXiv

ViSaRL: Visual Reinforcement Learning Guided by Human Saliency

Mar 16, 2024
Via arXiv

Selective "Selective Prediction": Reducing Unnecessary Abstention in Vision-Language Reasoning

Feb 23, 2024
Via arXiv

WinoViz: Probing Visual Properties of Objects Under Different States

Feb 21, 2024
Via arXiv