Picture for Alane Suhr

Alane Suhr

Evaluating Model Perception of Color Illusions in Photorealistic Scenes

Add code
Dec 09, 2024
Viaarxiv icon

Using Language Models to Disambiguate Lexical Choices in Translation

Add code
Nov 08, 2024
Viaarxiv icon

Grounding Language in Multi-Perspective Referential Communication

Add code
Oct 04, 2024
Viaarxiv icon

DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning

Add code
Jun 14, 2024
Viaarxiv icon

Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning

Add code
May 17, 2024
Figure 1 for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
Figure 2 for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
Figure 3 for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
Figure 4 for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
Viaarxiv icon

Autonomous Evaluation and Refinement of Digital Agents

Add code
Apr 10, 2024
Viaarxiv icon

UNcommonsense Reasoning: Abductive Reasoning about Uncommon Situations

Add code
Nov 14, 2023
Viaarxiv icon

What's In My Big Data?

Add code
Oct 31, 2023
Figure 1 for What's In My Big Data?
Figure 2 for What's In My Big Data?
Figure 3 for What's In My Big Data?
Figure 4 for What's In My Big Data?
Viaarxiv icon

Quantifying Language Models' Sensitivity to Spurious Features in Prompt Design or: How I learned to start worrying about prompt formatting

Add code
Oct 17, 2023
Viaarxiv icon

Fine-Grained Human Feedback Gives Better Rewards for Language Model Training

Add code
Jun 02, 2023
Viaarxiv icon