Picture for Paul Pu Liang

Paul Pu Liang

May

Can A Society of Generative Agents Simulate Human Behavior and Inform Public Health Policy? A Case Study on Vaccine Hesitancy

Add code
Mar 12, 2025
Viaarxiv icon

Data Foundations for Large Scale Multimodal Clinical Foundation Models

Add code
Mar 09, 2025
Viaarxiv icon

Language Models' Factuality Depends on the Language of Inquiry

Add code
Feb 25, 2025
Viaarxiv icon

MimeQA: Towards Socially-Intelligent Nonverbal Foundation Models

Add code
Feb 23, 2025
Viaarxiv icon

Understanding the Emergence of Multimodal Representation Alignment

Add code
Feb 22, 2025
Viaarxiv icon

Social Genome: Grounded Social Reasoning Abilities of Multimodal Models

Add code
Feb 21, 2025
Viaarxiv icon

VLM$^2$-Bench: A Closer Look at How Well VLMs Implicitly Link Explicit Matching Visual Cues

Add code
Feb 17, 2025
Viaarxiv icon

Group-Adaptive Threshold Optimization for Robust AI-Generated Text Detection

Add code
Feb 10, 2025
Viaarxiv icon

OS-ATLAS: A Foundation Action Model for Generalist GUI Agents

Add code
Oct 30, 2024
Figure 1 for OS-ATLAS: A Foundation Action Model for Generalist GUI Agents
Figure 2 for OS-ATLAS: A Foundation Action Model for Generalist GUI Agents
Figure 3 for OS-ATLAS: A Foundation Action Model for Generalist GUI Agents
Figure 4 for OS-ATLAS: A Foundation Action Model for Generalist GUI Agents
Viaarxiv icon

VideoWebArena: Evaluating Long Context Multimodal Agents with Video Understanding Web Tasks

Add code
Oct 24, 2024
Figure 1 for VideoWebArena: Evaluating Long Context Multimodal Agents with Video Understanding Web Tasks
Figure 2 for VideoWebArena: Evaluating Long Context Multimodal Agents with Video Understanding Web Tasks
Figure 3 for VideoWebArena: Evaluating Long Context Multimodal Agents with Video Understanding Web Tasks
Figure 4 for VideoWebArena: Evaluating Long Context Multimodal Agents with Video Understanding Web Tasks
Viaarxiv icon