Picture for Minlie Huang

Minlie Huang

EJ

Human Decision-making is Susceptible to AI-driven Manipulation

Add code
Feb 11, 2025
Viaarxiv icon

MAPS: Advancing Multi-Modal Reasoning in Expert-Level Physical Science

Add code
Jan 18, 2025
Viaarxiv icon

Enhanced Large Language Models for Effective Screening of Depression and Anxiety

Add code
Jan 15, 2025
Viaarxiv icon

VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation

Add code
Dec 30, 2024
Viaarxiv icon

LegalAgentBench: Evaluating LLM Agents in Legal Domain

Add code
Dec 23, 2024
Viaarxiv icon

Agent-SafetyBench: Evaluating the Safety of LLM Agents

Add code
Dec 19, 2024
Viaarxiv icon

Understanding the Dark Side of LLMs' Intrinsic Self-Correction

Add code
Dec 19, 2024
Viaarxiv icon

CharacterBench: Benchmarking Character Customization of Large Language Models

Add code
Dec 16, 2024
Figure 1 for CharacterBench: Benchmarking Character Customization of Large Language Models
Figure 2 for CharacterBench: Benchmarking Character Customization of Large Language Models
Figure 3 for CharacterBench: Benchmarking Character Customization of Large Language Models
Figure 4 for CharacterBench: Benchmarking Character Customization of Large Language Models
Viaarxiv icon

SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models

Add code
Dec 16, 2024
Viaarxiv icon

Seeker: Towards Exception Safety Code Generation with Intermediate Language Agents Framework

Add code
Dec 16, 2024
Viaarxiv icon