Hieu Minh "Jord" Nguyen

Identifying Cooperative Personalities in Multi-agent Contexts through Personality Steering with Representation Engineering

Mar 17, 2025
Via arXiv

DarkBench: Benchmarking Dark Patterns in Large Language Models

Mar 13, 2025
Via arXiv

A Survey of Theory of Mind in Large Language Models: Evaluations, Representations, and Safety Risks

Feb 10, 2025
Via arXiv