Chandler Smith

MALT: Improving Reasoning with Multi-Agent LLM Training

Dec 02, 2024
Via arXiv

BetterBench: Assessing AI Benchmarks, Uncovering Issues, and Establishing Best Practices

Nov 20, 2024
Via arXiv

Riemannian Optimization for Non-convex Euclidean Distance Geometry with Global Recovery Guarantees

Oct 08, 2024
Via arXiv

Evaluating Language Model Character Traits

Oct 05, 2024
Via arXiv

Escalation Risks from Language Models in Military and Diplomatic Decision-Making

Jan 07, 2024
Via arXiv