Tu Trinh

Getting By Goal Misgeneralization With a Little Help From a Mentor

Oct 28, 2024

Softmax Probabilities (Mostly) Predict Large Language Model Correctness on Multiple-Choice Q&A

Feb 20, 2024

A StrongREJECT for Empty Jailbreaks

Feb 15, 2024

Autonomous Assessment of Demonstration Sufficiency via Bayesian Inverse Reinforcement Learning

Nov 29, 2022

Efficient Game-Theoretic Planning with Prediction Heuristic for Socially-Compliant Autonomous Driving

Jul 08, 2022