Picture for Yuang Cai

Yuang Cai

Label-Confidence-Aware Uncertainty Estimation in Natural Language Generation

Add code
Dec 10, 2024
Viaarxiv icon

Approximated Variational Bayesian Inverse Reinforcement Learning for Large Language Model Alignment

Add code
Nov 14, 2024
Viaarxiv icon