Picture for Qinhong Lin

Qinhong Lin

Label-Confidence-Aware Uncertainty Estimation in Natural Language Generation

Add code
Dec 10, 2024
Viaarxiv icon

Approximated Variational Bayesian Inverse Reinforcement Learning for Large Language Model Alignment

Add code
Nov 14, 2024
Viaarxiv icon