Picture for Qiming Ge

Qiming Ge

Inverse-Q*: Token Level Reinforcement Learning for Aligning Large Language Models Without Preference Data

Add code
Aug 27, 2024
Viaarxiv icon

Navigating the OverKill in Large Language Models

Add code
Jan 31, 2024
Viaarxiv icon

Linear Alignment: A Closed-form Solution for Aligning Human Preferences without Tuning and Feedback

Add code
Jan 21, 2024
Viaarxiv icon

Orthogonal Subspace Learning for Language Model Continual Learning

Add code
Oct 22, 2023
Viaarxiv icon