Han Xia

Inverse-Q*: Token Level Reinforcement Learning for Aligning Large Language Models Without Preference Data

Aug 27, 2024
Via arXiv

EasyJailbreak: A Unified Framework for Jailbreaking Large Language Models

Mar 18, 2024
Via arXiv

RoCoIns: Enhancing Robustness of Large Language Models through Code-Style Instructions

Feb 26, 2024
Via arXiv

Orthogonal Subspace Learning for Language Model Continual Learning

Oct 22, 2023
Via arXiv

InstructUIE: Multi-task Instruction Tuning for Unified Information Extraction

Apr 17, 2023
Via arXiv