Picture for Shujian Zhang

Shujian Zhang

T-REG: Preference Optimization with Token-Level Reward Regularization

Add code
Dec 03, 2024
Viaarxiv icon

Instructional Segment Embedding: Improving LLM Safety with Instruction Hierarchy

Add code
Oct 09, 2024
Figure 1 for Instructional Segment Embedding: Improving LLM Safety with Instruction Hierarchy
Figure 2 for Instructional Segment Embedding: Improving LLM Safety with Instruction Hierarchy
Figure 3 for Instructional Segment Embedding: Improving LLM Safety with Instruction Hierarchy
Figure 4 for Instructional Segment Embedding: Improving LLM Safety with Instruction Hierarchy
Viaarxiv icon

SFTMix: Elevating Language Model Instruction Tuning with Mixup Recipe

Add code
Oct 07, 2024
Figure 1 for SFTMix: Elevating Language Model Instruction Tuning with Mixup Recipe
Figure 2 for SFTMix: Elevating Language Model Instruction Tuning with Mixup Recipe
Figure 3 for SFTMix: Elevating Language Model Instruction Tuning with Mixup Recipe
Figure 4 for SFTMix: Elevating Language Model Instruction Tuning with Mixup Recipe
Viaarxiv icon

Score Forgetting Distillation: A Swift, Data-Free Method for Machine Unlearning in Diffusion Models

Add code
Sep 17, 2024
Figure 1 for Score Forgetting Distillation: A Swift, Data-Free Method for Machine Unlearning in Diffusion Models
Figure 2 for Score Forgetting Distillation: A Swift, Data-Free Method for Machine Unlearning in Diffusion Models
Figure 3 for Score Forgetting Distillation: A Swift, Data-Free Method for Machine Unlearning in Diffusion Models
Figure 4 for Score Forgetting Distillation: A Swift, Data-Free Method for Machine Unlearning in Diffusion Models
Viaarxiv icon

WPO: Enhancing RLHF with Weighted Preference Optimization

Add code
Jun 17, 2024
Viaarxiv icon

Statistical Advantages of Perturbing Cosine Router in Sparse Mixture of Experts

Add code
May 23, 2024
Figure 1 for Statistical Advantages of Perturbing Cosine Router in Sparse Mixture of Experts
Figure 2 for Statistical Advantages of Perturbing Cosine Router in Sparse Mixture of Experts
Figure 3 for Statistical Advantages of Perturbing Cosine Router in Sparse Mixture of Experts
Figure 4 for Statistical Advantages of Perturbing Cosine Router in Sparse Mixture of Experts
Viaarxiv icon

Switchable Decision: Dynamic Neural Generation Networks

Add code
May 07, 2024
Viaarxiv icon

Language Rectified Flow: Advancing Diffusion Language Generation with Probabilistic Flows

Add code
Mar 25, 2024
Viaarxiv icon

Sliced Wasserstein with Random-Path Projecting Directions

Add code
Jan 29, 2024
Viaarxiv icon

Preference-grounded Token-level Guidance for Language Model Fine-tuning

Add code
Jun 01, 2023
Viaarxiv icon