Picture for Wenxuan Zhou

Wenxuan Zhou

T-REG: Preference Optimization with Token-Level Reward Regularization

Add code
Dec 03, 2024
Viaarxiv icon

Improving Model Factuality with Fine-grained Critique-based Evaluator

Add code
Oct 24, 2024
Viaarxiv icon

Visual Manipulation with Legs

Add code
Oct 15, 2024
Viaarxiv icon

Instructional Segment Embedding: Improving LLM Safety with Instruction Hierarchy

Add code
Oct 09, 2024
Figure 1 for Instructional Segment Embedding: Improving LLM Safety with Instruction Hierarchy
Figure 2 for Instructional Segment Embedding: Improving LLM Safety with Instruction Hierarchy
Figure 3 for Instructional Segment Embedding: Improving LLM Safety with Instruction Hierarchy
Figure 4 for Instructional Segment Embedding: Improving LLM Safety with Instruction Hierarchy
Viaarxiv icon

SFTMix: Elevating Language Model Instruction Tuning with Mixup Recipe

Add code
Oct 07, 2024
Figure 1 for SFTMix: Elevating Language Model Instruction Tuning with Mixup Recipe
Figure 2 for SFTMix: Elevating Language Model Instruction Tuning with Mixup Recipe
Figure 3 for SFTMix: Elevating Language Model Instruction Tuning with Mixup Recipe
Figure 4 for SFTMix: Elevating Language Model Instruction Tuning with Mixup Recipe
Viaarxiv icon

The Perfect Blend: Redefining RLHF with Mixture of Judges

Add code
Sep 30, 2024
Figure 1 for The Perfect Blend: Redefining RLHF with Mixture of Judges
Figure 2 for The Perfect Blend: Redefining RLHF with Mixture of Judges
Figure 3 for The Perfect Blend: Redefining RLHF with Mixture of Judges
Figure 4 for The Perfect Blend: Redefining RLHF with Mixture of Judges
Viaarxiv icon

Mask-Encoded Sparsification: Mitigating Biased Gradients in Communication-Efficient Split Learning

Add code
Aug 25, 2024
Viaarxiv icon

HACMan++: Spatially-Grounded Motion Primitives for Manipulation

Add code
Jul 11, 2024
Figure 1 for HACMan++: Spatially-Grounded Motion Primitives for Manipulation
Figure 2 for HACMan++: Spatially-Grounded Motion Primitives for Manipulation
Figure 3 for HACMan++: Spatially-Grounded Motion Primitives for Manipulation
Figure 4 for HACMan++: Spatially-Grounded Motion Primitives for Manipulation
Viaarxiv icon

Improving Multilingual Instruction Finetuning via Linguistically Natural and Diverse Datasets

Add code
Jul 01, 2024
Figure 1 for Improving Multilingual Instruction Finetuning via Linguistically Natural and Diverse Datasets
Figure 2 for Improving Multilingual Instruction Finetuning via Linguistically Natural and Diverse Datasets
Figure 3 for Improving Multilingual Instruction Finetuning via Linguistically Natural and Diverse Datasets
Figure 4 for Improving Multilingual Instruction Finetuning via Linguistically Natural and Diverse Datasets
Viaarxiv icon

mDPO: Conditional Preference Optimization for Multimodal Large Language Models

Add code
Jun 17, 2024
Figure 1 for mDPO: Conditional Preference Optimization for Multimodal Large Language Models
Figure 2 for mDPO: Conditional Preference Optimization for Multimodal Large Language Models
Figure 3 for mDPO: Conditional Preference Optimization for Multimodal Large Language Models
Figure 4 for mDPO: Conditional Preference Optimization for Multimodal Large Language Models
Viaarxiv icon