Picture for Wenxuan Zhou

Wenxuan Zhou

MetaScale: Test-Time Scaling with Evolving Meta-Thoughts

Add code
Mar 17, 2025
Viaarxiv icon

Semantic-Clipping: Efficient Vision-Language Modeling with Semantic-Guidedd Visual Selection

Add code
Mar 14, 2025
Viaarxiv icon

ThinkGuard: Deliberative Slow Thinking Leads to Cautious Guardrails

Add code
Feb 19, 2025
Viaarxiv icon

Think Smarter not Harder: Adaptive Reasoning with Inference Aware Optimization

Add code
Jan 31, 2025
Viaarxiv icon

FocalPO: Enhancing Preference Optimizing by Focusing on Correct Preference Rankings

Add code
Jan 11, 2025
Figure 1 for FocalPO: Enhancing Preference Optimizing by Focusing on Correct Preference Rankings
Figure 2 for FocalPO: Enhancing Preference Optimizing by Focusing on Correct Preference Rankings
Figure 3 for FocalPO: Enhancing Preference Optimizing by Focusing on Correct Preference Rankings
Figure 4 for FocalPO: Enhancing Preference Optimizing by Focusing on Correct Preference Rankings
Viaarxiv icon

T-REG: Preference Optimization with Token-Level Reward Regularization

Add code
Dec 03, 2024
Viaarxiv icon

Improving Model Factuality with Fine-grained Critique-based Evaluator

Add code
Oct 24, 2024
Viaarxiv icon

Visual Manipulation with Legs

Add code
Oct 15, 2024
Viaarxiv icon

Instructional Segment Embedding: Improving LLM Safety with Instruction Hierarchy

Add code
Oct 09, 2024
Figure 1 for Instructional Segment Embedding: Improving LLM Safety with Instruction Hierarchy
Figure 2 for Instructional Segment Embedding: Improving LLM Safety with Instruction Hierarchy
Figure 3 for Instructional Segment Embedding: Improving LLM Safety with Instruction Hierarchy
Figure 4 for Instructional Segment Embedding: Improving LLM Safety with Instruction Hierarchy
Viaarxiv icon

SFTMix: Elevating Language Model Instruction Tuning with Mixup Recipe

Add code
Oct 07, 2024
Figure 1 for SFTMix: Elevating Language Model Instruction Tuning with Mixup Recipe
Figure 2 for SFTMix: Elevating Language Model Instruction Tuning with Mixup Recipe
Figure 3 for SFTMix: Elevating Language Model Instruction Tuning with Mixup Recipe
Figure 4 for SFTMix: Elevating Language Model Instruction Tuning with Mixup Recipe
Viaarxiv icon