Picture for Wengang Zhou

Wengang Zhou

Model Evolution Framework with Genetic Algorithm for Multi-Task Reinforcement Learning

Add code
Feb 19, 2025
Viaarxiv icon

Disentangling Length Bias In Preference Learning Via Response-Conditioned Modeling

Add code
Feb 02, 2025
Figure 1 for Disentangling Length Bias In Preference Learning Via Response-Conditioned Modeling
Figure 2 for Disentangling Length Bias In Preference Learning Via Response-Conditioned Modeling
Figure 3 for Disentangling Length Bias In Preference Learning Via Response-Conditioned Modeling
Figure 4 for Disentangling Length Bias In Preference Learning Via Response-Conditioned Modeling
Viaarxiv icon

Uni-Sign: Toward Unified Sign Language Understanding at Scale

Add code
Jan 25, 2025
Figure 1 for Uni-Sign: Toward Unified Sign Language Understanding at Scale
Figure 2 for Uni-Sign: Toward Unified Sign Language Understanding at Scale
Figure 3 for Uni-Sign: Toward Unified Sign Language Understanding at Scale
Figure 4 for Uni-Sign: Toward Unified Sign Language Understanding at Scale
Viaarxiv icon

SmartEraser: Remove Anything from Images using Masked-Region Guidance

Add code
Jan 14, 2025
Figure 1 for SmartEraser: Remove Anything from Images using Masked-Region Guidance
Figure 2 for SmartEraser: Remove Anything from Images using Masked-Region Guidance
Figure 3 for SmartEraser: Remove Anything from Images using Masked-Region Guidance
Figure 4 for SmartEraser: Remove Anything from Images using Masked-Region Guidance
Viaarxiv icon

Multi-Level Optimal Transport for Universal Cross-Tokenizer Knowledge Distillation on Language Models

Add code
Dec 19, 2024
Viaarxiv icon

RL-LLM-DT: An Automatic Decision Tree Generation Method Based on RL Evaluation and LLM Enhancement

Add code
Dec 17, 2024
Figure 1 for RL-LLM-DT: An Automatic Decision Tree Generation Method Based on RL Evaluation and LLM Enhancement
Figure 2 for RL-LLM-DT: An Automatic Decision Tree Generation Method Based on RL Evaluation and LLM Enhancement
Figure 3 for RL-LLM-DT: An Automatic Decision Tree Generation Method Based on RL Evaluation and LLM Enhancement
Figure 4 for RL-LLM-DT: An Automatic Decision Tree Generation Method Based on RL Evaluation and LLM Enhancement
Viaarxiv icon

Make-It-Animatable: An Efficient Framework for Authoring Animation-Ready 3D Characters

Add code
Nov 27, 2024
Figure 1 for Make-It-Animatable: An Efficient Framework for Authoring Animation-Ready 3D Characters
Figure 2 for Make-It-Animatable: An Efficient Framework for Authoring Animation-Ready 3D Characters
Figure 3 for Make-It-Animatable: An Efficient Framework for Authoring Animation-Ready 3D Characters
Figure 4 for Make-It-Animatable: An Efficient Framework for Authoring Animation-Ready 3D Characters
Viaarxiv icon

ROOT: VLM based System for Indoor Scene Understanding and Beyond

Add code
Nov 24, 2024
Figure 1 for ROOT: VLM based System for Indoor Scene Understanding and Beyond
Figure 2 for ROOT: VLM based System for Indoor Scene Understanding and Beyond
Figure 3 for ROOT: VLM based System for Indoor Scene Understanding and Beyond
Figure 4 for ROOT: VLM based System for Indoor Scene Understanding and Beyond
Viaarxiv icon

BoolQuestions: Does Dense Retrieval Understand Boolean Logic in Language?

Add code
Nov 19, 2024
Viaarxiv icon

Trustworthy Alignment of Retrieval-Augmented Large Language Models via Reinforcement Learning

Add code
Oct 22, 2024
Figure 1 for Trustworthy Alignment of Retrieval-Augmented Large Language Models via Reinforcement Learning
Figure 2 for Trustworthy Alignment of Retrieval-Augmented Large Language Models via Reinforcement Learning
Figure 3 for Trustworthy Alignment of Retrieval-Augmented Large Language Models via Reinforcement Learning
Figure 4 for Trustworthy Alignment of Retrieval-Augmented Large Language Models via Reinforcement Learning
Viaarxiv icon