Picture for Zhuokai Zhao

Zhuokai Zhao

From Uncertainty to Trust: Enhancing Reliability in Vision-Language Models with Uncertainty-Guided Dropout Decoding

Add code
Dec 09, 2024
Viaarxiv icon

Beyond Training: Dynamic Token Merging for Zero-Shot Video Understanding

Add code
Nov 21, 2024
Viaarxiv icon

Preference Optimization with Multi-Sample Comparisons

Add code
Oct 16, 2024
Viaarxiv icon

Quantifying Generalization Complexity for Large Language Models

Add code
Oct 02, 2024
Viaarxiv icon

MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?

Add code
Jul 05, 2024
Figure 1 for MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?
Figure 2 for MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?
Figure 3 for MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?
Figure 4 for MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?
Viaarxiv icon

RankCLIP: Ranking-Consistent Language-Image Pretraining

Add code
Apr 15, 2024
Viaarxiv icon

HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding

Add code
Mar 01, 2024
Viaarxiv icon

AutoPRM: Automating Procedural Supervision for Multi-Step Reasoning via Controllable Question Decomposition

Add code
Feb 18, 2024
Viaarxiv icon

Direct Acquisition Optimization for Low-Budget Active Learning

Add code
Feb 08, 2024
Figure 1 for Direct Acquisition Optimization for Low-Budget Active Learning
Figure 2 for Direct Acquisition Optimization for Low-Budget Active Learning
Figure 3 for Direct Acquisition Optimization for Low-Budget Active Learning
Figure 4 for Direct Acquisition Optimization for Low-Budget Active Learning
Viaarxiv icon

RELAX: Reinforcement Learning Enabled 2D-LiDAR Autonomous System for Parsimonious UAVs

Add code
Sep 15, 2023
Viaarxiv icon