
Furong Huang

MIRA: Towards Mitigating Reward Hacking in Inference-Time Alignment of T2I Diffusion Models

Oct 02, 2025
Via arXiv

Zebra-CoT: A Dataset for Interleaved Vision Language Reasoning

Jul 22, 2025
Via arXiv

Reward Models Can Improve Themselves: Reward-Guided Adversarial Failure Mode Discovery for Robust Reward Modeling

Jul 08, 2025
Via arXiv

ViCrit: A Verifiable Reinforcement Learning Proxy Task for Visual Perception in VLMs

Jun 11, 2025
Via arXiv

MORSE-500: A Programmatically Controllable Video Benchmark to Stress-Test Multimodal Reasoning

Jun 05, 2025
Via arXiv

Does Thinking More always Help? Understanding Test-Time Scaling in Reasoning Models

Jun 04, 2025
Via arXiv

EnsemW2S: Enhancing Weak-to-Strong Generalization with Large Language Model Ensembles

May 28, 2025
Via arXiv

Zero-Shot Vision Encoder Grafting via LLM Surrogates

May 28, 2025
Via arXiv

Effort-aware Fairness: Incorporating a Philosophy-informed, Human-centered Notion of Effort into Algorithmic Fairness Metrics

May 25, 2025
Via arXiv

FLARE: Robot Learning with Implicit World Modeling

May 21, 2025
Via arXiv