Picture for Huaibo Huang

Huaibo Huang

Random Wins All: Rethinking Grouping Strategies for Vision Tokens

Add code
Feb 28, 2026
Viaarxiv icon

BitDance: Scaling Autoregressive Generative Models with Binary Tokens

Add code
Feb 15, 2026
Viaarxiv icon

UniWeTok: An Unified Binary Tokenizer with Codebook Size $\mathit{2^{128}}$ for Unified Multimodal Large Language Model

Add code
Feb 15, 2026
Viaarxiv icon

Expand and Prune: Maximizing Trajectory Diversity for Effective GRPO in Generative Models

Add code
Dec 17, 2025
Viaarxiv icon

HAD: Hybrid Architecture Distillation Outperforms Teacher in Genomic Sequence Modeling

Add code
May 27, 2025
Viaarxiv icon

T^2Agent A Tool-augmented Multimodal Misinformation Detection Agent with Monte Carlo Tree Search

Add code
May 26, 2025
Viaarxiv icon

Breaking Complexity Barriers: High-Resolution Image Restoration with Rank Enhanced Linear Attention

Add code
May 22, 2025
Viaarxiv icon

Unlocking the Potential of Difficulty Prior in RL-based Multimodal Reasoning

Add code
May 19, 2025
Figure 1 for Unlocking the Potential of Difficulty Prior in RL-based Multimodal Reasoning
Figure 2 for Unlocking the Potential of Difficulty Prior in RL-based Multimodal Reasoning
Figure 3 for Unlocking the Potential of Difficulty Prior in RL-based Multimodal Reasoning
Figure 4 for Unlocking the Potential of Difficulty Prior in RL-based Multimodal Reasoning
Viaarxiv icon

NOFT: Test-Time Noise Finetune via Information Bottleneck for Highly Correlated Asset Creation

Add code
May 18, 2025
Viaarxiv icon

Video-SafetyBench: A Benchmark for Safety Evaluation of Video LVLMs

Add code
May 17, 2025
Viaarxiv icon