Picture for Huan Wang

Huan Wang

Stephen

Plug-and-Play 1.x-Bit KV Cache Quantization for Video Large Language Models

Add code
Mar 20, 2025
Viaarxiv icon

Niagara: Normal-Integrated Geometric Affine Field for Scene Reconstruction from a Single View

Add code
Mar 16, 2025
Viaarxiv icon

Autoregressive Image Generation with Randomized Parallel Decoding

Add code
Mar 13, 2025
Viaarxiv icon

PersonaBench: Evaluating AI Models on Understanding Personal Information through Accessing (Synthetic) Private User Data

Add code
Feb 28, 2025
Viaarxiv icon

FreeBlend: Advancing Concept Blending with Staged Feedback-Driven Interpolation Diffusion

Add code
Feb 08, 2025
Viaarxiv icon

Poison as Cure: Visual Noise for Mitigating Object Hallucinations in LVMs

Add code
Jan 31, 2025
Figure 1 for Poison as Cure: Visual Noise for Mitigating Object Hallucinations in LVMs
Figure 2 for Poison as Cure: Visual Noise for Mitigating Object Hallucinations in LVMs
Figure 3 for Poison as Cure: Visual Noise for Mitigating Object Hallucinations in LVMs
Figure 4 for Poison as Cure: Visual Noise for Mitigating Object Hallucinations in LVMs
Viaarxiv icon

Dynamic Token Reduction during Generation for Vision Language Models

Add code
Jan 24, 2025
Viaarxiv icon

TACO: Learning Multi-modal Action Models with Synthetic Chains-of-Thought-and-Action

Add code
Dec 10, 2024
Figure 1 for TACO: Learning Multi-modal Action Models with Synthetic Chains-of-Thought-and-Action
Figure 2 for TACO: Learning Multi-modal Action Models with Synthetic Chains-of-Thought-and-Action
Figure 3 for TACO: Learning Multi-modal Action Models with Synthetic Chains-of-Thought-and-Action
Figure 4 for TACO: Learning Multi-modal Action Models with Synthetic Chains-of-Thought-and-Action
Viaarxiv icon

Slicing Vision Transformer for Flexible Inference

Add code
Dec 06, 2024
Viaarxiv icon

Is Oracle Pruning the True Oracle?

Add code
Nov 28, 2024
Viaarxiv icon