Picture for Kai Han

Kai Han

Needle Threading: Can LLMs Follow Threads through Near-Million-Scale Haystacks?

Add code
Nov 07, 2024
Viaarxiv icon

BiGR: Harnessing Binary Latent Codes for Image Generation and Improved Visual Representation Capabilities

Add code
Oct 18, 2024
Viaarxiv icon

Free Video-LLM: Prompt-guided Visual Perception for Efficient Training-free Video LLMs

Add code
Oct 14, 2024
Viaarxiv icon

AvatarGO: Zero-shot 4D Human-Object Interaction Generation and Animation

Add code
Oct 09, 2024
Figure 1 for AvatarGO: Zero-shot 4D Human-Object Interaction Generation and Animation
Figure 2 for AvatarGO: Zero-shot 4D Human-Object Interaction Generation and Animation
Figure 3 for AvatarGO: Zero-shot 4D Human-Object Interaction Generation and Animation
Figure 4 for AvatarGO: Zero-shot 4D Human-Object Interaction Generation and Animation
Viaarxiv icon

CusConcept: Customized Visual Concept Decomposition with Diffusion Models

Add code
Oct 01, 2024
Viaarxiv icon

Dissecting Out-of-Distribution Detection and Open-Set Recognition: A Critical Analysis of Methods and Benchmarks

Add code
Aug 30, 2024
Viaarxiv icon

GRAB: A Challenging GRaph Analysis Benchmark for Large Multimodal Models

Add code
Aug 21, 2024
Figure 1 for GRAB: A Challenging GRaph Analysis Benchmark for Large Multimodal Models
Figure 2 for GRAB: A Challenging GRaph Analysis Benchmark for Large Multimodal Models
Figure 3 for GRAB: A Challenging GRaph Analysis Benchmark for Large Multimodal Models
Figure 4 for GRAB: A Challenging GRaph Analysis Benchmark for Large Multimodal Models
Viaarxiv icon

Token Compensator: Altering Inference Cost of Vision Transformer without Re-Tuning

Add code
Aug 13, 2024
Viaarxiv icon

HiLo: A Learning Framework for Generalized Category Discovery Robust to Domain Shifts

Add code
Aug 08, 2024
Viaarxiv icon

LatentArtiFusion: An Effective and Efficient Histological Artifacts Restoration Framework

Add code
Jul 29, 2024
Viaarxiv icon