Picture for Weiming Hu

Weiming Hu

VQ-Map: Bird's-Eye-View Map Layout Estimation in Tokenized Discrete Space via Vector Quantization

Add code
Nov 03, 2024
Viaarxiv icon

Token Caching for Diffusion Transformer Acceleration

Add code
Sep 27, 2024
Viaarxiv icon

MobileIQA: Exploiting Mobile-level Diverse Opinion Network For No-Reference Image Quality Assessment Using Knowledge Distillation

Add code
Sep 02, 2024
Viaarxiv icon

Content-decoupled Contrastive Learning-based Implicit Degradation Modeling for Blind Image Super-Resolution

Add code
Aug 10, 2024
Viaarxiv icon

vTensor: Flexible Virtual Tensor Management for Efficient LLM Serving

Add code
Jul 22, 2024
Figure 1 for vTensor: Flexible Virtual Tensor Management for Efficient LLM Serving
Figure 2 for vTensor: Flexible Virtual Tensor Management for Efficient LLM Serving
Figure 3 for vTensor: Flexible Virtual Tensor Management for Efficient LLM Serving
Figure 4 for vTensor: Flexible Virtual Tensor Management for Efficient LLM Serving
Viaarxiv icon

MIBench: Evaluating Multimodal Large Language Models over Multiple Images

Add code
Jul 21, 2024
Viaarxiv icon

Temporal Correlation Meets Embedding: Towards a 2nd Generation of JDE-based Real-Time Multi-Object Tracking

Add code
Jul 19, 2024
Viaarxiv icon

Animate3D: Animating Any 3D Model with Multi-view Video Diffusion

Add code
Jul 16, 2024
Figure 1 for Animate3D: Animating Any 3D Model with Multi-view Video Diffusion
Figure 2 for Animate3D: Animating Any 3D Model with Multi-view Video Diffusion
Figure 3 for Animate3D: Animating Any 3D Model with Multi-view Video Diffusion
Figure 4 for Animate3D: Animating Any 3D Model with Multi-view Video Diffusion
Viaarxiv icon

How to Make Cross Encoder a Good Teacher for Efficient Image-Text Retrieval?

Add code
Jul 10, 2024
Viaarxiv icon

EA-VTR: Event-Aware Video-Text Retrieval

Add code
Jul 10, 2024
Viaarxiv icon