Hao Tang

PartRM: Modeling Part-Level Dynamics with Large Cross-State Reconstruction Model

Mar 25, 2025
Via arXiv

HOIGPT: Learning Long Sequence Hand-Object Interaction with Language Models

Mar 24, 2025
Via arXiv

Beyond Semantics: Rediscovering Spatial Awareness in Vision-Language Models

Mar 21, 2025
Via arXiv

MambaIC: State Space Models for High-Performance Learned Image Compression

Mar 16, 2025
Via arXiv

Dynamic Scene Reconstruction: Recent Advance in Real-time Rendering and Streaming

Mar 11, 2025
Via arXiv

TR-DQ: Time-Rotation Diffusion Quantization

Mar 09, 2025
Via arXiv

OT-DETECTOR: Delving into Optimal Transport for Zero-shot Out-of-Distribution Detection

Mar 09, 2025
Via arXiv

UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Language Interface

Mar 04, 2025
Via arXiv

Improved YOLOv7x-Based Defect Detection Algorithm for Power Equipment

Feb 25, 2025
Via arXiv

Parameter Efficient Merging for Multimodal Large Language Models with Complementary Parameter Adaptation

Feb 24, 2025
Via arXiv