Picture for Zikai Song

Zikai Song

IP-MOT: Instance Prompt Learning for Cross-Domain Multi-Object Tracking

Add code
Oct 30, 2024
Viaarxiv icon

Autogenic Language Embedding for Coherent Point Tracking

Add code
Jul 30, 2024
Viaarxiv icon

Coupled Mamba: Enhanced Multi-modal Fusion with Coupled State Space Model

Add code
May 29, 2024
Viaarxiv icon

DEEM: Diffusion Models Serve as the Eyes of Large Language Models for Image Perception

Add code
May 24, 2024
Figure 1 for DEEM: Diffusion Models Serve as the Eyes of Large Language Models for Image Perception
Figure 2 for DEEM: Diffusion Models Serve as the Eyes of Large Language Models for Image Perception
Figure 3 for DEEM: Diffusion Models Serve as the Eyes of Large Language Models for Image Perception
Figure 4 for DEEM: Diffusion Models Serve as the Eyes of Large Language Models for Image Perception
Viaarxiv icon

EfficientGS: Streamlining Gaussian Splatting for Large-Scale High-Resolution Scene Representation

Add code
Apr 19, 2024
Viaarxiv icon

AMD:Anatomical Motion Diffusion with Interpretable Motion Decomposition and Fusion

Add code
Dec 21, 2023
Viaarxiv icon

Optimized View and Geometry Distillation from Multi-view Diffuser

Add code
Dec 17, 2023
Viaarxiv icon

Fine-grained Appearance Transfer with Diffusion Models

Add code
Nov 27, 2023
Viaarxiv icon

Progressive Text-to-Image Diffusion with Soft Latent Direction

Add code
Sep 18, 2023
Viaarxiv icon

DiffusionTrack: Diffusion Model For Multi-Object Tracking

Add code
Aug 19, 2023
Viaarxiv icon