Picture for David Doermann

David Doermann

IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation

Add code
Jul 15, 2024
Viaarxiv icon

ClawMachine: Fetching Visual Tokens as An Entity for Referring and Grounding

Add code
Jun 17, 2024
Viaarxiv icon

Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation

Add code
Jun 11, 2024
Viaarxiv icon

Artemis: Towards Referential Understanding in Complex Videos

Add code
Jun 01, 2024
Viaarxiv icon

ChartReformer: Natural Language-Driven Chart Image Editing

Add code
Mar 01, 2024
Viaarxiv icon

Federated Learning via Input-Output Collaborative Distillation

Add code
Dec 22, 2023
Viaarxiv icon

The Analysis and Extraction of Structure from Organizational Charts

Add code
Nov 16, 2023
Viaarxiv icon

Player Re-Identification Using Body Part Appearences

Add code
Oct 23, 2023
Viaarxiv icon

SOAR: Scene-debiasing Open-set Action Recognition

Add code
Sep 03, 2023
Viaarxiv icon

Towards Generic Image Manipulation Detection with Weakly-Supervised Self-Consistency Learning

Add code
Sep 03, 2023
Viaarxiv icon