Picture for Chengjie Wang

Chengjie Wang

Textual Decomposition Then Sub-motion-space Scattering for Open-Vocabulary Motion Generation

Add code
Nov 06, 2024
Figure 1 for Textual Decomposition Then Sub-motion-space Scattering for Open-Vocabulary Motion Generation
Figure 2 for Textual Decomposition Then Sub-motion-space Scattering for Open-Vocabulary Motion Generation
Figure 3 for Textual Decomposition Then Sub-motion-space Scattering for Open-Vocabulary Motion Generation
Figure 4 for Textual Decomposition Then Sub-motion-space Scattering for Open-Vocabulary Motion Generation
Viaarxiv icon

LLaVA-KD: A Framework of Distilling Multimodal Large Language Models

Add code
Oct 21, 2024
Viaarxiv icon

MMAD: The First-Ever Comprehensive Benchmark for Multimodal Large Language Models in Industrial Anomaly Detection

Add code
Oct 12, 2024
Viaarxiv icon

CAR: Controllable Autoregressive Modeling for Visual Generation

Add code
Oct 07, 2024
Viaarxiv icon

SVP: Style-Enhanced Vivid Portrait Talking Head Diffusion Model

Add code
Sep 05, 2024
Viaarxiv icon

Anno-incomplete Multi-dataset Detection

Add code
Aug 29, 2024
Figure 1 for Anno-incomplete Multi-dataset Detection
Figure 2 for Anno-incomplete Multi-dataset Detection
Figure 3 for Anno-incomplete Multi-dataset Detection
Figure 4 for Anno-incomplete Multi-dataset Detection
Viaarxiv icon

VTON-HandFit: Virtual Try-on for Arbitrary Hand Pose Guided by Hand Priors Embedding

Add code
Aug 27, 2024
Viaarxiv icon

DualAnoDiff: Dual-Interrelated Diffusion Model for Few-Shot Anomaly Image Generation

Add code
Aug 24, 2024
Viaarxiv icon

LLaVA-VSD: Large Language-and-Vision Assistant for Visual Spatial Description

Add code
Aug 09, 2024
Viaarxiv icon

MDT-A2G: Exploring Masked Diffusion Transformers for Co-Speech Gesture Generation

Add code
Aug 06, 2024
Viaarxiv icon