Picture for Yabiao Wang

Yabiao Wang

Textual Decomposition Then Sub-motion-space Scattering for Open-Vocabulary Motion Generation

Add code
Nov 06, 2024
Figure 1 for Textual Decomposition Then Sub-motion-space Scattering for Open-Vocabulary Motion Generation
Figure 2 for Textual Decomposition Then Sub-motion-space Scattering for Open-Vocabulary Motion Generation
Figure 3 for Textual Decomposition Then Sub-motion-space Scattering for Open-Vocabulary Motion Generation
Figure 4 for Textual Decomposition Then Sub-motion-space Scattering for Open-Vocabulary Motion Generation
Viaarxiv icon

OSV: One Step is Enough for High-Quality Image to Video Generation

Add code
Sep 17, 2024
Figure 1 for OSV: One Step is Enough for High-Quality Image to Video Generation
Figure 2 for OSV: One Step is Enough for High-Quality Image to Video Generation
Figure 3 for OSV: One Step is Enough for High-Quality Image to Video Generation
Figure 4 for OSV: One Step is Enough for High-Quality Image to Video Generation
Viaarxiv icon

Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary Detection

Add code
Sep 16, 2024
Viaarxiv icon

SaRA: High-Efficient Diffusion Model Fine-tuning with Progressive Sparse Low-Rank Adaptation

Add code
Sep 10, 2024
Viaarxiv icon

Temporal and Interactive Modeling for Efficient Human-Human Motion Generation

Add code
Aug 30, 2024
Figure 1 for Temporal and Interactive Modeling for Efficient Human-Human Motion Generation
Figure 2 for Temporal and Interactive Modeling for Efficient Human-Human Motion Generation
Figure 3 for Temporal and Interactive Modeling for Efficient Human-Human Motion Generation
Figure 4 for Temporal and Interactive Modeling for Efficient Human-Human Motion Generation
Viaarxiv icon

DualAnoDiff: Dual-Interrelated Diffusion Model for Few-Shot Anomaly Image Generation

Add code
Aug 24, 2024
Viaarxiv icon

LLaVA-VSD: Large Language-and-Vision Assistant for Visual Spatial Description

Add code
Aug 09, 2024
Viaarxiv icon

MDT-A2G: Exploring Masked Diffusion Transformers for Co-Speech Gesture Generation

Add code
Aug 06, 2024
Viaarxiv icon

ADer: A Comprehensive Benchmark for Multi-class Visual Anomaly Detection

Add code
Jun 06, 2024
Viaarxiv icon

VividPose: Advancing Stable Video Diffusion for Realistic Human Image Animation

Add code
May 28, 2024
Viaarxiv icon