Picture for Wei-Shi Zheng

Wei-Shi Zheng

Learning Implicit Features with Flow Infused Attention for Realistic Virtual Try-On

Add code
Dec 16, 2024
Viaarxiv icon

Light-T2M: A Lightweight and Fast Model for Text-to-motion Generation

Add code
Dec 15, 2024
Viaarxiv icon

TechCoach: Towards Technical Keypoint-Aware Descriptive Action Coaching

Add code
Nov 26, 2024
Figure 1 for TechCoach: Towards Technical Keypoint-Aware Descriptive Action Coaching
Figure 2 for TechCoach: Towards Technical Keypoint-Aware Descriptive Action Coaching
Figure 3 for TechCoach: Towards Technical Keypoint-Aware Descriptive Action Coaching
Figure 4 for TechCoach: Towards Technical Keypoint-Aware Descriptive Action Coaching
Viaarxiv icon

InTraGen: Trajectory-controlled Video Generation for Object Interactions

Add code
Nov 25, 2024
Viaarxiv icon

Frozen-DETR: Enhancing DETR with Image Understanding from Frozen Foundation Models

Add code
Oct 25, 2024
Figure 1 for Frozen-DETR: Enhancing DETR with Image Understanding from Frozen Foundation Models
Figure 2 for Frozen-DETR: Enhancing DETR with Image Understanding from Frozen Foundation Models
Figure 3 for Frozen-DETR: Enhancing DETR with Image Understanding from Frozen Foundation Models
Figure 4 for Frozen-DETR: Enhancing DETR with Image Understanding from Frozen Foundation Models
Viaarxiv icon

Real-to-Sim Grasp: Rethinking the Gap between Simulation and Real World in Grasp Detection

Add code
Oct 09, 2024
Figure 1 for Real-to-Sim Grasp: Rethinking the Gap between Simulation and Real World in Grasp Detection
Figure 2 for Real-to-Sim Grasp: Rethinking the Gap between Simulation and Real World in Grasp Detection
Figure 3 for Real-to-Sim Grasp: Rethinking the Gap between Simulation and Real World in Grasp Detection
Figure 4 for Real-to-Sim Grasp: Rethinking the Gap between Simulation and Real World in Grasp Detection
Viaarxiv icon

Towards Completeness: A Generalizable Action Proposal Generator for Zero-Shot Temporal Action Localization

Add code
Aug 25, 2024
Figure 1 for Towards Completeness: A Generalizable Action Proposal Generator for Zero-Shot Temporal Action Localization
Figure 2 for Towards Completeness: A Generalizable Action Proposal Generator for Zero-Shot Temporal Action Localization
Figure 3 for Towards Completeness: A Generalizable Action Proposal Generator for Zero-Shot Temporal Action Localization
Figure 4 for Towards Completeness: A Generalizable Action Proposal Generator for Zero-Shot Temporal Action Localization
Viaarxiv icon

ParGo: Bridging Vision-Language with Partial and Global Views

Add code
Aug 23, 2024
Viaarxiv icon

PixelFade: Privacy-preserving Person Re-identification with Noise-guided Progressive Replacement

Add code
Aug 10, 2024
Viaarxiv icon

Loc4Plan: Locating Before Planning for Outdoor Vision and Language Navigation

Add code
Aug 09, 2024
Viaarxiv icon