Picture for He Zhang

He Zhang

Augmenting Image Annotation: A Human-LMM Collaborative Framework for Efficient Object Selection and Label Generation

Add code
Mar 14, 2025
Viaarxiv icon

One-Step Diffusion Model for Image Motion-Deblurring

Add code
Mar 09, 2025
Viaarxiv icon

Benchmarking Zero-Shot Facial Emotion Annotation with Large Language Models: A Multi-Class and Multi-Frame Approach in DailyLife

Add code
Feb 18, 2025
Viaarxiv icon

Multitwine: Multi-Object Compositing with Text and Layout Control

Add code
Feb 07, 2025
Viaarxiv icon

TransPixar: Advancing Text-to-Video Generation with Transparency

Add code
Jan 06, 2025
Figure 1 for TransPixar: Advancing Text-to-Video Generation with Transparency
Figure 2 for TransPixar: Advancing Text-to-Video Generation with Transparency
Figure 3 for TransPixar: Advancing Text-to-Video Generation with Transparency
Figure 4 for TransPixar: Advancing Text-to-Video Generation with Transparency
Viaarxiv icon

Text2Relight: Creative Portrait Relighting with Text Guidance

Add code
Dec 18, 2024
Figure 1 for Text2Relight: Creative Portrait Relighting with Text Guidance
Figure 2 for Text2Relight: Creative Portrait Relighting with Text Guidance
Figure 3 for Text2Relight: Creative Portrait Relighting with Text Guidance
Figure 4 for Text2Relight: Creative Portrait Relighting with Text Guidance
Viaarxiv icon

Second Language (Arabic) Acquisition of LLMs via Progressive Vocabulary Expansion

Add code
Dec 16, 2024
Figure 1 for Second Language (Arabic) Acquisition of LLMs via Progressive Vocabulary Expansion
Figure 2 for Second Language (Arabic) Acquisition of LLMs via Progressive Vocabulary Expansion
Figure 3 for Second Language (Arabic) Acquisition of LLMs via Progressive Vocabulary Expansion
Figure 4 for Second Language (Arabic) Acquisition of LLMs via Progressive Vocabulary Expansion
Viaarxiv icon

UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics

Add code
Dec 10, 2024
Figure 1 for UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics
Figure 2 for UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics
Figure 3 for UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics
Figure 4 for UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics
Viaarxiv icon

CHOICE: Coordinated Human-Object Interaction in Cluttered Environments for Pick-and-Place Actions

Add code
Dec 09, 2024
Viaarxiv icon

DIVE: Taming DINO for Subject-Driven Video Editing

Add code
Dec 04, 2024
Figure 1 for DIVE: Taming DINO for Subject-Driven Video Editing
Figure 2 for DIVE: Taming DINO for Subject-Driven Video Editing
Figure 3 for DIVE: Taming DINO for Subject-Driven Video Editing
Figure 4 for DIVE: Taming DINO for Subject-Driven Video Editing
Viaarxiv icon