Jue Wang

NowYouSee Me: Context-Aware Automatic Audio Description

Dec 13, 2024

GEXIA: Granularity Expansion and Iterative Approximation for Scalable Multi-grained Video-language Learning

Dec 10, 2024

Video Token Merging for Long-form Video Understanding

Oct 31, 2024

DPDEdit: Detail-Preserved Diffusion Models for Multimodal Fashion Image Editing

Sep 02, 2024

FlexEdit: Marrying Free-Shape Masks to VLLM for Flexible Image Editing

Aug 22, 2024

Text-Guided Video Masked Autoencoder

Aug 01, 2024

Ada-adapter: Fast Few-shot Style Personalization of Diffusion Model with Pre-trained Image Encoder

Jul 08, 2024

An I2I Inpainting Approach for Efficient Channel Knowledge Map Construction

Jun 14, 2024

Mixture-of-Agents Enhances Large Language Model Capabilities

Jun 07, 2024

UniIF: Unified Molecule Inverse Folding

May 29, 2024