Yuxin Zhang

TextRefiner: Internal Visual Feature as Efficient Refiner for Vision-Language Models Prompt Tuning

Dec 11, 2024
Via arXiv

VidMusician: Video-to-Music Generation with Semantic-Rhythmic Alignment via Hierarchical Visual Features

Dec 09, 2024
Via arXiv

HeadRouter: A Training-free Image Editing Framework for MM-DiTs by Adaptively Routing Attention Heads

Nov 22, 2024
Via arXiv

LumiSculpt: A Consistency Lighting Control Network for Video Generation

Oct 30, 2024
Via arXiv

DREB-Net: Dual-stream Restoration Embedding Blur-feature Fusion Network for High-mobility UAV Object Detection

Oct 23, 2024
Via arXiv

Ultrasound Image Enhancement with the Variance of Diffusion Models

Sep 17, 2024
Via arXiv

Compact Implicit Neural Representations for Plane Wave Images

Sep 17, 2024
Via arXiv

Passenger hazard perception based on EEG signals for highly automated driving vehicles

Aug 29, 2024
Via arXiv

Advancing Multimodal Large Language Models with Quantization-Aware Scale Learning for Efficient Adaptation

Aug 07, 2024
Via arXiv

Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model

Jul 24, 2024
Via arXiv