Weili Guan

Content-aware Balanced Spectrum Encoding in Masked Modeling for Time Series Classification

Dec 17, 2024
Via arXiv

Multiple Information Prompt Learning for Cloth-Changing Person Re-Identification

Nov 01, 2024
Via arXiv

Vision-guided and Mask-enhanced Adaptive Denoising for Prompt-based Image Editing

Oct 14, 2024
Via arXiv

Token-level Correlation-guided Compression for Efficient Multimodal Document Understanding

Jul 19, 2024
Via arXiv

MoME: Mixture of Multimodal Experts for Generalist Multimodal Large Language Models

Jul 17, 2024
Via arXiv

MMGRec: Multimodal Generative Recommendation with Transformer Model

Apr 25, 2024
Via arXiv

UniAV: Unified Audio-Visual Perception for Multi-Task Video Localization

Apr 04, 2024
Via arXiv

Prompt-based Multi-interest Learning Method for Sequential Recommendation

Jan 09, 2024
Via arXiv

Uncovering Hidden Connections: Iterative Tracking and Reasoning for Video-grounded Dialog

Oct 11, 2023
Via arXiv

Knowledge-Aware Prompt Tuning for Generalizable Vision-Language Models

Aug 22, 2023
Via arXiv