Picture for Weili Guan

Weili Guan

PTQ1.61: Push the Real Limit of Extremely Low-Bit Post-Training Quantization Methods for Large Language Models

Add code
Feb 18, 2025
Viaarxiv icon

FALCON: Resolving Visual Redundancy and Fragmentation in High-resolution Multimodal Large Language Models via Visual Registers

Add code
Jan 27, 2025
Viaarxiv icon

Content-aware Balanced Spectrum Encoding in Masked Modeling for Time Series Classification

Add code
Dec 17, 2024
Viaarxiv icon

Multiple Information Prompt Learning for Cloth-Changing Person Re-Identification

Add code
Nov 01, 2024
Viaarxiv icon

Vision-guided and Mask-enhanced Adaptive Denoising for Prompt-based Image Editing

Add code
Oct 14, 2024
Figure 1 for Vision-guided and Mask-enhanced Adaptive Denoising for Prompt-based Image Editing
Figure 2 for Vision-guided and Mask-enhanced Adaptive Denoising for Prompt-based Image Editing
Figure 3 for Vision-guided and Mask-enhanced Adaptive Denoising for Prompt-based Image Editing
Figure 4 for Vision-guided and Mask-enhanced Adaptive Denoising for Prompt-based Image Editing
Viaarxiv icon

Token-level Correlation-guided Compression for Efficient Multimodal Document Understanding

Add code
Jul 19, 2024
Viaarxiv icon

MoME: Mixture of Multimodal Experts for Generalist Multimodal Large Language Models

Add code
Jul 17, 2024
Viaarxiv icon

MMGRec: Multimodal Generative Recommendation with Transformer Model

Add code
Apr 25, 2024
Viaarxiv icon

UniAV: Unified Audio-Visual Perception for Multi-Task Video Localization

Add code
Apr 04, 2024
Figure 1 for UniAV: Unified Audio-Visual Perception for Multi-Task Video Localization
Figure 2 for UniAV: Unified Audio-Visual Perception for Multi-Task Video Localization
Figure 3 for UniAV: Unified Audio-Visual Perception for Multi-Task Video Localization
Figure 4 for UniAV: Unified Audio-Visual Perception for Multi-Task Video Localization
Viaarxiv icon

Prompt-based Multi-interest Learning Method for Sequential Recommendation

Add code
Jan 09, 2024
Viaarxiv icon