Picture for Weili Guan

Weili Guan

Curriculum Coarse-to-Fine Selection for High-IPC Dataset Distillation

Add code
Mar 24, 2025
Viaarxiv icon

Embodied Crowd Counting

Add code
Mar 11, 2025
Viaarxiv icon

MegaSR: Mining Customized Semantics and Expressive Guidance for Image Super-Resolution

Add code
Mar 11, 2025
Viaarxiv icon

PTQ1.61: Push the Real Limit of Extremely Low-Bit Post-Training Quantization Methods for Large Language Models

Add code
Feb 18, 2025
Viaarxiv icon

FALCON: Resolving Visual Redundancy and Fragmentation in High-resolution Multimodal Large Language Models via Visual Registers

Add code
Jan 27, 2025
Figure 1 for FALCON: Resolving Visual Redundancy and Fragmentation in High-resolution Multimodal Large Language Models via Visual Registers
Figure 2 for FALCON: Resolving Visual Redundancy and Fragmentation in High-resolution Multimodal Large Language Models via Visual Registers
Figure 3 for FALCON: Resolving Visual Redundancy and Fragmentation in High-resolution Multimodal Large Language Models via Visual Registers
Figure 4 for FALCON: Resolving Visual Redundancy and Fragmentation in High-resolution Multimodal Large Language Models via Visual Registers
Viaarxiv icon

Content-aware Balanced Spectrum Encoding in Masked Modeling for Time Series Classification

Add code
Dec 17, 2024
Viaarxiv icon

Multiple Information Prompt Learning for Cloth-Changing Person Re-Identification

Add code
Nov 01, 2024
Viaarxiv icon

Vision-guided and Mask-enhanced Adaptive Denoising for Prompt-based Image Editing

Add code
Oct 14, 2024
Figure 1 for Vision-guided and Mask-enhanced Adaptive Denoising for Prompt-based Image Editing
Figure 2 for Vision-guided and Mask-enhanced Adaptive Denoising for Prompt-based Image Editing
Figure 3 for Vision-guided and Mask-enhanced Adaptive Denoising for Prompt-based Image Editing
Figure 4 for Vision-guided and Mask-enhanced Adaptive Denoising for Prompt-based Image Editing
Viaarxiv icon

Token-level Correlation-guided Compression for Efficient Multimodal Document Understanding

Add code
Jul 19, 2024
Viaarxiv icon

MoME: Mixture of Multimodal Experts for Generalist Multimodal Large Language Models

Add code
Jul 17, 2024
Viaarxiv icon