Li Shen

Retrieval-Augmented Perception: High-Resolution Image Perception Meets Visual RAG

Mar 03, 2025
Via arXiv

Parameter Efficient Merging for Multimodal Large Language Models with Complementary Parameter Adaptation

Feb 24, 2025
Via arXiv

Stable-SPAM: How to Train in 4-Bit More Stably than 16-Bit Adam

Feb 24, 2025
Via arXiv

A Survey on Mechanistic Interpretability for Multi-Modal Foundation Models

Feb 22, 2025
Via arXiv

Edit Once, Update Everywhere: A Simple Framework for Cross-Lingual Knowledge Synchronization in LLMs

Feb 20, 2025
Via arXiv

PEARL: Towards Permutation-Resilient LLMs

Feb 20, 2025
Via arXiv

On Theoretical Limits of Learning with Label Differential Privacy

Feb 20, 2025
Via arXiv

Zero Token-Driven Deep Thinking in LLMs: Unlocking the Full Potential of Existing Parameters via Cyclic Refinement

Feb 17, 2025
Via arXiv

Mix Data or Merge Models? Balancing the Helpfulness, Honesty, and Harmlessness of Large Language Model via Model Merging

Feb 13, 2025
Via arXiv

HRP: High-Rank Preheating for Superior LoRA Initialization

Feb 11, 2025
Via arXiv