Picture for Li Shen

Li Shen

Zero Token-Driven Deep Thinking in LLMs: Unlocking the Full Potential of Existing Parameters via Cyclic Refinement

Add code
Feb 17, 2025
Viaarxiv icon

Mix Data or Merge Models? Balancing the Helpfulness, Honesty, and Harmlessness of Large Language Model via Model Merging

Add code
Feb 13, 2025
Viaarxiv icon

HRP: High-Rank Preheating for Superior LoRA Initialization

Add code
Feb 11, 2025
Viaarxiv icon

Mask-Enhanced Autoregressive Prediction: Pay Less Attention to Learn More

Add code
Feb 11, 2025
Viaarxiv icon

Near-Optimal Online Learning for Multi-Agent Submodular Coordination: Tight Approximation and Communication Efficiency

Add code
Feb 07, 2025
Viaarxiv icon

Leveraging Reasoning with Guidelines to Elicit and Utilize Knowledge for Enhancing Safety Alignment

Add code
Feb 06, 2025
Viaarxiv icon

Mediator: Memory-efficient LLM Merging with Less Parameter Conflicts and Uncertainty Based Routing

Add code
Feb 06, 2025
Viaarxiv icon

TeZO: Empowering the Low-Rankness on the Temporal Dimension in the Zeroth-Order Optimization for Fine-tuning LLMs

Add code
Jan 31, 2025
Viaarxiv icon

Panacea: Mitigating Harmful Fine-tuning for Large Language Models via Post-fine-tuning Perturbation

Add code
Jan 30, 2025
Viaarxiv icon

Merging Models on the Fly Without Retraining: A Sequential Approach to Scalable Continual Model Merging

Add code
Jan 16, 2025
Viaarxiv icon