Picture for Li Shen

Li Shen

Adaptive Defense against Harmful Fine-Tuning for Large Language Models via Bayesian Data Scheduler

Add code
Oct 31, 2025
Viaarxiv icon

Sparse Model Inversion: Efficient Inversion of Vision Transformers for Data-Free Applications

Add code
Oct 31, 2025
Viaarxiv icon

Unveiling the Power of Multiple Gossip Steps: A Stability-Based Generalization Analysis in Decentralized Training

Add code
Oct 09, 2025
Viaarxiv icon

Effective Policy Learning for Multi-Agent Online Coordination Beyond Submodular Objectives

Add code
Sep 26, 2025
Viaarxiv icon

Beyond Two-Stage Training: Cooperative SFT and RL for LLM Reasoning

Add code
Sep 08, 2025
Viaarxiv icon

Diffusion Language Models Know the Answer Before Decoding

Add code
Aug 27, 2025
Figure 1 for Diffusion Language Models Know the Answer Before Decoding
Figure 2 for Diffusion Language Models Know the Answer Before Decoding
Figure 3 for Diffusion Language Models Know the Answer Before Decoding
Figure 4 for Diffusion Language Models Know the Answer Before Decoding
Viaarxiv icon

DynamiCare: A Dynamic Multi-Agent Framework for Interactive and Open-Ended Medical Decision-Making

Add code
Jul 03, 2025
Viaarxiv icon

AlphaDecay:Module-wise Weight Decay for Heavy-Tailed Balancing in LLMs

Add code
Jun 17, 2025
Viaarxiv icon

MaskPro: Linear-Space Probabilistic Learning for Strict (N:M)-Sparsity on Large Language Models

Add code
Jun 15, 2025
Viaarxiv icon

TrojanTO: Action-Level Backdoor Attacks against Trajectory Optimization Models

Add code
Jun 15, 2025
Viaarxiv icon