Sinno Jialin Pan

MemDLM: Memory-Enhanced DLM Training

Mar 23, 2026
Via arXiv

Beyond Speedup -- Utilizing KV Cache for Sampling and Reasoning

Jan 28, 2026
Via arXiv

SCOPE: Prompt Evolution for Enhancing Agent Effectiveness

Dec 17, 2025
Via arXiv

MetaDefense: Defending Finetuning-based Jailbreak Attack Before and During Generation

Oct 09, 2025
Via arXiv

PreMoe: Lightening MoEs on Constrained Memory by Expert Pruning and Retrieval

May 23, 2025
Via arXiv

Reinforcing Compositional Retrieval: Retrieving Step-by-Step for Composing Informative Contexts

Apr 15, 2025
Via arXiv

CMoE: Fast Carving of Mixture-of-Experts for Efficient LLM Inference

Feb 06, 2025
Via arXiv

KVTuner: Sensitivity-Aware Layer-wise Mixed Precision KV Cache Quantization for Efficient and Nearly Lossless LLM Inference

Feb 06, 2025
Via arXiv

FuseGPT: Learnable Layers Fusion of Generative Pre-trained Transformers

Nov 21, 2024
Via arXiv

State Chrono Representation for Enhancing Generalization in Reinforcement Learning

Nov 09, 2024
Via arXiv