Picture for Li Du

Li Du

School of Electronic Science and Engineering, Nanjing University

Supervised Fine-Tuning: An Activation Pattern Optimization Process for Attention Heads

Add code
Sep 24, 2024
Figure 1 for Supervised Fine-Tuning: An Activation Pattern Optimization Process for Attention Heads
Figure 2 for Supervised Fine-Tuning: An Activation Pattern Optimization Process for Attention Heads
Figure 3 for Supervised Fine-Tuning: An Activation Pattern Optimization Process for Attention Heads
Figure 4 for Supervised Fine-Tuning: An Activation Pattern Optimization Process for Attention Heads
Viaarxiv icon

Beyond IID: Optimizing Instruction Learning from the Perspective of Instruction Interaction and Dependency

Add code
Sep 11, 2024
Viaarxiv icon

PAT: Pruning-Aware Tuning for Large Language Models

Add code
Aug 27, 2024
Figure 1 for PAT: Pruning-Aware Tuning for Large Language Models
Figure 2 for PAT: Pruning-Aware Tuning for Large Language Models
Figure 3 for PAT: Pruning-Aware Tuning for Large Language Models
Figure 4 for PAT: Pruning-Aware Tuning for Large Language Models
Viaarxiv icon

Causal-Guided Active Learning for Debiasing Large Language Models

Add code
Aug 23, 2024
Viaarxiv icon

Diagnosing and Remedying Knowledge Deficiencies in LLMs via Label-free Curricular Meaningful Learning

Add code
Aug 21, 2024
Viaarxiv icon

AquilaMoE: Efficient Training for MoE Models with Scale-Up and Scale-Out Strategies

Add code
Aug 13, 2024
Viaarxiv icon

SpikeLLM: Scaling up Spiking Neural Network to Large Language Models via Saliency-based Spiking

Add code
Jul 05, 2024
Figure 1 for SpikeLLM: Scaling up Spiking Neural Network to Large Language Models via Saliency-based Spiking
Figure 2 for SpikeLLM: Scaling up Spiking Neural Network to Large Language Models via Saliency-based Spiking
Figure 3 for SpikeLLM: Scaling up Spiking Neural Network to Large Language Models via Saliency-based Spiking
Figure 4 for SpikeLLM: Scaling up Spiking Neural Network to Large Language Models via Saliency-based Spiking
Viaarxiv icon

SFC: Achieve Accurate Fast Convolution under Low-precision Arithmetic

Add code
Jul 03, 2024
Viaarxiv icon

Decomposing the Neurons: Activation Sparsity via Mixture of Experts for Continual Test Time Adaptation

Add code
May 26, 2024
Figure 1 for Decomposing the Neurons: Activation Sparsity via Mixture of Experts for Continual Test Time Adaptation
Figure 2 for Decomposing the Neurons: Activation Sparsity via Mixture of Experts for Continual Test Time Adaptation
Figure 3 for Decomposing the Neurons: Activation Sparsity via Mixture of Experts for Continual Test Time Adaptation
Figure 4 for Decomposing the Neurons: Activation Sparsity via Mixture of Experts for Continual Test Time Adaptation
Viaarxiv icon

Medical Dialogue: A Survey of Categories, Methods, Evaluation and Challenges

Add code
May 17, 2024
Figure 1 for Medical Dialogue: A Survey of Categories, Methods, Evaluation and Challenges
Figure 2 for Medical Dialogue: A Survey of Categories, Methods, Evaluation and Challenges
Figure 3 for Medical Dialogue: A Survey of Categories, Methods, Evaluation and Challenges
Figure 4 for Medical Dialogue: A Survey of Categories, Methods, Evaluation and Challenges
Viaarxiv icon