Picture for Li Du

Li Du

School of Electronic Science and Engineering, Nanjing University

Fast Controlled Generation from Language Models with Adaptive Weighted Rejection Sampling

Add code
Apr 07, 2025
Viaarxiv icon

MoLe-VLA: Dynamic Layer-skipping Vision Language Action Model via Mixture-of-Layers for Efficient Robot Manipulation

Add code
Mar 26, 2025
Viaarxiv icon

SliceOcc: Indoor 3D Semantic Occupancy Prediction with Vertical Slice Representation

Add code
Jan 28, 2025
Figure 1 for SliceOcc: Indoor 3D Semantic Occupancy Prediction with Vertical Slice Representation
Figure 2 for SliceOcc: Indoor 3D Semantic Occupancy Prediction with Vertical Slice Representation
Figure 3 for SliceOcc: Indoor 3D Semantic Occupancy Prediction with Vertical Slice Representation
Figure 4 for SliceOcc: Indoor 3D Semantic Occupancy Prediction with Vertical Slice Representation
Viaarxiv icon

Supervised Fine-Tuning: An Activation Pattern Optimization Process for Attention Heads

Add code
Sep 24, 2024
Figure 1 for Supervised Fine-Tuning: An Activation Pattern Optimization Process for Attention Heads
Figure 2 for Supervised Fine-Tuning: An Activation Pattern Optimization Process for Attention Heads
Figure 3 for Supervised Fine-Tuning: An Activation Pattern Optimization Process for Attention Heads
Figure 4 for Supervised Fine-Tuning: An Activation Pattern Optimization Process for Attention Heads
Viaarxiv icon

Beyond IID: Optimizing Instruction Learning from the Perspective of Instruction Interaction and Dependency

Add code
Sep 11, 2024
Viaarxiv icon

PAT: Pruning-Aware Tuning for Large Language Models

Add code
Aug 27, 2024
Figure 1 for PAT: Pruning-Aware Tuning for Large Language Models
Figure 2 for PAT: Pruning-Aware Tuning for Large Language Models
Figure 3 for PAT: Pruning-Aware Tuning for Large Language Models
Figure 4 for PAT: Pruning-Aware Tuning for Large Language Models
Viaarxiv icon

Causal-Guided Active Learning for Debiasing Large Language Models

Add code
Aug 23, 2024
Viaarxiv icon

Diagnosing and Remedying Knowledge Deficiencies in LLMs via Label-free Curricular Meaningful Learning

Add code
Aug 21, 2024
Figure 1 for Diagnosing and Remedying Knowledge Deficiencies in LLMs via Label-free Curricular Meaningful Learning
Figure 2 for Diagnosing and Remedying Knowledge Deficiencies in LLMs via Label-free Curricular Meaningful Learning
Figure 3 for Diagnosing and Remedying Knowledge Deficiencies in LLMs via Label-free Curricular Meaningful Learning
Figure 4 for Diagnosing and Remedying Knowledge Deficiencies in LLMs via Label-free Curricular Meaningful Learning
Viaarxiv icon

AquilaMoE: Efficient Training for MoE Models with Scale-Up and Scale-Out Strategies

Add code
Aug 13, 2024
Viaarxiv icon

SpikeLLM: Scaling up Spiking Neural Network to Large Language Models via Saliency-based Spiking

Add code
Jul 05, 2024
Figure 1 for SpikeLLM: Scaling up Spiking Neural Network to Large Language Models via Saliency-based Spiking
Figure 2 for SpikeLLM: Scaling up Spiking Neural Network to Large Language Models via Saliency-based Spiking
Figure 3 for SpikeLLM: Scaling up Spiking Neural Network to Large Language Models via Saliency-based Spiking
Figure 4 for SpikeLLM: Scaling up Spiking Neural Network to Large Language Models via Saliency-based Spiking
Viaarxiv icon