Picture for Liang Ding

Liang Ding

Enhancing Input-Label Mapping in In-Context Learning with Contrastive Decoding

Add code
Feb 19, 2025
Viaarxiv icon

"Short-length" Adversarial Training Helps LLMs Defend "Long-length" Jailbreak Attacks: Theoretical and Empirical Evidence

Add code
Feb 06, 2025
Viaarxiv icon

TeZO: Empowering the Low-Rankness on the Temporal Dimension in the Zeroth-Order Optimization for Fine-tuning LLMs

Add code
Jan 31, 2025
Viaarxiv icon

The Energy Loss Phenomenon in RLHF: A New Perspective on Mitigating Reward Hacking

Add code
Jan 31, 2025
Figure 1 for The Energy Loss Phenomenon in RLHF: A New Perspective on Mitigating Reward Hacking
Figure 2 for The Energy Loss Phenomenon in RLHF: A New Perspective on Mitigating Reward Hacking
Figure 3 for The Energy Loss Phenomenon in RLHF: A New Perspective on Mitigating Reward Hacking
Figure 4 for The Energy Loss Phenomenon in RLHF: A New Perspective on Mitigating Reward Hacking
Viaarxiv icon

Leveraging Metamemory Mechanisms for Enhanced Data-Free Code Generation in LLMs

Add code
Jan 14, 2025
Viaarxiv icon

DynamicKV: Task-Aware Adaptive KV Cache Compression for Long Context LLMs

Add code
Dec 19, 2024
Viaarxiv icon

Self-Evolution Knowledge Distillation for LLM-based Machine Translation

Add code
Dec 19, 2024
Viaarxiv icon

CogSteer: Cognition-Inspired Selective Layer Intervention for Efficient Semantic Steering in Large Language Models

Add code
Oct 23, 2024
Figure 1 for CogSteer: Cognition-Inspired Selective Layer Intervention for Efficient Semantic Steering in Large Language Models
Figure 2 for CogSteer: Cognition-Inspired Selective Layer Intervention for Efficient Semantic Steering in Large Language Models
Figure 3 for CogSteer: Cognition-Inspired Selective Layer Intervention for Efficient Semantic Steering in Large Language Models
Figure 4 for CogSteer: Cognition-Inspired Selective Layer Intervention for Efficient Semantic Steering in Large Language Models
Viaarxiv icon

Learning from Imperfect Data: Towards Efficient Knowledge Distillation of Autoregressive Language Models for Text-to-SQL

Add code
Oct 15, 2024
Figure 1 for Learning from Imperfect Data: Towards Efficient Knowledge Distillation of Autoregressive Language Models for Text-to-SQL
Figure 2 for Learning from Imperfect Data: Towards Efficient Knowledge Distillation of Autoregressive Language Models for Text-to-SQL
Figure 3 for Learning from Imperfect Data: Towards Efficient Knowledge Distillation of Autoregressive Language Models for Text-to-SQL
Figure 4 for Learning from Imperfect Data: Towards Efficient Knowledge Distillation of Autoregressive Language Models for Text-to-SQL
Viaarxiv icon

Simultaneous Computation and Memory Efficient Zeroth-Order Optimizer for Fine-Tuning Large Language Models

Add code
Oct 13, 2024
Figure 1 for Simultaneous Computation and Memory Efficient Zeroth-Order Optimizer for Fine-Tuning Large Language Models
Figure 2 for Simultaneous Computation and Memory Efficient Zeroth-Order Optimizer for Fine-Tuning Large Language Models
Figure 3 for Simultaneous Computation and Memory Efficient Zeroth-Order Optimizer for Fine-Tuning Large Language Models
Figure 4 for Simultaneous Computation and Memory Efficient Zeroth-Order Optimizer for Fine-Tuning Large Language Models
Viaarxiv icon