Dawei Zhu

More Tokens, Lower Precision: Towards the Optimal Token-Precision Trade-off in KV Cache Compression

Dec 17, 2024

Instruction-Tuned LLMs Succeed in Document-Level MT Without Fine-Tuning -- But BLEU Turns a Blind Eye

Oct 29, 2024

AgentBank: Towards Generalized LLM Agents via Fine-Tuning on 50000+ Interaction Trajectories

Oct 10, 2024

To Preserve or To Compress: An In-Depth Study of Connector Selection in Multimodal Large Language Models

Oct 09, 2024

The Accuracy Paradox in RLHF: When Better Reward Models Don't Yield Better Language Models

Oct 09, 2024

From Calculation to Adjudication: Examining LLM judges on Mathematical Reasoning Tasks

Sep 06, 2024

Assessing "Implicit" Retrieval Robustness of Large Language Models

Jun 26, 2024

EERPD: Leveraging Emotion and Emotion Regulation for Improving Personality Detection

Jun 23, 2024

InternLM-Law: An Open Source Chinese Legal Large Language Model

Jun 21, 2024

Long Context Alignment with Short Instructions and Synthesized Positions

May 07, 2024