Jixuan Leng

S$^{2}$FT: Efficient, Scalable and Generalizable LLM Fine-tuning by Structured Sparsity

Dec 10, 2024
Via arXiv

Taming Overconfidence in LLMs: Reward Calibration in RLHF

Oct 13, 2024
Via arXiv

Choosing Wisely and Learning Deeply: Selective Cross-Modality Distillation via CLIP for Domain Generalization

Nov 26, 2023
Via arXiv