Jiancong Xiao

Magnetic Preference Optimization: Achieving Last-iterate Convergence for Language Models Alignment

Oct 22, 2024

Entropic Distribution Matching in Supervised Fine-tuning of LLMs: Less Overfitting and Better Diversity

Aug 29, 2024

Fine-Tuning Linear Layers Only Is a Simple yet Effective Way for Task Arithmetic

Jul 09, 2024

Bridging the Gap: Rademacher Complexity in Robust and Standard Generalization

Jun 08, 2024

On the Algorithmic Bias of Aligning Large Language Models with RLHF: Preference Collapse and Matching Regularization

May 26, 2024

Uniformly Stable Algorithms for Adversarial Training and Beyond

May 03, 2024

PAC-Bayesian Spectrally-Normalized Bounds for Adversarially Robust Generalization

Oct 09, 2023

Adversarial Rademacher Complexity of Deep Neural Networks

Nov 27, 2022

Stability Analysis and Generalization Bounds of Adversarial Training

Oct 03, 2022

Understanding Adversarial Robustness Against On-manifold Adversarial Examples

Oct 02, 2022