Picture for Zixin Wen

Zixin Wen

Faster WIND: Accelerating Iterative Best-of-$N$ Distillation for LLM Alignment

Add code
Oct 28, 2024
Viaarxiv icon

Transformers Provably Learn Feature-Position Correlations in Masked Image Modeling

Add code
Mar 04, 2024
Viaarxiv icon

Revisiting Disentanglement in Downstream Tasks: A Study on Its Necessity for Abstract Visual Reasoning

Add code
Mar 01, 2024
Viaarxiv icon

What Matters In The Structured Pruning of Generative Language Models?

Add code
Feb 07, 2023
Viaarxiv icon

The Mechanism of Prediction Head in Non-contrastive Self-supervised Learning

Add code
May 14, 2022
Figure 1 for The Mechanism of Prediction Head in Non-contrastive Self-supervised Learning
Figure 2 for The Mechanism of Prediction Head in Non-contrastive Self-supervised Learning
Figure 3 for The Mechanism of Prediction Head in Non-contrastive Self-supervised Learning
Figure 4 for The Mechanism of Prediction Head in Non-contrastive Self-supervised Learning
Viaarxiv icon

Improving Multi-Modal Learning with Uni-Modal Teachers

Add code
Jun 21, 2021
Figure 1 for Improving Multi-Modal Learning with Uni-Modal Teachers
Figure 2 for Improving Multi-Modal Learning with Uni-Modal Teachers
Figure 3 for Improving Multi-Modal Learning with Uni-Modal Teachers
Figure 4 for Improving Multi-Modal Learning with Uni-Modal Teachers
Viaarxiv icon

Toward Understanding the Feature Learning Process of Self-supervised Contrastive Learning

Add code
Jun 12, 2021
Figure 1 for Toward Understanding the Feature Learning Process of Self-supervised Contrastive Learning
Figure 2 for Toward Understanding the Feature Learning Process of Self-supervised Contrastive Learning
Figure 3 for Toward Understanding the Feature Learning Process of Self-supervised Contrastive Learning
Figure 4 for Toward Understanding the Feature Learning Process of Self-supervised Contrastive Learning
Viaarxiv icon

Convergence of End-to-End Training in Deep Unsupervised Contrasitive Learning

Add code
Feb 21, 2020
Viaarxiv icon