Yuxin Ren

Is Attention Required for Transformer Inference? Explore Function-preserving Attention Replacement

May 29, 2025
Via arXiv

On Affine Homotopy between Language Encoders

Jun 04, 2024
Via arXiv

Non-autoregressive Generative Models for Reranking Recommendation

Feb 10, 2024
Via arXiv

All Roads Lead to Rome? Exploring the Invariance of Transformers' Representations

May 23, 2023
Via arXiv

Tailoring Instructions to Student's Learning Levels Boosts Knowledge Distillation

May 16, 2023
Via arXiv

Tackling Instance-Dependent Label Noise with Dynamic Distribution Calibration

Oct 11, 2022
Via arXiv

Exploring Extreme Parameter Compression for Pre-trained Language Models

May 20, 2022
Via arXiv