Picture for Yaodong Yu

Yaodong Yu

Scaling Laws in Patchification: An Image Is Worth 50,176 Tokens And More

Add code
Feb 06, 2025
Viaarxiv icon

Trading Inference-Time Compute for Adversarial Robustness

Add code
Jan 31, 2025
Figure 1 for Trading Inference-Time Compute for Adversarial Robustness
Figure 2 for Trading Inference-Time Compute for Adversarial Robustness
Figure 3 for Trading Inference-Time Compute for Adversarial Robustness
Figure 4 for Trading Inference-Time Compute for Adversarial Robustness
Viaarxiv icon

Token Statistics Transformer: Linear-Time Attention via Variational Rate Reduction

Add code
Dec 23, 2024
Viaarxiv icon

M-VAR: Decoupled Scale-wise Autoregressive Modeling for High-Quality Image Generation

Add code
Nov 15, 2024
Figure 1 for M-VAR: Decoupled Scale-wise Autoregressive Modeling for High-Quality Image Generation
Figure 2 for M-VAR: Decoupled Scale-wise Autoregressive Modeling for High-Quality Image Generation
Figure 3 for M-VAR: Decoupled Scale-wise Autoregressive Modeling for High-Quality Image Generation
Figure 4 for M-VAR: Decoupled Scale-wise Autoregressive Modeling for High-Quality Image Generation
Viaarxiv icon

Causal Image Modeling for Efficient Visual Understanding

Add code
Oct 10, 2024
Figure 1 for Causal Image Modeling for Efficient Visual Understanding
Figure 2 for Causal Image Modeling for Efficient Visual Understanding
Figure 3 for Causal Image Modeling for Efficient Visual Understanding
Figure 4 for Causal Image Modeling for Efficient Visual Understanding
Viaarxiv icon

Accuracy on the wrong line: On the pitfalls of noisy data for out-of-distribution generalisation

Add code
Jun 27, 2024
Viaarxiv icon

A Global Geometric Analysis of Maximal Coding Rate Reduction

Add code
Jun 04, 2024
Figure 1 for A Global Geometric Analysis of Maximal Coding Rate Reduction
Figure 2 for A Global Geometric Analysis of Maximal Coding Rate Reduction
Figure 3 for A Global Geometric Analysis of Maximal Coding Rate Reduction
Figure 4 for A Global Geometric Analysis of Maximal Coding Rate Reduction
Viaarxiv icon

Scaling White-Box Transformers for Vision

Add code
Jun 03, 2024
Figure 1 for Scaling White-Box Transformers for Vision
Figure 2 for Scaling White-Box Transformers for Vision
Figure 3 for Scaling White-Box Transformers for Vision
Figure 4 for Scaling White-Box Transformers for Vision
Viaarxiv icon

Masked Completion via Structured Diffusion with White-Box Transformers

Add code
Apr 03, 2024
Viaarxiv icon

Differentially Private Representation Learning via Image Captioning

Add code
Mar 04, 2024
Figure 1 for Differentially Private Representation Learning via Image Captioning
Figure 2 for Differentially Private Representation Learning via Image Captioning
Figure 3 for Differentially Private Representation Learning via Image Captioning
Figure 4 for Differentially Private Representation Learning via Image Captioning
Viaarxiv icon