Picture for Sucheng Ren

Sucheng Ren

CoPE: Clipped RoPE as A Scalable Free Lunch for Long Context LLMs

Add code
Feb 05, 2026
Viaarxiv icon

Spiral RoPE: Rotate Your Rotary Positional Embeddings in the 2D Plane

Add code
Feb 03, 2026
Viaarxiv icon

OpenVision 3: A Family of Unified Visual Encoder for Both Understanding and Generation

Add code
Jan 21, 2026
Viaarxiv icon

Grouping First, Attending Smartly: Training-Free Acceleration for Diffusion Transformers

Add code
May 20, 2025
Figure 1 for Grouping First, Attending Smartly: Training-Free Acceleration for Diffusion Transformers
Figure 2 for Grouping First, Attending Smartly: Training-Free Acceleration for Diffusion Transformers
Figure 3 for Grouping First, Attending Smartly: Training-Free Acceleration for Diffusion Transformers
Figure 4 for Grouping First, Attending Smartly: Training-Free Acceleration for Diffusion Transformers
Viaarxiv icon

Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generation

Add code
Feb 27, 2025
Viaarxiv icon

FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching

Add code
Dec 19, 2024
Viaarxiv icon

HResFormer: Hybrid Residual Transformer for Volumetric Medical Image Segmentation

Add code
Dec 16, 2024
Figure 1 for HResFormer: Hybrid Residual Transformer for Volumetric Medical Image Segmentation
Figure 2 for HResFormer: Hybrid Residual Transformer for Volumetric Medical Image Segmentation
Figure 3 for HResFormer: Hybrid Residual Transformer for Volumetric Medical Image Segmentation
Figure 4 for HResFormer: Hybrid Residual Transformer for Volumetric Medical Image Segmentation
Viaarxiv icon

M-VAR: Decoupled Scale-wise Autoregressive Modeling for High-Quality Image Generation

Add code
Nov 15, 2024
Figure 1 for M-VAR: Decoupled Scale-wise Autoregressive Modeling for High-Quality Image Generation
Figure 2 for M-VAR: Decoupled Scale-wise Autoregressive Modeling for High-Quality Image Generation
Figure 3 for M-VAR: Decoupled Scale-wise Autoregressive Modeling for High-Quality Image Generation
Figure 4 for M-VAR: Decoupled Scale-wise Autoregressive Modeling for High-Quality Image Generation
Viaarxiv icon

Causal Image Modeling for Efficient Visual Understanding

Add code
Oct 10, 2024
Figure 1 for Causal Image Modeling for Efficient Visual Understanding
Figure 2 for Causal Image Modeling for Efficient Visual Understanding
Figure 3 for Causal Image Modeling for Efficient Visual Understanding
Figure 4 for Causal Image Modeling for Efficient Visual Understanding
Viaarxiv icon

What If We Recaption Billions of Web Images with LLaMA-3?

Add code
Jun 12, 2024
Figure 1 for What If We Recaption Billions of Web Images with LLaMA-3?
Figure 2 for What If We Recaption Billions of Web Images with LLaMA-3?
Figure 3 for What If We Recaption Billions of Web Images with LLaMA-3?
Figure 4 for What If We Recaption Billions of Web Images with LLaMA-3?
Viaarxiv icon