Picture for Bo Chen

Bo Chen

DJI Innovations Inc

Numerical Pruning for Efficient Autoregressive Models

Add code
Dec 17, 2024
Viaarxiv icon

LazyDiT: Lazy Learning for the Acceleration of Diffusion Transformers

Add code
Dec 17, 2024
Viaarxiv icon

MIND: Effective Incorrect Assignment Detection through a Multi-Modal Structure-Enhanced Language Model

Add code
Dec 05, 2024
Viaarxiv icon

LIBER: Lifelong User Behavior Modeling Based on Large Language Models

Add code
Nov 22, 2024
Figure 1 for LIBER: Lifelong User Behavior Modeling Based on Large Language Models
Figure 2 for LIBER: Lifelong User Behavior Modeling Based on Large Language Models
Figure 3 for LIBER: Lifelong User Behavior Modeling Based on Large Language Models
Figure 4 for LIBER: Lifelong User Behavior Modeling Based on Large Language Models
Viaarxiv icon

Circuit Complexity Bounds for RoPE-based Transformer Architecture

Add code
Nov 12, 2024
Viaarxiv icon

Training Compute-Optimal Protein Language Models

Add code
Nov 04, 2024
Viaarxiv icon

Beyond Positive History: Re-ranking with List-level Hybrid Feedback

Add code
Oct 28, 2024
Figure 1 for Beyond Positive History: Re-ranking with List-level Hybrid Feedback
Figure 2 for Beyond Positive History: Re-ranking with List-level Hybrid Feedback
Figure 3 for Beyond Positive History: Re-ranking with List-level Hybrid Feedback
Figure 4 for Beyond Positive History: Re-ranking with List-level Hybrid Feedback
Viaarxiv icon

Bypassing the Exponential Dependency: Looped Transformers Efficiently Learn In-context by Multi-step Gradient Descent

Add code
Oct 15, 2024
Viaarxiv icon

HSR-Enhanced Sparse Attention Acceleration

Add code
Oct 14, 2024
Figure 1 for HSR-Enhanced Sparse Attention Acceleration
Figure 2 for HSR-Enhanced Sparse Attention Acceleration
Viaarxiv icon

Scalable Weibull Graph Attention Autoencoder for Modeling Document Networks

Add code
Oct 13, 2024
Viaarxiv icon