Lei Wu

BlueLM-V-3B: Algorithm and System Co-Design for Multimodal Large Language Models on Mobile Devices

Nov 16, 2024
Via arXiv

Prove Your Point!: Bringing Proof-Enhancement Principles to Argumentative Essay Generation

Oct 30, 2024
Via arXiv

How Transformers Implement Induction Heads: Approximation and Optimization Analysis

Oct 15, 2024
Via arXiv

DTactive: A Vision-Based Tactile Sensor with Active Surface

Oct 10, 2024
Via arXiv

Why Do You Grok? A Theoretical Analysis of Grokking Modular Addition

Jul 17, 2024
Via arXiv

Improving Generalization and Convergence by Enhancing Implicit Regularization

May 31, 2024
Via arXiv

Exploring Neural Network Landscapes: Star-Shaped and Geodesic Connectivity

Apr 09, 2024
Via arXiv

A Duality Analysis of Kernel Ridge Regression in the Noiseless Regime

Feb 24, 2024
Via arXiv

The Implicit Bias of Gradient Noise: A Symmetry Perspective

Feb 11, 2024
Via arXiv

Achieving Margin Maximization Exponentially Fast via Progressive Norm Rescaling

Dec 08, 2023
Via arXiv