Han Bao

Any-stepsize Gradient Descent for Separable Data under Fenchel-Young Losses

Feb 07, 2025

TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models

Jan 29, 2025

Online Inverse Linear Optimization: Improved Regret Bound, Robustness to Suboptimality, and Toward Tight Regret Analysis

Jan 27, 2025

Revisiting Online Learning Approach to Inverse Linear Optimization: A Fenchel-Young Loss Perspective and Gap-Dependent Regret Analysis

Jan 24, 2025

DeepSeek-V3 Technical Report

Dec 27, 2024

Zipfian Whitening

Nov 01, 2024

AutoBench-V: Can Large Vision-Language Models Benchmark Themselves?

Oct 29, 2024

FastAttention: Extend FlashAttention2 to NPUs and Low-resource GPUs

Oct 22, 2024

Beyond 2:4: exploring V:N:M sparsity for efficient transformer inference on GPUs

Oct 21, 2024

FlatQuant: Flatness Matters for LLM Quantization

Oct 12, 2024