Picture for Li Shang

Li Shang

The Curse and Blessing of Mean Bias in FP4-Quantized LLM Training

Add code
Mar 11, 2026
Viaarxiv icon

Multi-Head Attention as a Source of Catastrophic Forgetting in MoE Transformers

Add code
Feb 13, 2026
Viaarxiv icon

SD-MoE: Spectral Decomposition for Effective Expert Specialization

Add code
Feb 13, 2026
Viaarxiv icon

Dispelling the Curse of Singularities in Neural Network Optimizations

Add code
Feb 01, 2026
Viaarxiv icon

White-Box Op-Amp Design via Human-Mimicking Reasoning

Add code
Jan 29, 2026
Viaarxiv icon

Bridging the Initialization Gap: A Co-Optimization Framework for Mixed-Size Global Placement

Add code
Nov 13, 2025
Viaarxiv icon

AnalogSeeker: An Open-source Foundation Language Model for Analog Circuit Design

Add code
Aug 14, 2025
Viaarxiv icon

Bidirectional Knowledge Distillation for Enhancing Sequential Recommendation with Large Language Models

Add code
May 23, 2025
Viaarxiv icon

The Power of Graph Signal Processing for Chip Placement Acceleration

Add code
Feb 24, 2025
Figure 1 for The Power of Graph Signal Processing for Chip Placement Acceleration
Figure 2 for The Power of Graph Signal Processing for Chip Placement Acceleration
Figure 3 for The Power of Graph Signal Processing for Chip Placement Acceleration
Figure 4 for The Power of Graph Signal Processing for Chip Placement Acceleration
Viaarxiv icon

Mitigating Popularity Bias in Collaborative Filtering through Fair Sampling

Add code
Feb 19, 2025
Viaarxiv icon