Picture for Yingxin Li

Yingxin Li

JAQ: Joint Efficient Architecture Design and Low-Bit Quantization with Hardware-Software Co-Exploration

Add code
Jan 09, 2025
Figure 1 for JAQ: Joint Efficient Architecture Design and Low-Bit Quantization with Hardware-Software Co-Exploration
Figure 2 for JAQ: Joint Efficient Architecture Design and Low-Bit Quantization with Hardware-Software Co-Exploration
Figure 3 for JAQ: Joint Efficient Architecture Design and Low-Bit Quantization with Hardware-Software Co-Exploration
Figure 4 for JAQ: Joint Efficient Architecture Design and Low-Bit Quantization with Hardware-Software Co-Exploration
Viaarxiv icon

EMS: Adaptive Evict-then-Merge Strategy for Head-wise KV Cache Compression Based on Global-Local Importance

Add code
Dec 11, 2024
Viaarxiv icon

Towards NeuroAI: Introducing Neuronal Diversity into Artificial Neural Networks

Add code
Jan 23, 2023
Viaarxiv icon