Picture for Pingcheng Dong

Pingcheng Dong

Genetic Quantization-Aware Approximation for Non-Linear Operations in Transformers

Add code
Mar 29, 2024
Figure 1 for Genetic Quantization-Aware Approximation for Non-Linear Operations in Transformers
Figure 2 for Genetic Quantization-Aware Approximation for Non-Linear Operations in Transformers
Figure 3 for Genetic Quantization-Aware Approximation for Non-Linear Operations in Transformers
Figure 4 for Genetic Quantization-Aware Approximation for Non-Linear Operations in Transformers
Viaarxiv icon

Boundary and Relation Distillation for Semantic Segmentation

Add code
Jan 24, 2024
Viaarxiv icon

LLM-FP4: 4-Bit Floating-Point Quantized Transformers

Add code
Oct 25, 2023
Viaarxiv icon