Picture for Qingyuan Li

Qingyuan Li

Flash Communication: Reducing Tensor Parallelization Bottleneck for Fast Large Language Model Inference

Add code
Dec 06, 2024
Viaarxiv icon

Integer Scale: A Free Lunch for Faster Fine-grained Quantization of LLMs

Add code
May 23, 2024
Viaarxiv icon

UMIE: Unified Multimodal Information Extraction with Instruction Tuning

Add code
Jan 05, 2024
Figure 1 for UMIE: Unified Multimodal Information Extraction with Instruction Tuning
Figure 2 for UMIE: Unified Multimodal Information Extraction with Instruction Tuning
Figure 3 for UMIE: Unified Multimodal Information Extraction with Instruction Tuning
Figure 4 for UMIE: Unified Multimodal Information Extraction with Instruction Tuning
Viaarxiv icon

A Speed Odyssey for Deployable Quantization of LLMs

Add code
Nov 16, 2023
Viaarxiv icon

Norm Tweaking: High-performance Low-bit Quantization of Large Language Models

Add code
Sep 06, 2023
Viaarxiv icon

FPTQ: Fine-grained Post-Training Quantization for Large Language Models

Add code
Aug 30, 2023
Viaarxiv icon

EAPruning: Evolutionary Pruning for Vision Transformers and CNNs

Add code
Oct 01, 2022
Figure 1 for EAPruning: Evolutionary Pruning for Vision Transformers and CNNs
Figure 2 for EAPruning: Evolutionary Pruning for Vision Transformers and CNNs
Figure 3 for EAPruning: Evolutionary Pruning for Vision Transformers and CNNs
Figure 4 for EAPruning: Evolutionary Pruning for Vision Transformers and CNNs
Viaarxiv icon

YOLOv6: A Single-Stage Object Detection Framework for Industrial Applications

Add code
Sep 07, 2022
Figure 1 for YOLOv6: A Single-Stage Object Detection Framework for Industrial Applications
Figure 2 for YOLOv6: A Single-Stage Object Detection Framework for Industrial Applications
Figure 3 for YOLOv6: A Single-Stage Object Detection Framework for Industrial Applications
Figure 4 for YOLOv6: A Single-Stage Object Detection Framework for Industrial Applications
Viaarxiv icon

ScarletNAS: Bridging the Gap Between Scalability and Fairness in Neural Architecture Search

Add code
Sep 13, 2019
Figure 1 for ScarletNAS: Bridging the Gap Between Scalability and Fairness in Neural Architecture Search
Figure 2 for ScarletNAS: Bridging the Gap Between Scalability and Fairness in Neural Architecture Search
Figure 3 for ScarletNAS: Bridging the Gap Between Scalability and Fairness in Neural Architecture Search
Figure 4 for ScarletNAS: Bridging the Gap Between Scalability and Fairness in Neural Architecture Search
Viaarxiv icon

Fast, Accurate and Lightweight Super-Resolution with Neural Architecture Search

Add code
Jan 24, 2019
Figure 1 for Fast, Accurate and Lightweight Super-Resolution with Neural Architecture Search
Figure 2 for Fast, Accurate and Lightweight Super-Resolution with Neural Architecture Search
Figure 3 for Fast, Accurate and Lightweight Super-Resolution with Neural Architecture Search
Figure 4 for Fast, Accurate and Lightweight Super-Resolution with Neural Architecture Search
Viaarxiv icon