Picture for Chao Fang

Chao Fang

Jamming Detection and Channel Estimation for Spatially Correlated Beamspace Massive MIMO

Add code
Oct 18, 2024
Viaarxiv icon

Efficient Arbitrary Precision Acceleration for Large Language Models on GPU Tensor Cores

Add code
Sep 26, 2024
Viaarxiv icon

Co-Designing Binarized Transformer and Hardware Accelerator for Efficient End-to-End Edge Deployment

Add code
Jul 16, 2024
Viaarxiv icon

BETA: Binarized Energy-Efficient Transformer Accelerator at the Edge

Add code
Jan 23, 2024
Viaarxiv icon

Efficient N:M Sparse DNN Training Using Algorithm, Architecture, and Dataflow Co-Design

Add code
Sep 22, 2023
Viaarxiv icon

A Precision-Scalable RISC-V DNN Processor with On-Device Learning Capability at the Extreme Edge

Add code
Sep 15, 2023
Viaarxiv icon

NVAutoNet: Fast and Accurate 360$^{\circ}$ 3D Visual Perception For Self Driving

Add code
Mar 30, 2023
Viaarxiv icon

PDPU: An Open-Source Posit Dot-Product Unit for Deep Learning Applications

Add code
Feb 03, 2023
Viaarxiv icon

BEBERT: Efficient and robust binary ensemble BERT

Add code
Oct 28, 2022
Viaarxiv icon

An Algorithm-Hardware Co-Optimized Framework for Accelerating N:M Sparse Transformers

Add code
Aug 12, 2022
Figure 1 for An Algorithm-Hardware Co-Optimized Framework for Accelerating N:M Sparse Transformers
Figure 2 for An Algorithm-Hardware Co-Optimized Framework for Accelerating N:M Sparse Transformers
Figure 3 for An Algorithm-Hardware Co-Optimized Framework for Accelerating N:M Sparse Transformers
Figure 4 for An Algorithm-Hardware Co-Optimized Framework for Accelerating N:M Sparse Transformers
Viaarxiv icon