Picture for Dongkun Shin

Dongkun Shin

A2SF: Accumulative Attention Scoring with Forgetting Factor for Token Pruning in Transformer Decoder

Add code
Jul 31, 2024
Viaarxiv icon

Toward Efficient Permutation for Hierarchical N:M Sparsity on GPUs

Add code
Jul 30, 2024
Viaarxiv icon

Octave-YOLO: Cross frequency detection network with octave convolution

Add code
Jul 29, 2024
Viaarxiv icon

Realizing Unaligned Block-wise Pruning for DNN Acceleration on Mobile Devices

Add code
Jul 29, 2024
Viaarxiv icon