Picture for Kan Wu

Kan Wu

Stephen

Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent

Add code
Nov 05, 2024
Figure 1 for Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent
Figure 2 for Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent
Figure 3 for Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent
Figure 4 for Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent
Viaarxiv icon

Lossless KV Cache Compression to 2%

Add code
Oct 20, 2024
Viaarxiv icon

Physical design optimization for automated drug dispensing systems in a human-machine interaction environment

Add code
Dec 18, 2023
Viaarxiv icon

A Quick Response Algorithm for Dynamic Autonomous Mobile Robot Routing Problem with Time Windows

Add code
Nov 26, 2023
Viaarxiv icon

FP8-LM: Training FP8 Large Language Models

Add code
Oct 27, 2023
Viaarxiv icon

TinyCLIP: CLIP Distillation via Affinity Mimicking and Weight Inheritance

Add code
Sep 21, 2023
Viaarxiv icon

The Multi-trip Autonomous Mobile Robots Scheduling Problem with Time Windows in a Stochastic Environment at Smart Hospitals

Add code
Jul 30, 2023
Viaarxiv icon

TinyViT: Fast Pretraining Distillation for Small Vision Transformers

Add code
Jul 21, 2022
Figure 1 for TinyViT: Fast Pretraining Distillation for Small Vision Transformers
Figure 2 for TinyViT: Fast Pretraining Distillation for Small Vision Transformers
Figure 3 for TinyViT: Fast Pretraining Distillation for Small Vision Transformers
Figure 4 for TinyViT: Fast Pretraining Distillation for Small Vision Transformers
Viaarxiv icon

MiniViT: Compressing Vision Transformers with Weight Multiplexing

Add code
Apr 14, 2022
Figure 1 for MiniViT: Compressing Vision Transformers with Weight Multiplexing
Figure 2 for MiniViT: Compressing Vision Transformers with Weight Multiplexing
Figure 3 for MiniViT: Compressing Vision Transformers with Weight Multiplexing
Figure 4 for MiniViT: Compressing Vision Transformers with Weight Multiplexing
Viaarxiv icon

Searching the Search Space of Vision Transformer

Add code
Nov 29, 2021
Figure 1 for Searching the Search Space of Vision Transformer
Figure 2 for Searching the Search Space of Vision Transformer
Figure 3 for Searching the Search Space of Vision Transformer
Figure 4 for Searching the Search Space of Vision Transformer
Viaarxiv icon