Picture for Qingtian Feng

Qingtian Feng

DM3D: Distortion-Minimized Weight Pruning for Lossless 3D Object Detection

Add code
Jul 02, 2024
Viaarxiv icon

LoRA-Switch: Boosting the Efficiency of Dynamic LLM Adapters via System-Algorithm Co-design

Add code
May 28, 2024
Viaarxiv icon

Serving MoE Models on Resource-constrained Edge Devices via Dynamic Expert Swapping

Add code
Aug 29, 2023
Figure 1 for Serving MoE Models on Resource-constrained Edge Devices via Dynamic Expert Swapping
Figure 2 for Serving MoE Models on Resource-constrained Edge Devices via Dynamic Expert Swapping
Figure 3 for Serving MoE Models on Resource-constrained Edge Devices via Dynamic Expert Swapping
Figure 4 for Serving MoE Models on Resource-constrained Edge Devices via Dynamic Expert Swapping
Viaarxiv icon