Picture for Yimin Jiang

Yimin Jiang

DSV: Exploiting Dynamic Sparsity to Accelerate Large-Scale Video DiT Training

Add code
Feb 11, 2025
Viaarxiv icon

InfinitePOD: Building Datacenter-Scale High-Bandwidth Domain for LLM with Optical Circuit Switching Transceivers

Add code
Feb 07, 2025
Viaarxiv icon

Teola: Towards End-to-End Optimization of LLM-based Applications

Add code
Jun 29, 2024
Viaarxiv icon

Scene-Adaptive Person Search via Bilateral Modulations

Add code
May 05, 2024
Viaarxiv icon

Adaptive Gating in Mixture-of-Experts based Language Models

Add code
Oct 11, 2023
Figure 1 for Adaptive Gating in Mixture-of-Experts based Language Models
Figure 2 for Adaptive Gating in Mixture-of-Experts based Language Models
Figure 3 for Adaptive Gating in Mixture-of-Experts based Language Models
Figure 4 for Adaptive Gating in Mixture-of-Experts based Language Models
Viaarxiv icon