Picture for Yanzhe Li

Yanzhe Li

Every FLOP Counts: Scaling a 300B Mixture-of-Experts LING LLM without Premium GPUs

Add code
Mar 07, 2025
Figure 1 for Every FLOP Counts: Scaling a 300B Mixture-of-Experts LING LLM without Premium GPUs
Figure 2 for Every FLOP Counts: Scaling a 300B Mixture-of-Experts LING LLM without Premium GPUs
Figure 3 for Every FLOP Counts: Scaling a 300B Mixture-of-Experts LING LLM without Premium GPUs
Figure 4 for Every FLOP Counts: Scaling a 300B Mixture-of-Experts LING LLM without Premium GPUs
Viaarxiv icon

Modelling prospective memory and resilient situated communications via Wizard of Oz

Add code
Nov 09, 2023
Viaarxiv icon

Adversarial Sub-sequence for Text Generation

Add code
May 30, 2019
Figure 1 for Adversarial Sub-sequence for Text Generation
Figure 2 for Adversarial Sub-sequence for Text Generation
Figure 3 for Adversarial Sub-sequence for Text Generation
Figure 4 for Adversarial Sub-sequence for Text Generation
Viaarxiv icon