Picture for Hanyu Zhao

Hanyu Zhao

CCI3.0-HQ: a large-scale Chinese dataset of high quality designed for pre-training large language models

Add code
Oct 24, 2024
Viaarxiv icon

Beyond IID: Optimizing Instruction Learning from the Perspective of Instruction Interaction and Dependency

Add code
Sep 11, 2024
Viaarxiv icon

AquilaMoE: Efficient Training for MoE Models with Scale-Up and Scale-Out Strategies

Add code
Aug 13, 2024
Viaarxiv icon

Boosting Large-scale Parallel Training Efficiency with C4: A Communication-Driven Approach

Add code
Jun 07, 2024
Figure 1 for Boosting Large-scale Parallel Training Efficiency with C4: A Communication-Driven Approach
Figure 2 for Boosting Large-scale Parallel Training Efficiency with C4: A Communication-Driven Approach
Figure 3 for Boosting Large-scale Parallel Training Efficiency with C4: A Communication-Driven Approach
Figure 4 for Boosting Large-scale Parallel Training Efficiency with C4: A Communication-Driven Approach
Viaarxiv icon

Llumnix: Dynamic Scheduling for Large Language Model Serving

Add code
Jun 05, 2024
Figure 1 for Llumnix: Dynamic Scheduling for Large Language Model Serving
Figure 2 for Llumnix: Dynamic Scheduling for Large Language Model Serving
Figure 3 for Llumnix: Dynamic Scheduling for Large Language Model Serving
Figure 4 for Llumnix: Dynamic Scheduling for Large Language Model Serving
Viaarxiv icon

Variational Continual Test-Time Adaptation

Add code
Feb 13, 2024
Viaarxiv icon

ROAM: memory-efficient large DNN training via optimized operator ordering and memory layout

Add code
Oct 30, 2023
Viaarxiv icon

Artificial Intelligence Security Competition (AISC)

Add code
Dec 07, 2022
Figure 1 for Artificial Intelligence Security Competition (AISC)
Figure 2 for Artificial Intelligence Security Competition (AISC)
Figure 3 for Artificial Intelligence Security Competition (AISC)
Figure 4 for Artificial Intelligence Security Competition (AISC)
Viaarxiv icon

Instance-wise Prompt Tuning for Pretrained Language Models

Add code
Jun 04, 2022
Figure 1 for Instance-wise Prompt Tuning for Pretrained Language Models
Figure 2 for Instance-wise Prompt Tuning for Pretrained Language Models
Figure 3 for Instance-wise Prompt Tuning for Pretrained Language Models
Figure 4 for Instance-wise Prompt Tuning for Pretrained Language Models
Viaarxiv icon

A Roadmap for Big Model

Add code
Apr 02, 2022
Figure 1 for A Roadmap for Big Model
Figure 2 for A Roadmap for Big Model
Figure 3 for A Roadmap for Big Model
Figure 4 for A Roadmap for Big Model
Viaarxiv icon