Picture for Zhenbo Sun

Zhenbo Sun

A Multi-Power Law for Loss Curve Prediction Across Learning Rate Schedules

Add code
Mar 17, 2025
Viaarxiv icon

CPM-2: Large-scale Cost-effective Pre-trained Language Models

Add code
Jun 24, 2021
Figure 1 for CPM-2: Large-scale Cost-effective Pre-trained Language Models
Figure 2 for CPM-2: Large-scale Cost-effective Pre-trained Language Models
Figure 3 for CPM-2: Large-scale Cost-effective Pre-trained Language Models
Figure 4 for CPM-2: Large-scale Cost-effective Pre-trained Language Models
Viaarxiv icon

CPM: A Large-scale Generative Chinese Pre-trained Language Model

Add code
Dec 01, 2020
Figure 1 for CPM: A Large-scale Generative Chinese Pre-trained Language Model
Figure 2 for CPM: A Large-scale Generative Chinese Pre-trained Language Model
Figure 3 for CPM: A Large-scale Generative Chinese Pre-trained Language Model
Figure 4 for CPM: A Large-scale Generative Chinese Pre-trained Language Model
Viaarxiv icon