Picture for Jinbao Xue

Jinbao Xue

Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent

Add code
Nov 05, 2024
Viaarxiv icon

Efficiently Training 7B LLM with 1 Million Sequence Length on 8 GPUs

Add code
Jul 16, 2024
Viaarxiv icon

BeamVQ: Aligning Space-Time Forecasting Model via Self-training on Physics-aware Metrics

Add code
May 27, 2024
Figure 1 for BeamVQ: Aligning Space-Time Forecasting Model via Self-training on Physics-aware Metrics
Figure 2 for BeamVQ: Aligning Space-Time Forecasting Model via Self-training on Physics-aware Metrics
Figure 3 for BeamVQ: Aligning Space-Time Forecasting Model via Self-training on Physics-aware Metrics
Figure 4 for BeamVQ: Aligning Space-Time Forecasting Model via Self-training on Physics-aware Metrics
Viaarxiv icon

Surge Phenomenon in Optimal Learning Rate and Batch Size Scaling

Add code
May 23, 2024
Figure 1 for Surge Phenomenon in Optimal Learning Rate and Batch Size Scaling
Figure 2 for Surge Phenomenon in Optimal Learning Rate and Batch Size Scaling
Figure 3 for Surge Phenomenon in Optimal Learning Rate and Batch Size Scaling
Figure 4 for Surge Phenomenon in Optimal Learning Rate and Batch Size Scaling
Viaarxiv icon

Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Add code
May 14, 2024
Viaarxiv icon

Angel-PTM: A Scalable and Economical Large-scale Pre-training System in Tencent

Add code
Mar 06, 2023
Viaarxiv icon

M6: A Chinese Multimodal Pretrainer

Add code
Mar 02, 2021
Figure 1 for M6: A Chinese Multimodal Pretrainer
Figure 2 for M6: A Chinese Multimodal Pretrainer
Figure 3 for M6: A Chinese Multimodal Pretrainer
Figure 4 for M6: A Chinese Multimodal Pretrainer
Viaarxiv icon

A Multi-Semantic Metapath Model for Large Scale Heterogeneous Network Representation Learning

Add code
Jul 19, 2020
Figure 1 for A Multi-Semantic Metapath Model for Large Scale Heterogeneous Network Representation Learning
Figure 2 for A Multi-Semantic Metapath Model for Large Scale Heterogeneous Network Representation Learning
Figure 3 for A Multi-Semantic Metapath Model for Large Scale Heterogeneous Network Representation Learning
Figure 4 for A Multi-Semantic Metapath Model for Large Scale Heterogeneous Network Representation Learning
Viaarxiv icon