Shuhao Gu

CCI3.0-HQ: a large-scale Chinese dataset of high quality designed for pre-training large language models

Oct 24, 2024
Via arXiv

Infinity-MM: Scaling Multimodal Performance with Large-Scale and High-Quality Instruction Data

Oct 24, 2024
Via arXiv

ReTok: Replacing Tokenizer to Enhance Representation Efficiency in Large Language Model

Oct 06, 2024
Via arXiv

Aquila2 Technical Report

Aug 14, 2024
Via arXiv

AquilaMoE: Efficient Training for MoE Models with Scale-Up and Scale-Out Strategies

Aug 13, 2024
Via arXiv

Addressing the Length Bias Problem in Document-Level Neural Machine Translation

Nov 20, 2023
Via arXiv

Enhancing Neural Machine Translation with Semantic Units

Oct 17, 2023
Via arXiv

Continual Learning of Neural Machine Translation within Low Forgetting Risk Regions

Nov 04, 2022
Via arXiv

Improving Zero-Shot Multilingual Translation with Universal Representations and Cross-Mappings

Oct 28, 2022
Via arXiv

Importance-based Neuron Allocation for Multilingual Neural Machine Translation

Jul 14, 2021
Via arXiv