Ziyun Dai

OpenCSG Chinese Corpus: A Series of High-quality Chinese Datasets for LLM Training

Jan 14, 2025
Via arXiv