Picture for Kexin Yang

Kexin Yang

additional authors not shown

OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration

Add code
Feb 05, 2026
Viaarxiv icon

PLawBench: A Rubric-Based Benchmark for Evaluating LLMs in Real-World Legal Practice

Add code
Jan 23, 2026
Viaarxiv icon

Qwen3 Technical Report

Add code
May 14, 2025
Figure 1 for Qwen3 Technical Report
Figure 2 for Qwen3 Technical Report
Figure 3 for Qwen3 Technical Report
Figure 4 for Qwen3 Technical Report
Viaarxiv icon

DataMan: Data Manager for Pre-training Large Language Models

Add code
Feb 26, 2025
Viaarxiv icon

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Add code
Feb 20, 2025
Viaarxiv icon

Qwen2.5-1M Technical Report

Add code
Jan 26, 2025
Figure 1 for Qwen2.5-1M Technical Report
Figure 2 for Qwen2.5-1M Technical Report
Figure 3 for Qwen2.5-1M Technical Report
Figure 4 for Qwen2.5-1M Technical Report
Viaarxiv icon

Qwen2.5 Technical Report

Add code
Dec 19, 2024
Figure 1 for Qwen2.5 Technical Report
Figure 2 for Qwen2.5 Technical Report
Figure 3 for Qwen2.5 Technical Report
Figure 4 for Qwen2.5 Technical Report
Viaarxiv icon

Robust Simultaneous Multislice MRI Reconstruction Using Deep Generative Priors

Add code
Jul 31, 2024
Figure 1 for Robust Simultaneous Multislice MRI Reconstruction Using Deep Generative Priors
Figure 2 for Robust Simultaneous Multislice MRI Reconstruction Using Deep Generative Priors
Figure 3 for Robust Simultaneous Multislice MRI Reconstruction Using Deep Generative Priors
Figure 4 for Robust Simultaneous Multislice MRI Reconstruction Using Deep Generative Priors
Viaarxiv icon

Qwen2 Technical Report

Add code
Jul 16, 2024
Figure 1 for Qwen2 Technical Report
Figure 2 for Qwen2 Technical Report
Figure 3 for Qwen2 Technical Report
Figure 4 for Qwen2 Technical Report
Viaarxiv icon

InjectTST: A Transformer Method of Injecting Global Information into Independent Channels for Long Time Series Forecasting

Add code
Mar 05, 2024
Figure 1 for InjectTST: A Transformer Method of Injecting Global Information into Independent Channels for Long Time Series Forecasting
Figure 2 for InjectTST: A Transformer Method of Injecting Global Information into Independent Channels for Long Time Series Forecasting
Figure 3 for InjectTST: A Transformer Method of Injecting Global Information into Independent Channels for Long Time Series Forecasting
Figure 4 for InjectTST: A Transformer Method of Injecting Global Information into Independent Channels for Long Time Series Forecasting
Viaarxiv icon