Ruijie Zhang

TEON: Tensorized Orthonormalization Beyond Layer-Wise Muon for Large Language Model Pre-Training

Jan 30, 2026
Via arXiv

BOOST: BOttleneck-Optimized Scalable Training Framework for Low-Rank Large Language Models

Dec 13, 2025
Via arXiv

RaLD: Generating High-Resolution 3D Radar Point Clouds with Latent Diffusion

Nov 10, 2025
Via arXiv

Rényi Sharpness: A Novel Sharpness that Strongly Correlates with Generalization

Oct 09, 2025
Via arXiv

VRBench: A Benchmark for Multi-Step Reasoning in Long Narrative Videos

Jun 12, 2025
Via arXiv

LaX: Boosting Low-Rank Training of Foundation Models via Latent Crossing

May 27, 2025
Via arXiv

Open-Sora 2.0: Training a Commercial-Level Video Generation Model in $200k

Mar 12, 2025
Via arXiv

DetVPCC: RoI-based Point Cloud Sequence Compression for 3D Object Detection

Feb 07, 2025
Via arXiv

Optimistic ε-Greedy Exploration for Cooperative Multi-Agent Reinforcement Learning

Feb 05, 2025
Via arXiv

WanJuanSiLu: A High-Quality Open-Source Webtext Dataset for Low-Resource Languages

Jan 24, 2025
Via arXiv