Picture for Shuigeng Zhou

Shuigeng Zhou

Can Mixture-of-Experts Surpass Dense LLMs Under Strictly Equal Resources?

Add code
Jun 13, 2025
Viaarxiv icon

Farseer: A Refined Scaling Law in Large Language Models

Add code
Jun 12, 2025
Viaarxiv icon

Score-based Generative Modeling for Conditional Independence Testing

Add code
May 29, 2025
Viaarxiv icon

Imagination-Limited Q-Learning for Offline Reinforcement Learning

Add code
May 18, 2025
Viaarxiv icon

Is Compression Really Linear with Code Intelligence?

Add code
May 16, 2025
Viaarxiv icon

Data Synthesis with Diverse Styles for Face Recognition via 3DMM-Guided Diffusion

Add code
Apr 01, 2025
Viaarxiv icon

Effective Cloud Removal for Remote Sensing Images by an Improved Mean-Reverting Denoising Model with Elucidated Design Space

Add code
Mar 31, 2025
Viaarxiv icon

Predictable Scale: Part I -- Optimal Hyperparameter Scaling Law in Large Language Model Pretraining

Add code
Mar 06, 2025
Viaarxiv icon

UIFace: Unleashing Inherent Model Capabilities to Enhance Intra-Class Diversity in Synthetic Face Recognition

Add code
Feb 27, 2025
Viaarxiv icon

Generative Regression Based Watch Time Prediction for Video Recommendation: Model and Performance

Add code
Dec 28, 2024
Figure 1 for Generative Regression Based Watch Time Prediction for Video Recommendation: Model and Performance
Figure 2 for Generative Regression Based Watch Time Prediction for Video Recommendation: Model and Performance
Figure 3 for Generative Regression Based Watch Time Prediction for Video Recommendation: Model and Performance
Figure 4 for Generative Regression Based Watch Time Prediction for Video Recommendation: Model and Performance
Viaarxiv icon