Picture for Qiang Su

Qiang Su

RINAS: Training with Dataset Shuffling Can Be General and Fast

Add code
Dec 04, 2023
Viaarxiv icon

Adaptive Gating in Mixture-of-Experts based Language Models

Add code
Oct 11, 2023
Figure 1 for Adaptive Gating in Mixture-of-Experts based Language Models
Figure 2 for Adaptive Gating in Mixture-of-Experts based Language Models
Figure 3 for Adaptive Gating in Mixture-of-Experts based Language Models
Figure 4 for Adaptive Gating in Mixture-of-Experts based Language Models
Viaarxiv icon

Deconfounding Duration Bias in Watch-time Prediction for Video Recommendation

Add code
Jun 13, 2022
Figure 1 for Deconfounding Duration Bias in Watch-time Prediction for Video Recommendation
Figure 2 for Deconfounding Duration Bias in Watch-time Prediction for Video Recommendation
Figure 3 for Deconfounding Duration Bias in Watch-time Prediction for Video Recommendation
Figure 4 for Deconfounding Duration Bias in Watch-time Prediction for Video Recommendation
Viaarxiv icon

PASTO: Strategic Parameter Optimization in Recommendation Systems -- Probabilistic is Better than Deterministic

Add code
Aug 20, 2021
Figure 1 for PASTO: Strategic Parameter Optimization in Recommendation Systems -- Probabilistic is Better than Deterministic
Figure 2 for PASTO: Strategic Parameter Optimization in Recommendation Systems -- Probabilistic is Better than Deterministic
Figure 3 for PASTO: Strategic Parameter Optimization in Recommendation Systems -- Probabilistic is Better than Deterministic
Figure 4 for PASTO: Strategic Parameter Optimization in Recommendation Systems -- Probabilistic is Better than Deterministic
Viaarxiv icon