Picture for Qingquan Song

Qingquan Song

Efficient AI in Practice: Training and Deployment of Efficient LLMs for Industry Applications

Add code
Feb 20, 2025
Viaarxiv icon

360Brew: A Decoder-only Foundation Model for Personalized Ranking and Recommendation

Add code
Jan 27, 2025
Figure 1 for 360Brew: A Decoder-only Foundation Model for Personalized Ranking and Recommendation
Figure 2 for 360Brew: A Decoder-only Foundation Model for Personalized Ranking and Recommendation
Figure 3 for 360Brew: A Decoder-only Foundation Model for Personalized Ranking and Recommendation
Figure 4 for 360Brew: A Decoder-only Foundation Model for Personalized Ranking and Recommendation
Viaarxiv icon

AlphaPO -- Reward shape matters for LLM alignment

Add code
Jan 07, 2025
Viaarxiv icon

Liger Kernel: Efficient Triton Kernels for LLM Training

Add code
Oct 14, 2024
Figure 1 for Liger Kernel: Efficient Triton Kernels for LLM Training
Figure 2 for Liger Kernel: Efficient Triton Kernels for LLM Training
Figure 3 for Liger Kernel: Efficient Triton Kernels for LLM Training
Figure 4 for Liger Kernel: Efficient Triton Kernels for LLM Training
Viaarxiv icon

LiNR: Model Based Neural Retrieval on GPUs at LinkedIn

Add code
Jul 18, 2024
Figure 1 for LiNR: Model Based Neural Retrieval on GPUs at LinkedIn
Figure 2 for LiNR: Model Based Neural Retrieval on GPUs at LinkedIn
Figure 3 for LiNR: Model Based Neural Retrieval on GPUs at LinkedIn
Figure 4 for LiNR: Model Based Neural Retrieval on GPUs at LinkedIn
Viaarxiv icon

Learning to Retrieve for Job Matching

Add code
Feb 21, 2024
Figure 1 for Learning to Retrieve for Job Matching
Figure 2 for Learning to Retrieve for Job Matching
Figure 3 for Learning to Retrieve for Job Matching
Figure 4 for Learning to Retrieve for Job Matching
Viaarxiv icon

LiRank: Industrial Large Scale Ranking Models at LinkedIn

Add code
Feb 10, 2024
Figure 1 for LiRank: Industrial Large Scale Ranking Models at LinkedIn
Figure 2 for LiRank: Industrial Large Scale Ranking Models at LinkedIn
Figure 3 for LiRank: Industrial Large Scale Ranking Models at LinkedIn
Figure 4 for LiRank: Industrial Large Scale Ranking Models at LinkedIn
Viaarxiv icon

FFSplit: Split Feed-Forward Network For Optimizing Accuracy-Efficiency Trade-off in Language Model Inference

Add code
Jan 08, 2024
Viaarxiv icon

mSAM: Micro-Batch-Averaged Sharpness-Aware Minimization

Add code
Feb 19, 2023
Viaarxiv icon

Improved Deep Neural Network Generalization Using m-Sharpness-Aware Minimization

Add code
Dec 07, 2022
Figure 1 for Improved Deep Neural Network Generalization Using m-Sharpness-Aware Minimization
Figure 2 for Improved Deep Neural Network Generalization Using m-Sharpness-Aware Minimization
Figure 3 for Improved Deep Neural Network Generalization Using m-Sharpness-Aware Minimization
Figure 4 for Improved Deep Neural Network Generalization Using m-Sharpness-Aware Minimization
Viaarxiv icon