Picture for Kun Kuang

Kun Kuang

Mix Data or Merge Models? Balancing the Helpfulness, Honesty, and Harmlessness of Large Language Model via Model Merging

Add code
Feb 13, 2025
Viaarxiv icon

Each Rank Could be an Expert: Single-Ranked Mixture of Experts LoRA for Multi-Task Learning

Add code
Jan 25, 2025
Viaarxiv icon

Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models

Add code
Jan 23, 2025
Figure 1 for Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models
Figure 2 for Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models
Figure 3 for Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models
Figure 4 for Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models
Viaarxiv icon

Collaboration of Large Language Models and Small Recommendation Models for Device-Cloud Recommendation

Add code
Jan 10, 2025
Viaarxiv icon

Optimize Incompatible Parameters through Compatibility-aware Knowledge Integration

Add code
Jan 10, 2025
Figure 1 for Optimize Incompatible Parameters through Compatibility-aware Knowledge Integration
Figure 2 for Optimize Incompatible Parameters through Compatibility-aware Knowledge Integration
Figure 3 for Optimize Incompatible Parameters through Compatibility-aware Knowledge Integration
Figure 4 for Optimize Incompatible Parameters through Compatibility-aware Knowledge Integration
Viaarxiv icon

Forward Once for All: Structural Parameterized Adaptation for Efficient Cloud-coordinated On-device Recommendation

Add code
Jan 06, 2025
Viaarxiv icon

General Information Metrics for Improving AI Model Training Efficiency

Add code
Jan 02, 2025
Figure 1 for General Information Metrics for Improving AI Model Training Efficiency
Figure 2 for General Information Metrics for Improving AI Model Training Efficiency
Figure 3 for General Information Metrics for Improving AI Model Training Efficiency
Figure 4 for General Information Metrics for Improving AI Model Training Efficiency
Viaarxiv icon

FedCFA: Alleviating Simpson's Paradox in Model Aggregation with Counterfactual Federated Learning

Add code
Dec 25, 2024
Viaarxiv icon

Learning Causal Transition Matrix for Instance-dependent Label Noise

Add code
Dec 18, 2024
Figure 1 for Learning Causal Transition Matrix for Instance-dependent Label Noise
Figure 2 for Learning Causal Transition Matrix for Instance-dependent Label Noise
Figure 3 for Learning Causal Transition Matrix for Instance-dependent Label Noise
Figure 4 for Learning Causal Transition Matrix for Instance-dependent Label Noise
Viaarxiv icon

Learning to Solve Domain-Specific Calculation Problems with Knowledge-Intensive Programs Generator

Add code
Dec 12, 2024
Viaarxiv icon