Picture for Cheng Cheng

Cheng Cheng

Hammer: Robust Function-Calling for On-Device Language Models via Function Masking

Add code
Oct 06, 2024
Figure 1 for Hammer: Robust Function-Calling for On-Device Language Models via Function Masking
Figure 2 for Hammer: Robust Function-Calling for On-Device Language Models via Function Masking
Figure 3 for Hammer: Robust Function-Calling for On-Device Language Models via Function Masking
Figure 4 for Hammer: Robust Function-Calling for On-Device Language Models via Function Masking
Viaarxiv icon

Skywork-Math: Data Scaling Laws for Mathematical Reasoning in Large Language Models -- The Story Goes On

Add code
Jul 11, 2024
Figure 1 for Skywork-Math: Data Scaling Laws for Mathematical Reasoning in Large Language Models -- The Story Goes On
Figure 2 for Skywork-Math: Data Scaling Laws for Mathematical Reasoning in Large Language Models -- The Story Goes On
Figure 3 for Skywork-Math: Data Scaling Laws for Mathematical Reasoning in Large Language Models -- The Story Goes On
Figure 4 for Skywork-Math: Data Scaling Laws for Mathematical Reasoning in Large Language Models -- The Story Goes On
Viaarxiv icon

Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models

Add code
Jun 03, 2024
Figure 1 for Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models
Figure 2 for Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models
Figure 3 for Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models
Figure 4 for Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models
Viaarxiv icon

LongSkywork: A Training Recipe for Efficiently Extending Context Length in Large Language Models

Add code
Jun 02, 2024
Viaarxiv icon

Activating Wider Areas in Image Super-Resolution

Add code
Mar 13, 2024
Viaarxiv icon

PLReMix: Combating Noisy Labels with Pseudo-Label Relaxed Contrastive Representation Learning

Add code
Feb 27, 2024
Viaarxiv icon

Meta-Adapter: An Online Few-shot Learner for Vision-Language Model

Add code
Nov 07, 2023
Viaarxiv icon

Skywork: A More Open Bilingual Foundation Model

Add code
Oct 30, 2023
Figure 1 for Skywork: A More Open Bilingual Foundation Model
Figure 2 for Skywork: A More Open Bilingual Foundation Model
Figure 3 for Skywork: A More Open Bilingual Foundation Model
Figure 4 for Skywork: A More Open Bilingual Foundation Model
Viaarxiv icon

Random Sampling of Bandlimited Graph Signals from Local Measurements

Add code
Oct 18, 2023
Figure 1 for Random Sampling of Bandlimited Graph Signals from Local Measurements
Figure 2 for Random Sampling of Bandlimited Graph Signals from Local Measurements
Figure 3 for Random Sampling of Bandlimited Graph Signals from Local Measurements
Viaarxiv icon

Graph Propagation Transformer for Graph Representation Learning

Add code
May 19, 2023
Figure 1 for Graph Propagation Transformer for Graph Representation Learning
Figure 2 for Graph Propagation Transformer for Graph Representation Learning
Figure 3 for Graph Propagation Transformer for Graph Representation Learning
Figure 4 for Graph Propagation Transformer for Graph Representation Learning
Viaarxiv icon