Shuohuan Wang

Mixture of Hidden-Dimensions Transformer

Dec 10, 2024
Via arXiv

MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions

Oct 03, 2024
Via arXiv

Upcycling Instruction Tuning from Dense to Mixture-of-Experts via Parameter Merging

Oct 02, 2024
Via arXiv

NACL: A General and Effective KV Cache Eviction Framework for LLMs at Inference Time

Aug 07, 2024
Via arXiv

DHA: Learning Decoupled-Head Attention from Transformer Checkpoints via Adaptive Heads Fusion

Jun 03, 2024
Via arXiv

HFT: Half Fine-Tuning for Large Language Models

Apr 29, 2024
Via arXiv

Dual Modalities of Text: Visual and Textual Generative Pre-training

Apr 17, 2024
Via arXiv

On Training Data Influence of GPT Models

Apr 11, 2024
Via arXiv

Tool-Augmented Reward Modeling

Oct 02, 2023
Via arXiv

ERNIE-Music: Text-to-Waveform Music Generation with Diffusion Models

Feb 09, 2023
Via arXiv