Shaohan Huang

Multimodal Latent Language Modeling with Next-Token Diffusion

Dec 11, 2024

RedStone: Curating General, Code, Math, and QA Data for Large Language Models

Dec 04, 2024

On Domain-Specific Post-Training for Multimodal Large Language Models

Nov 29, 2024

MH-MoE: Multi-Head Mixture-of-Experts

Nov 26, 2024

Textual Aesthetics in Large Language Models

Nov 05, 2024

Instruction Pre-Training: Language Models are Supervised Multitask Learners

Jun 20, 2024

MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning

May 20, 2024

You Only Cache Once: Decoder-Decoder Architectures for Language Models

May 08, 2024

Multimodal Large Language Model is a Human-Aligned Annotator for Text-to-Image Generation

Apr 23, 2024

Multi-Head Mixture-of-Experts

Apr 23, 2024