Picture for Xia Song

Xia Song

Group Preference Alignment: Customized LLM Response Generation from In-Situ Conversations

Add code
Mar 11, 2025
Viaarxiv icon

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

Add code
Mar 03, 2025
Viaarxiv icon

GenTool: Enhancing Tool Generalization in Language Models through Zero-to-One and Weak-to-Strong Simulation

Add code
Feb 26, 2025
Viaarxiv icon

POROver: Improving Safety and Reducing Overrefusal in Large Language Models with Overgeneration and Preference Optimization

Add code
Oct 16, 2024
Figure 1 for POROver: Improving Safety and Reducing Overrefusal in Large Language Models with Overgeneration and Preference Optimization
Figure 2 for POROver: Improving Safety and Reducing Overrefusal in Large Language Models with Overgeneration and Preference Optimization
Figure 3 for POROver: Improving Safety and Reducing Overrefusal in Large Language Models with Overgeneration and Preference Optimization
Figure 4 for POROver: Improving Safety and Reducing Overrefusal in Large Language Models with Overgeneration and Preference Optimization
Viaarxiv icon

Scaling Laws for Multilingual Language Models

Add code
Oct 15, 2024
Figure 1 for Scaling Laws for Multilingual Language Models
Figure 2 for Scaling Laws for Multilingual Language Models
Figure 3 for Scaling Laws for Multilingual Language Models
Figure 4 for Scaling Laws for Multilingual Language Models
Viaarxiv icon

On The Adaptation of Unlimiformer for Decoder-Only Transformers

Add code
Oct 02, 2024
Figure 1 for On The Adaptation of Unlimiformer for Decoder-Only Transformers
Figure 2 for On The Adaptation of Unlimiformer for Decoder-Only Transformers
Figure 3 for On The Adaptation of Unlimiformer for Decoder-Only Transformers
Figure 4 for On The Adaptation of Unlimiformer for Decoder-Only Transformers
Viaarxiv icon

Scaling Optimal LR Across Token Horizon

Add code
Sep 30, 2024
Viaarxiv icon

WildFeedback: Aligning LLMs With In-situ User Interactions And Feedback

Add code
Aug 28, 2024
Figure 1 for WildFeedback: Aligning LLMs With In-situ User Interactions And Feedback
Figure 2 for WildFeedback: Aligning LLMs With In-situ User Interactions And Feedback
Figure 3 for WildFeedback: Aligning LLMs With In-situ User Interactions And Feedback
Figure 4 for WildFeedback: Aligning LLMs With In-situ User Interactions And Feedback
Viaarxiv icon

Efficient LLM Training and Serving with Heterogeneous Context Sharding among Attention Heads

Add code
Jul 25, 2024
Figure 1 for Efficient LLM Training and Serving with Heterogeneous Context Sharding among Attention Heads
Figure 2 for Efficient LLM Training and Serving with Heterogeneous Context Sharding among Attention Heads
Figure 3 for Efficient LLM Training and Serving with Heterogeneous Context Sharding among Attention Heads
Figure 4 for Efficient LLM Training and Serving with Heterogeneous Context Sharding among Attention Heads
Viaarxiv icon

The Hitchhiker's Guide to Human Alignment with *PO

Add code
Jul 21, 2024
Viaarxiv icon