Picture for Xia Song

Xia Song

POROver: Improving Safety and Reducing Overrefusal in Large Language Models with Overgeneration and Preference Optimization

Add code
Oct 16, 2024
Figure 1 for POROver: Improving Safety and Reducing Overrefusal in Large Language Models with Overgeneration and Preference Optimization
Figure 2 for POROver: Improving Safety and Reducing Overrefusal in Large Language Models with Overgeneration and Preference Optimization
Figure 3 for POROver: Improving Safety and Reducing Overrefusal in Large Language Models with Overgeneration and Preference Optimization
Figure 4 for POROver: Improving Safety and Reducing Overrefusal in Large Language Models with Overgeneration and Preference Optimization
Viaarxiv icon

Scaling Laws for Multilingual Language Models

Add code
Oct 15, 2024
Figure 1 for Scaling Laws for Multilingual Language Models
Figure 2 for Scaling Laws for Multilingual Language Models
Figure 3 for Scaling Laws for Multilingual Language Models
Figure 4 for Scaling Laws for Multilingual Language Models
Viaarxiv icon

On The Adaptation of Unlimiformer for Decoder-Only Transformers

Add code
Oct 02, 2024
Figure 1 for On The Adaptation of Unlimiformer for Decoder-Only Transformers
Figure 2 for On The Adaptation of Unlimiformer for Decoder-Only Transformers
Figure 3 for On The Adaptation of Unlimiformer for Decoder-Only Transformers
Figure 4 for On The Adaptation of Unlimiformer for Decoder-Only Transformers
Viaarxiv icon

Scaling Optimal LR Across Token Horizon

Add code
Sep 30, 2024
Viaarxiv icon

WildFeedback: Aligning LLMs With In-situ User Interactions And Feedback

Add code
Aug 28, 2024
Figure 1 for WildFeedback: Aligning LLMs With In-situ User Interactions And Feedback
Figure 2 for WildFeedback: Aligning LLMs With In-situ User Interactions And Feedback
Figure 3 for WildFeedback: Aligning LLMs With In-situ User Interactions And Feedback
Figure 4 for WildFeedback: Aligning LLMs With In-situ User Interactions And Feedback
Viaarxiv icon

Efficient LLM Training and Serving with Heterogeneous Context Sharding among Attention Heads

Add code
Jul 25, 2024
Viaarxiv icon

The Hitchhiker's Guide to Human Alignment with *PO

Add code
Jul 21, 2024
Viaarxiv icon

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Add code
Apr 23, 2024
Figure 1 for Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Figure 2 for Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Figure 3 for Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Figure 4 for Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Viaarxiv icon

Interpretable User Satisfaction Estimation for Conversational Systems with Large Language Models

Add code
Mar 19, 2024
Viaarxiv icon

GenSERP: Large Language Models for Whole Page Presentation

Add code
Feb 22, 2024
Viaarxiv icon