Vishrav Chaudhary

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

Mar 03, 2025

POROver: Improving Safety and Reducing Overrefusal in Large Language Models with Overgeneration and Preference Optimization

Oct 16, 2024

Scaling Laws for Multilingual Language Models

Oct 15, 2024

Scaling Optimal LR Across Token Horizon

Sep 30, 2024

GRIN: GRadient-INformed MoE

Sep 18, 2024

Efficient LLM Training and Serving with Heterogeneous Context Sharding among Attention Heads

Jul 25, 2024

The Hitchhiker's Guide to Human Alignment with *PO

Jul 21, 2024

sPhinX: Sample Efficient Multilingual Instruction Fine-Tuning Through N-shot Guided Prompting

Jul 16, 2024

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Apr 23, 2024

ODIN: A Single Model for 2D and 3D Perception

Jan 04, 2024