Hannaneh Hajishirzi

Shammie

Establishing Task Scaling Laws via Compute-Efficient Model Ladders

Dec 05, 2024
Via arXiv

TÜLU 3: Pushing Frontiers in Open Language Model Post-Training

Nov 22, 2024
Via arXiv

OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs

Nov 21, 2024
Via arXiv

Hybrid Preferences: Learning to Route Instances for Human vs. AI Feedback

Oct 24, 2024
Via arXiv

ComPO: Community Preferences for Language Model Personalization

Oct 21, 2024
Via arXiv

How Many Van Goghs Does It Take to Van Gogh? Finding the Imitation Threshold

Oct 19, 2024
Via arXiv

Merge to Learn: Efficiently Adding Skills to Language Models with Model Merging

Oct 16, 2024
Via arXiv

ActionAtlas: A VideoQA Benchmark for Domain-specialized Action Recognition

Oct 08, 2024
Via arXiv

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Sep 25, 2024
Via arXiv

OLMoE: Open Mixture-of-Experts Language Models

Sep 03, 2024
Via arXiv