Picture for Corby Rosset

Corby Rosset

Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents

Add code
Feb 19, 2025
Viaarxiv icon

LLM-Rubric: A Multidimensional, Calibrated Approach to Automated Evaluation of Natural Language Texts

Add code
Dec 31, 2024
Viaarxiv icon

AgentInstruct: Toward Generative Teaching with Agentic Flows

Add code
Jul 03, 2024
Figure 1 for AgentInstruct: Toward Generative Teaching with Agentic Flows
Figure 2 for AgentInstruct: Toward Generative Teaching with Agentic Flows
Figure 3 for AgentInstruct: Toward Generative Teaching with Agentic Flows
Figure 4 for AgentInstruct: Toward Generative Teaching with Agentic Flows
Viaarxiv icon

Exploratory Preference Optimization: Harnessing Implicit Q*-Approximation for Sample-Efficient RLHF

Add code
May 31, 2024
Figure 1 for Exploratory Preference Optimization: Harnessing Implicit Q*-Approximation for Sample-Efficient RLHF
Figure 2 for Exploratory Preference Optimization: Harnessing Implicit Q*-Approximation for Sample-Efficient RLHF
Viaarxiv icon

MS MARCO Web Search: a Large-scale Information-rich Web Dataset with Millions of Real Click Labels

Add code
May 13, 2024
Figure 1 for MS MARCO Web Search: a Large-scale Information-rich Web Dataset with Millions of Real Click Labels
Figure 2 for MS MARCO Web Search: a Large-scale Information-rich Web Dataset with Millions of Real Click Labels
Figure 3 for MS MARCO Web Search: a Large-scale Information-rich Web Dataset with Millions of Real Click Labels
Figure 4 for MS MARCO Web Search: a Large-scale Information-rich Web Dataset with Millions of Real Click Labels
Viaarxiv icon

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Add code
Apr 23, 2024
Figure 1 for Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Figure 2 for Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Figure 3 for Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Figure 4 for Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Viaarxiv icon

Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences

Add code
Apr 04, 2024
Figure 1 for Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences
Figure 2 for Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences
Figure 3 for Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences
Figure 4 for Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences
Viaarxiv icon

Researchy Questions: A Dataset of Multi-Perspective, Decompositional Questions for LLM Web Agents

Add code
Feb 27, 2024
Figure 1 for Researchy Questions: A Dataset of Multi-Perspective, Decompositional Questions for LLM Web Agents
Figure 2 for Researchy Questions: A Dataset of Multi-Perspective, Decompositional Questions for LLM Web Agents
Figure 3 for Researchy Questions: A Dataset of Multi-Perspective, Decompositional Questions for LLM Web Agents
Figure 4 for Researchy Questions: A Dataset of Multi-Perspective, Decompositional Questions for LLM Web Agents
Viaarxiv icon

Orca-Math: Unlocking the potential of SLMs in Grade School Math

Add code
Feb 16, 2024
Viaarxiv icon

Axiomatic Preference Modeling for Longform Question Answering

Add code
Dec 02, 2023
Viaarxiv icon